Job Summary
Key Responsibilities
2. Managing data pipelines, etl processes, data lakes, and warehouses within snowflake and azure data factory.
3. Collaborating with cross functional teams to gather business requirements and translate them into technical solutions.
4. Troubleshooting and resolving any issues related to snowflake, azure data factory (adf), and data bricks performance and functionality.
5. Implementing security measures and access controls to protect sensitive data stored in these platforms.
6. Monitoring system performance, identifying bottlenecks, and implementing optimizations to improve overall efficiency.
Skill Requirements
Must have skills :- Required Skills and Qualifications Bachelor’s degree in Computer Science, Information Systems, Engineering, Data Science, or a related field.Minimum 10+ years of experience in data engineering, cloud data platforms, or enterprise analytics solutions.Strong hands-on experience with Microsoft Azure data and analytics services, including Azure Databricks, Azure Data Factory, Azure Data Lake Storage (ADLS Gen2), Azure SQL, and Azure Functions.Strong experience designing and implementing ETL/ELT pipelines and enterprise data integration solutions.Strong command of both relational (SQL Server, PostgreSQL) and NoSQL database systems is expectedUnderstanding of Lakehouse and Medallion architecture principles for enterprise analytics platforms.Hands-on experience with Apache Spark and PySpark for distributed data processing and transformation.Advanced skills in Python are required, including use of data manipulation libraries (Pandas, NumPy) and API integrationThe candidate should demonstrate experience in designing dimensional data models (star/snowflake schemas)Ability to design and consume RESTful APIs to integrate data from upstream systemsKnowledge of visualization tools such as Power BI or Tableau is expected to collaborate effectively with dashboard developers and translate business requirements into well-structured data models that support visualization layers. Good to have skills : - Knowledge of how to structure and prepare data for ML model consumption, including feature engineering, data labeling pipelines, and integration with AI services (e.g., Azure OpenAI, Azure ML)Experience working in large enterprise or regulated environments with strong security and compliance requirements is preferred.Recommended CertificationsMicrosoft Certified: Azure Data Engineer AssociateMicrosoft Certified: Fabric Analytics Engineer AssociateMicrosoft Certified: Azure Solutions Architect ExpertMicrosoft Certified: Azure Developer AssociateDatabricks Certified Data Engineer Associate or ProfessionalPersonal CompetenciesExceptional attention to detail: data accuracy is non-negotiableProactive communication: ability to flag data issues, pipeline risks, or scope concerns early and clearlyStructured, self-directed working style: ability to manage concurrent workstreams across initiative phases with minimal supervisionCollaborative approach: the role requires productive engagement with data owners across six institutional teams from three WBG institutionsComfort with ambiguity: the ability to work constructively with incomplete data landscapes while building toward clarity