Job Summary
6+ Years of strong experience in SQL Server
• 3+ Experience in PySpark and big data processing
• 3+ Working knowledge of AWS (S3, Glue, EMR, Redshift or similar services)
• Good hands-on Experience with Azure Data Factory / Databricks
• Good hands-on experience in Python
• Good hands-on experience of data warehousing and data modelling
• Good hands-on experience of ETL processes and data pipelines
• Basic knowledge of AI/ML concepts (data preprocessing, model basics)
• Exposure to machine learning tools/libraries (like Pandas, Scikit-learn)
• Familiarity with Agile way of working
Key Responsibilities
2. Design and develop efficient and reliable etl processes for large datasets.
3. Collaborate with cross functional teams to understand business requirements and translate them into technical solutions.
4. Optimize data workflows, troubleshoot issues, and ensure data quality and integrity.
5. Implement best practices for data security, governance, and compliance.
6. Provide technical guidance, mentoring, and support to junior team members.
7. Stay uptodate with the latest trends and technologies in data engineering and analytics.
Skill Requirements
2. Strong skills in writing complex sql queries for data manipulation and analysis.
3. Experience with oracle pl/sql for database development and management.
4. Proficient in python programming language for automation and scripting tasks.
5. Solid understanding of data warehousing concepts, etl processes, and data modeling.
6. Ability to work in a fast paced environment and manage multiple priorities effectively.
7. Excellent problem-solving skills and attention to detail.
8. Strong communication and interpersonal skills for effective collaboration with team members and stakeholders.