Job Summary
To be responsible for managing technology in complex projects ,providing technical guidance and ensuring successful delivery of solutions.
Data Engineer - Databricks Job Description
The Data Engineer - Databricks will be responsible for building and optimizing our data pipelines, architectures, and data sets. He will work closely with data scientists, analysts, and other engineers to support their data needs and maximize the value of our data processing capabilities.
Responsibilities
• Design, develop, and maintain scalable and robust data pipelines on Databricks.
• Collaborate with data scientists and analysts to understand data requirements and deliver solutions.
• Optimize and troubleshoot existing data pipelines for performance and reliability.
• Ensure data quality and integrity across various data sources.
• Implement data security and compliance best practices.
• Monitor data pipeline performance and conduct necessary maintenance and updates.
• Document data pipeline processes and technical specifications.
Qualifications
• A bachelor’s degree in computer science, Engineering, or a closely related discipline is required.
• 5+ years of experience in data engineering.
• Proficiency with Databricks, Python and Spark.
• Strong SQL skills and experience with relational databases.
• Experience with big data technologies (e.g., Hadoop, Kafka).
• Knowledge of data warehousing concepts and ETL processes.
• Excellent problem-solving and analytical skills.
Key Responsibilities
2. To conduct comprehensive code reviews, establish and oversee quality assurance processes, performance optimization , implementation of best practices and coding standards to ensure successful delivery of complex projects.
3. To ensure process compliance in the assigned module| and participate in technical discussions/review as a technical consultant for feasibility study (technical alternatives, best packages, supporting architecture best practices, technical risks, breakdown into components, estimations).
4. To collaborate with stakeholders to define project scope, objectives, deliverables and accordingly prepare and submit status reports for minimizing exposure & closure of escalations.