Job Summary
Key Responsibilities
2. Design and implement distributed data processing workflows with Apache Spark and Kafka to support real-time and batch ML model operations.
3. Develop robust data pipelines and feature engineering processes using pandas, NumPy, and Apache Airflow to optimize model performance and data quality.
4. Oversee the development and validation of machine learning models for NLP, deep learning, and time series forecasting, applying advanced techniques and frameworks such as XGBoost and LightGBM.
5. Define and enforce architectural standards for model storage, versioning, and reproducibility using MySQL, PostgreSQL, and Databricks.
6. Mentor team members on AI/ML best practices and emerging technologies, ensuring continuous skill enhancement and technical excellence.
7. Collaborate with internal stakeholders to gather requirements and translate business needs into technical specifications for AI/ML solutions.
8. Evaluate and integrate new tools and technologies to maintain solution relevance and meet evolving client requirements.
9. Architect and implement RESTful API integrations to enable seamless communication between AI/ML components and external systems, ensuring scalable, secure, and efficient data exchange across diverse enterprise environments.
Skill Requirements
2. Excellent knowledge of distributed data processing with Apache Spark and Kafka.
3. Advanced skills in data engineering, feature extraction, and pipeline automation using pandas, NumPy, and Apache Airflow.
4. Solid understanding of classical machine learning, deep learning, NLP, and time series forecasting techniques.
5. In-depth experience with relational databases such as MySQL and PostgreSQL for data management and model storage.
6. Strong ability to architect scalable solutions integrating multiple data sources and ML frameworks.
7. Excellent communication and mentoring skills to guide technical teams.
Other Requirements
2. AWS Certified Machine Learning - Specialty
3. Databricks Certified Data Engineer Professional (optional but valuable)