Job Summary
Key Responsibilities
2. Apply DevOps practices with Jenkins, GitLab CI/CD, CircleCI, and GitHub Actions to streamline CI/CD for machine learning workflows and monitor pipeline health.
3. Utilize infrastructure-as-code tools such as Terraform and AWS CloudFormation to provision and manage scalable cloud resources for ML workloads.
4. Integrate monitoring solutions like Prometheus, Grafana, ELK Stack, and Fluentd to track model performance, system metrics, and log analytics in production environments.
5. Ensure process compliance by using Git, GitHub, GitLab, and Bitbucket for version control and code management within the team.
6. Participate in technical discussions and feasibility studies to evaluate technical alternatives and support architecture best practices for ML Ops solutions.
7. Prepare and submit status reports to highlight progress, minimize risks, and support project closure activities.
Skill Requirements
2. Solid Understanding Of Devops Tools Such As Jenkins, Gitlab Ci/Cd, Circleci, And Github Actions For Workflow Automation.
3. Solid Experience With Python For Scripting, Data Processing, And Ml Pipeline Development.
4. Solid Knowledge Of Infrastructureascode Tools Like Terraform And Aws Cloudformation For Cloud Resource Management.
5. Solid Skills In Monitoring And Logging Tools Including Prometheus, Grafana, Elk Stack, And Fluentd.
6. Solid Familiarity With Version Control Systems Such As Git, Github, Gitlab, And Bitbucket.
7. Solid Ability To Participate In Technical Discussions And Support Process Compliance Within The Team.
Other Requirements
2. AWS Certified DevOps Engineer
3. - Google Professional Machine Learning Enginee