Machine Learning Engineer, ML Platform
1 month ago
Role Overview:
We are seeking a skilled professional to join our Machine Learning Platform team, focused on building advanced tools and infrastructure to support machine learning initiatives. The role involves designing and implementing scalable AI infrastructure, developing observability solutions, and fostering the adoption of distributed computing frameworks within the organization.
Key Responsibilities:
- Develop AI Infrastructure Solutions: Collaborate with the ML Platform team to design and implement scalable infrastructure for distributed data processing and model training. Utilize GitOps practices to maintain reproducibility across Kubernetes clusters.
- Build Monitoring and Alerting Systems: Develop and integrate observability solutions using tools like Datadog, Prometheus, and Grafana. Create runbooks and DevOps guides to streamline operations.
- Support Distributed Computing Framework Adoption: Assist data science teams in leveraging distributed computing frameworks such as Ray for scalable workloads. Advocate for best practices and provide technical support for cluster utilization.
Qualifications:
- Strong expertise in MLOps, with experience in distributed computing frameworks (e.g., Ray, Dask, Modin, Beam, or Horovod).
- Proficient in Python and familiar with machine learning tools and ecosystems.
- Hands-on experience with Kubernetes, including GitOps tools (e.g., ArgoCD) and configuration management solutions like Helm and Kustomize.
- Solid DevOps background, with knowledge of Infrastructure as Code (e.g., Terraform).
- Excellent written and verbal communication skills to support cross-functional collaboration.
This role offers the opportunity to work on innovative projects, drive machine learning infrastructure advancements, and contribute to the organization's data science capabilities.
Next Step:
Prepare your updated resume, including your current salary package with a full breakdown (base, incentives, annual wage supplement, etc.) and your expected package. Click 'Apply here' to submit your resume, or email it to [email protected].
Susmita Sahu
EA License No: 91C2918
Personnel Registration Number: R23114076
Official account of Jobstore.