Job Description:
Skilled Machine Learning Engineer with strong experience in AWS cloud services, and proven ability in integration and development of ML solutions. The ideal candidate will collaborate closely with data scientists, developers, and DevOps teams to design, deploy, and maintain scalable machine learning systems in production.
Key Responsibilities:
1. Design, develop, and deploy machine learning models in production environments using AWS services (eg, SageMaker, Lambda, EC2, S3, EKS, etc.).
2. Build scalable data pipelines for training and inference using AWS Glue, Step Functions, and other ETL tools.
3. Integrate ML models into existing systems and APIs, ensuring seamless operation across multiple services and platforms.
4. Collaborate with cross-functional teams to gather requirements and implement ML solutions aligned with business goals.
5. Optimize and monitor deployed models for performance, latency, and cost-effectiveness using tools such as CloudWatch, CloudTrail, and Prometheus.
6. Write clean, maintainable code in Python (and optionally Java/Scala) following best software engineering practices.
7. Automate model training, validation, and deployment workflows using CI/CD pipelines (eg, CodePipeline, Jenkins, GitHub Actions).
8. Ensure security and compliance in data handling and model deployment ...