Description

Key Responsibilities
Design, build, and maintain automated deployment pipelines for machine learning models and applications.
Collaborate with data scientists, software engineers, and other stakeholders to deploy machine learning models into production environments.
Develop and implement CI/CD processes to streamline and automate the deployment and testing of machine learning solutions.
Manage and optimize cloud-based infrastructure on both Azure and AWS platforms.
Utilize Terraform for infrastructure as code to ensure scalable and reliable systems.
Monitor and troubleshoot production systems, ensuring high availability and performance of machine learning applications.
Stay up-to-date with emerging technologies and industry best practices related to machine learning operations.

Qualifications
Bachelor’s or Master’s degree in Computer Science, Information Technology, or related field.
Proven experience in managing machine learning operations in a production environment.
Strong expertise in Azure and AWS cloud platforms, with hands-on experience in deploying and managing resources in both environments.
Proficiency in infrastructure as code tools, particularly Terraform.
Experience in designing and implementing CI/CD pipelines for machine learning applications.
Strong scripting and automation skills with Python or Shell Scripting
Excellent problem-solving skills and the ability to troubleshoot complex systems.
Solid understanding of software development lifecycle and version control systems.
Excellent communication skills and the ability to work collaboratively in a team environment

Education

Bachelor's or Master's degrees