Description

Job Title - MLOps Platform Engineer

Location - Bangalore

Notice Period - Immediate joiner / 30 days

Responsibilities:

Manage and monitor MLOps infrastructure
Provision infrastructure using OpenShift, Ansible, or Terraform
Deploy models for inference at production scale
Build enterprise-wide artificial intelligence infrastructure
Design and implement MLOps processes and pipelines
Review, containerize, deploy, and version data science models, and monitor their quality
Configure, manage, and administer NVIDIA GPU containers and GPUDirect Storage
Provision, automate, support, and manage a wide spectrum of Dell Technologies products and services

Essential Requirements:

Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
12+ years of software development experience
5+ years of extensive experience with AIOps/MLOps platforms such as MLflow, cnvrg.io, Domino, Kubeflow, H2O.ai, run.ai, or MLOps on cloud
Work experience with Dell products such as PowerStore, PowerScale, and Networking, plus data center skills
Strong experience with Docker, Kubernetes, and IaC platforms such as Ansible, Terraform, and OpenShift
Experience with NVIDIA Bright Cluster Manager and DGX H100/A100 administration
Knowledge of NVIDIA AI Enterprise (NVIDIA Metropolis, Riva, NeMo, Morpheus, Merlin, etc.)
Automation skills using Python and shell scripting
Experience with cloud-based data platforms such as AWS, Google Cloud, or Azure

Desired Requirements

Ability to coordinate and communicate with data engineers and data scientists
Knowledge of frameworks such as scikit-learn, Keras, PyTorch, TensorFlow, etc.
Proficient in Python, Pandas, and other commonly used Python packages for data science
Exposure to Generative AI models (OpenAI or equivalent)

Education

Bachelor's or Master's degree in Computer Science, Engineering, or a related field