Description

  • Bachelors or higher degree in Computer Science or equivalent major or equivalent experience
  • 9+ years professional software engineering experience
  • 3+ years specialized experience in AI/ML infrastructure, e.g., enabling and optimizing distributed training for scaling large ML models
  • Experience with Python, with proficiency in frameworks such as PyTorch, TensorFlow, or similar (with the goal of optimizing and supporting infrastructure required to utilize such frameworks)
  • Experience with Infrastructure as Code (IaC) used to design, build, and manage the infrastructure required to support AI/ML platforms and workloads such as Terraform (preferred), Cloud Formation, Pulumi, etc.
  • Experience with distributed computing, GPU computing, and cloud environments – GCP (preferred), AWS, or Azure.
  • Comfortable working in highly ambiguous and dynamic environments

Education

Bachelor's degree