Description

Key Responsibilities

Migrate and refactor existing deep learning codebases to production-ready states

Improve overall code quality, reliability, and maintainability

Expand unit and integration test coverage

Optimize backend workflows and data pipelines

Contribute to dashboard creation, monitoring, and alerts

Support distributed training, efficient data loading, and scaling of deep learning models

Collaborate with cross-functional teams to transform research into product

Participate in on-call support rotation for engineering infrastructure

Required Qualifications

6+ years of experience in Python, working with large codebases

3+ years of hands-on experience training deep learning models using PyTorch

Expertise in CI/CD practices, unit testing, and integration testing

Proven experience in code optimization, especially for ML/DL workloads

Familiarity with GPU programming and backend ML systems

Strong communication and problem-solving skills

Ability to work independently and in collaborative, cross-functional environments

Preferred Qualifications

Experience maturing ML systems: evaluation metrics, scalable tooling, and pipeline development

Public contributions to GitHub or open-source projects

Experience with CPU/GPU code optimization

Master’s degree or higher in Computer Vision, Machine Learning, or a related field

Education

Any Gradute