Description

About the job
Key Skills: Computer architecture, data structures, system software, and machine learning, C/C++ and Python, Linux environment

Roles and Responsibilities:

Develop, optimize, and maintain kernel-level software for AI and machine learning workloads.
Implement algorithms for specialized hardware, including FPGAs, DSPs, GPUs, and AI accelerators using libraries such as CuDA.
Design and implement operators used in ML workloads, including GEMMs, convolutions, BLAS, softmax, layer normalization, and pooling operations.
Develop software for embedded SIMD vector processors such as Tensilica and ensure high-performance execution of ML operators.
Contribute to software design, architecture, and development of core components for AI compute systems.
Debug, profile, and optimize performance-critical components at the kernel and system levels.
Skills Required:

7+ years of industry experience or PhD in Computer Engineering, Math, Physics or related degree with 5+ years of industry experience.
Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals.
Proficient in C/C++ and Python development in Linux environment and using standard development tools.
Experience implementing algorithms in high-level languages such as C/C++ and Python.
Experience implementing algorithms for specialized hardware such as FPGAs, DSPs, GPUs, and AI accelerators using libraries such as CuDA, etc.
Experience in implementing operators commonly used in ML workloads - GEMMs, Convolutions, BLAS, SIMD operators for operations like softmax, layer normalization, pooling, etc.
Experience with development for embedded SIMD vector processors such as Tensilica.
Self-motivated team player with a strong sense of ownership and leadership.
Preferred:
Prior startup, small team, or incubation experience.
Experience with ML frameworks such as TensorFlow and or PyTorch.
Experience working with ML compilers and algorithms, such as MLIR, LLVM, TVM, Glow, etc.
Experience with a deep learning framework (such as PyTorch or Tensorflow) and ML models for CV, NLP, or Recommendation.
Work experience at a cloud provider or AI compute / sub-system company.
Education: MS in Computer Engineering, Math, Physics or related

Education

MS in Computer Engineering