Description

Responsibilities:

  • Identify and resolve performance issues and bugs related to CUDA implementations.
  • Analyze and optimize algorithm performance for maximum efficiency through parallelization.
  • Collaborate with cross-functional teams to integrate CUDA optimizations into existing software architecture.
  • Stay updated with the latest trends, tools, and best practices in CUDA and parallel computing.
  • Develop and maintain code for GPU-accelerated applications using CUDA.
  • Implement code optimizations for memory usage and computational efficiency.


Required Qualifications:

  • Bachelor's degree in Computer Science, Computer Engineering, or related fields.
  • Minimum of 5 years of software development experience, with at least 2 years specifically in CUDA.
  • Proven experience in optimizing performance and debugging CUDA applications.
  • Ability to work both independently and collaboratively within a team.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication skills to effectively work with technical and non-technical teams.


Knowledge and Skills:

  • Proficiency in C/C++ programming languages.
  • Expertise in CUDA Performance Optimization and best practices.
  • Experience with CUDA version 12.0 or above (preferably 12.3).
  • Familiarity with frameworks and libraries such as cuBLAS, cuDNN, and Thrust.
  • Strong understanding of GPU architectures and resource management.
  • Knowledge of parallel computing patterns, memory management, and multi-threading.
  • Experience with Large Language Models (LLMs) and their deployment on GPU infrastructure

Key Skills
Education

Bachelor's Degree