The NVIDIA Math Libraries team is looking for a senior engineer to join their development efforts in kernel generation for AI and HPC, targeting matrix operations, JITing and fusions.
Requirements
- PhD, Master’s, or Bachelor's degree in Computer Science, Applied Math, or related science or engineering field of study (or equivalent experience)
- 8+ years of experience in designing, developing, testing, maintenance, and performance optimization of HPC software using C++
- Strong fundamentals in kernel generation and composable library design for linear algebra
- Leadership skills in driving software development projects
- Strong collaboration, communication, and documentation habits
- Kernel generation. JIT focus/experience desired
- Experience with parallel programming, ideally using CUDA, MPI, OpenMP, OpenACC, pthreads
- Good understanding of Machine Learning and Deep Learning technologies as well as knowledge of GPU (preferred) or CPU hardware architecture
- Experience with low level programming using assembly for performance optimization and operator fusion is a huge plus
- Experience with agile software development practices using project management tools such as JIRA
- A scripting language, preferably Python
Benefits
- Competitive salary package
- Benefits