We are seeking a GPU Performance Engineer to optimize our model serving stack and GPU infrastructure, with a target of 5-10x speedups. The ideal candidate combines expertise in GPU profiling tools, CUDA programming, and GPU architecture with a strong track record of delivering significant performance improvements.
Requirements
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field
- 5+ years of systems programming experience, including 3+ years focused on GPU optimization
- Expert-level proficiency with GPU profiling tools (Nsight Systems, Nsight Compute, or the legacy nvprof)
- Strong CUDA programming skills, including experience developing kernels for production
- Deep understanding of GPU architecture (memory hierarchy, SMs, warps)
- Demonstrated track record of significant performance improvements (on the order of 5-10x)
- Experience with Python and C++ in production environments