
Job description
NVIDIA is looking for a Software Engineer to work on bring-up, triage, benchmarking, analysis, and optimization of distributed training and inference workloads across NVIDIA GPU platforms at the largest scales.
Bring up, validate, and debug large-scale AI clusters, infrastructure, and end-to-end workloads. Bring up, tune, and benchmark AI pre-training, post-training, and inference workloads using PyTorch, NeMo / Megatron, TensorRT-LLM, and adjacent NVIDIA AI software stacks.
3+ years of experience developing software for AI, HPC, or systems-level applications. Hands-on experience with multi-GPU or multi-node workloads and CUDA-aware distributed execution. Excellent analytical, debugging, and communication skills, and a collaborative approach across teams.
Company

Tech, Software & IT Services
NVIDIA, founded in 1993, is a leading full-stack computing company that designs GPUs and related technologies. Its products power a wide spectrum of applications, from high-performance gaming and professional graphics to AI, deep learning, and autonomous vehicle systems, while its data-center solutions enable large-scale supercomputing and virtualization. NVIDIA's GPU architecture has driven the growth of PC gaming, catalyzed the modern AI era, and continues to shape emerging fields such as the metaverse. The company's integrated hardware-software ecosystem positions NVIDIA as a key enabler of next-generation computing across automotive, robotics, and enterprise sectors.