NVIDIA is seeking an experienced Solutions Architect to be a trusted technical advisor, bridging design to deployment of large-scale AI / HPC GPU infrastructure.
Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, Mathematics, or another engineering field, or equivalent experience.
- 10+ years of Solution Engineering (or similar Sales Engineering, Cloud Engineering, Solution Architecture) including experience working directly with partners and customers.
- Experience crafting and deploying large-scale cluster environments, with hands-on experience designing, developing, and delivering distributed cloud architectures.
- Strong fundamentals in programming, optimizations and software design, especially in Python and Deep Learning frameworks such as PyTorch and TensorFlow.
- Practical expertise fine-tuning and deploying models, and integrating software application stacks, libraries, and frameworks to drive consumption from GPU platforms.
- Motivation and skills to own and drive complex multi-disciplinary technical engagements with customers and cross-functional teams throughout the full customer lifecycle.
- Efficient time management and the ability to balance multiple tasks. Excellent presentation, communication, and collaboration skills.
- Self-starter with a passion for growth, continuous learning, and sharing insights.
- Practical experience with NVIDIA GPUs, software libraries, frameworks, and foundation models, such as NVIDIA Nemotron, NVIDIA NeMo Framework, NVIDIA Dynamo, NeMo Retriever, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM, and NVIDIA CUDA-X.
- Hands-on expertise with scaled AI cloud environments (e.g., AWS, Azure, GCP) and on-premises / hybrid infrastructure, in particular for inference and training workloads.
- Familiarity with NVIDIA hardware (such as GPUs, networking, storage) and systems technology such as NCCL, DCGM, UFM, Mission Control, Base Command Manager.
- Proficiency with large-scale AI model training / deployment encompassing GPU systems, performance testing, AI benchmarking, and fine-tuning, with a strong focus on MLOps and cluster orchestration (SLURM, K8s, orchestrator, load balancing, cloud architecture).
- Experience working with enterprise developers and strong customer-facing skills.
Benefits
- Eligible for equity
- Benefits