NVIDIA is seeking an experienced Cloud Solution Architect to help customers adopt GPU hardware and Software, as well as build and deploy Machine Learning (ML), Deep Learning (DL), data analytics solutions on various Cloud Computing Platforms.
Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Statistics, Physics, or other Engineering fields or equivalent experience.
- 3+ Years in Solutions Architecture with a proven track record of moving AI inference from POC to production in cloud computing environments including AWS, GCP, or Azure
- 3+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow
- Excellent knowledge of the theory and practice of LLM and DL inference
- Strong fundamentals in programming, optimizations, and software design, especially in Python
- Experience with containerization and orchestration technologies like Docker and Kubernetes, monitoring, and observability solutions for AI deployments
- Knowledge of Inference technologies - NVIDIA NIM, TensorRT-LLM, Dynamo, Triton Inference Server, vLLM, etc
- Proficiency in problem-solving and debugging skills in GPU environments
- Excellent presentation, communication and collaboration skills
Benefits