GM is looking for a Senior AI/ML Capacity and Performance Engineer to support the development of autonomous vehicles. The role involves strategic infrastructure development, performance optimization, cross-functional collaboration, and proactive system scaling.
Requirements
- 5+ years of professional experience in high-scale infrastructure or ML systems
- Bachelor’s Degree in Computer Science, a related technical field, or equivalent practical experience
- Expert-level coding skills in Python and the ability to architect/debug within the PyTorch ecosystem
- Proven track record of resolving performance issues within large-scale distributed production environments
- Deep understanding of distributed systems, specifically modern ML system design and high-performance computing (HPC)
- Hands-on experience with Kubernetes for orchestrating complex workloads
- Technical proficiency with Nvidia DCGM, nvidia-smi, and Grafana for real-time telemetry and observability
- Extensive experience working within major cloud ecosystems (AWS, GCP, or Azure)
Benefits
- Medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts