Job description
Join NVIDIA's Hardware Infrastructure Farm team as a HPC Operations Engineer to design and implement groundbreaking compute clusters that power all silicon development across NVIDIA. As an expert in high-performance computing, you will build and operate these clusters at high reliability, efficiency, and performance, driving foundational improvements and automation to improve engineers' productivity. The role requires a deep understanding of Linux, container technologies, and cluster configuration management tools, as well as excellent problem-solving and communication skills. The team is diverse and supportive, and NVIDIA encourages collaboration, intellectual curiosity, and problem-solving in a blame-free environment. This is an exciting opportunity to make a lasting impact on the world of AI and computing.
Keep exploring
Sign in to see similar jobs
Create a free account to discover roles related to this posting.
Company
Tech, Software & IT Services
NVIDIA is a leader in the field of AI and computing, with a unique legacy of innovation that's fueled by great technology—and amazing people.