Role Overview

Join NVIDIA's production engineering team to build automation, tooling, and operational systems for large-scale GPU infrastructure. The role focuses on Kubernetes-based infrastructure, GPU cluster operations, reliability, automation, GitOps, and Day 2 operability.

What You Will Do

Build and operate automation for large-scale GPU clusters, and develop tools and services for provisioning, validation, upgrades, monitoring, repair, and cluster lifecycle operations.

Why It Might Be a Fit

You have 8+ years of experience in production infrastructure, strong programming skills in Python, Go, or a similar language, and the ability to troubleshoot distributed systems in production.

Requirements

8+ years of experience building or operating production infrastructure
Strong programming skills in Python, Go, or a similar language
Experience with Linux, Kubernetes, containers, cloud infrastructure, or infrastructure automation
Ability to troubleshoot distributed systems in production
BS/MS in Computer Science or equivalent experience

Benefits

Equity
Benefits

Senior Software Engineer, DGX Cloud Production Engineering
