We are seeking a Principal Engineer in Cluster Orchestration to lead the design and evolution of our cluster orchestration systems, define long-term architecture, and solve hard scaling problems to support AI infrastructure.
Requirements
- 15+ years of experience building and operating large-scale distributed systems
- Deep, practical knowledge of Kubernetes and Slurm internals
- Experience running GPU-heavy platforms for AI training, inference, or HPC workloads
- Strong background in Go and cloud-native systems development
- Proven ability to set technical direction across teams without direct authority
Benefits
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid life insurance
- Voluntary supplemental life insurance
- Short- and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to participate in the Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption