The compute infrastructure team runs the GPU fleet and large-scale compute clusters that serve the models backing ChatGPT and the API. The Technical Program Manager for Compute Infrastructure will own the end-to-end delivery of large-scale GPU clusters, partnering with engineers to bring clusters online across external providers and partners.
Requirements
- Possess a degree in a hard science, or have a demonstrated track record of engineering expertise.
- Have 5+ years of experience in program management for major projects including capital projects or hyperscaler infrastructure deployment.
- Demonstrate the ability to serve as the go-to person solely responsible for driving and delivering complex projects.
- Are comfortable managing cross-functional and cross-company teams; experience driving information and decision hygiene.
- Have an extensive track record of successfully delivering high-profile, technical projects against tight deadlines.
- Are technically adept and have effectively partnered with engineering or fundamental research teams of the highest caliber.
- Have experience interfacing with and leading external vendors including engineering firms, equipment suppliers, and/or construction firms.
- Have expertise in designing and implementing simple, scalable processes that solve complex problems.
- Have experience managing complicated dependencies such as logistics and/or supply chains.
- Are relentlessly resourceful and thrive in ambiguous, fast-paced environments.
- Are interested in and thoughtful about the impacts of AGI.
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Relocation Assistance
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement