We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. We are seeking a Foundation Model DevOps Engineer focused on Operational Stability to serve as the backbone of our AI research infrastructure.
Requirements
- A bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
- 3+ years of experience in DevOps, Release Engineering, or MLE, specifically within AI/ML or HPC environments.
- Foundation Model Fluency: You understand the lifecycle of training large models (LLMs or Diffusion).
- Linux/Unix Fluency: You live in the command line.
- Version Control Admin: Expert-level administration of GitHub Enterprise (managing teams, API limits, and repository security).
- Scripting & Automation: Proficiency in Python or Bash to automate repetitive administrative tasks.
Benefits
- Comprehensive medical, dental, and vision benefits
- Bonus
- 401K Plan
- Generous paid time off, sick leave and holidays
- Paid Parental Leave
- Employee Assistance Program
- Life insurance and disability