Join us and we promise you the most intense and fulfilling years of your career, doing life-changing work in a fun, inventive, soulful culture.
Requirements
- Leverage AI-powered tools and automation to transform how we monitor, troubleshoot, and maintain production systems
- Build and operate cloud infrastructure on AWS, using Terraform to codify and version-control our entire environment
- Manage and scale Kubernetes clusters that power BetterUp's platform, ensuring high availability and performance
- Design intelligent alerting and observability systems
- Collaborate with engineering teams to embed reliability into the development lifecycle, shifting left on operational concerns
- Automate incident response workflows and build self-healing infrastructure
- Experiment with and adopt emerging AI tools for log analysis, anomaly detection, and predictive maintenance
- Drive continuous improvement through data-driven retrospectives and reliability metrics
Benefits
- Access to BetterUp coaching
- A competitive compensation plan with opportunity for advancement
- Medical, dental, and vision insurance
- Flexible paid time off
- Per year: All federal/statutory holidays observed
- 4 BetterUp Inner Workdays
- 5 Volunteer Days to give back
- Learning and Development stipend
- Company wide Summer & Winter breaks
- Year-round charitable contribution of your choice on behalf of BetterUp
- 401(k) self contribution