The Site Reliability Engineering team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives.
Requirements
- Write high-quality infrastructure-as-code that automates the provisioning, deployment, scaling, and monitoring of Pendo’s infrastructure to ensure that it is reliable and performant
- Write maintainable code for product functionality with a primary emphasis on operations, scale, resiliency, and monitoring
- Debug production issues, learn to mitigate them quickly, and find ways to prevent them
- Maintain runbooks for manual tasks and replace those runbooks with automation whenever possible
- Proactively track our capacity, quotas, and other performance limits to plan for growth
- Participate in a 24x7 on-call rotation to handle product availability issues as well as urgent customer support escalations
Benefits
- Competitive salary
- Benefits and reward opportunities