At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the description, we hope you’ll still consider applying. Want to learn more about life at Klaviyo? Visit careers.klaviyo.com to see how we empower creators to own their own destiny.
Requirements
- Build, operate, and improve production systems with a focus on reliability, scalability, and performance
- Apply software engineering principles to automate operational tasks and reduce manual toil
- Contribute to the design and implementation of systems using established SRE best practices
- Help define and measure SLIs and SLOs for services you support
- Improve observability through metrics, dashboards, logging, and tracing
- Participate in on-call rotations and respond to production incidents with guidance and support
- Assist with incident investigation and contribute to post-incident reviews and follow-up actions
- Perform basic analysis around system behavior, capacity usage, and scaling characteristics
- Identify reliability issues or operational pain points and work with teammates to address them
- Collaborate with product, platform, and security engineers to ship reliable systems
- Write and maintain clear operational runbooks and system documentation
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Visa Sponsorship
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement
- Relocation Assistance