Senior Site Reliability Engineer role at Zuora, leading reliability strategy with AI-driven automation and intelligent operations. Requires 8+ years of experience in SRE, DevOps, or large-scale production operations. Must have advanced expertise in AWS and Terraform.
Requirements
- 8+ years of hands-on experience in Site Reliability Engineering, DevOps, or large-scale production operations
- Advanced expertise in AWS, including architecture design across services such as EC2, EKS, VPC, IAM, RDS, S3, and CloudWatch
- Deep experience with Infrastructure-as-Code using Terraform, including complex modules, state management, and governance
- Strong programming and automation skills using Python and Shell
- Expert-level Linux systems knowledge, including performance tuning, security hardening, and deep troubleshooting
- Proven experience operating distributed systems and data streaming platforms such as Kafka in high-throughput environments
- Demonstrated ability to work independently on complex, ambiguous problems with broad organizational impact
- Proven technical leadership experience driving large, cross-team reliability or infrastructure initiatives
Benefits
- Competitive compensation
- Variable bonus and performance reward opportunities
- Retirement programs
- Medical Insurance
- Generous, flexible time off
- Paid holidays
- Wellness days
- Company-wide end of year break
- 6 months fully paid parental leave
- Learning & Development stipend
- Opportunities to volunteer and give back
- Charitable donation match
- Free resources and support for your mental wellbeing