Flip is the leading AI-powered employee experience platform for frontline workers. As a Senior Site Reliability Engineer, you'll own critical reliability domains end-to-end and drive the technical direction within the squad.
Requirements
- 5+ years of hands-on experience as a Site Reliability Engineer, Platform Engineer, DevOps Engineer, Infrastructure Engineer, Cloud Engineer, or Backend Engineer with a strong infrastructure focus.
- Proven track record building and operating high-throughput, highly available systems in production.
- Deep, production-level experience with Kubernetes on any Hyperscaler.
- Strong experience with modern observability stacks and a clear point of view on SLIs, SLOs and error budgets.
- Solid software development skills in Go (strongly preferred) or Python.
- Hands-on experience with Infrastructure as Code (Pulumi, OpenTofu, Terraform) and GitOps (e.g. ArgoCD) + CI/CD pipeline design.
- Demonstrated ability to lead complex infrastructure initiatives from design to production.
- Experience mentoring engineers and raising the technical bar within a team.
- Comfortable owning major incidents end-to-end and turning learnings into systemic change.
- Strong communication skills and business-fluent English.
- Willingness to participate in on-call rotations to ensure the reliability of our platform.
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Remote work options
- Flexible work-life balance
- E-Gym-Wellpass membership
- Job bike leasing