We are looking for a highly motivated Senior Manager, Site Reliability Engineering (SRE) to join our team as a technical leader and drive transformative impact across WEX’s platform reliability and operational excellence.
Requirements
- 8+ years of experience with a focus on large-scale system reliability
- Expertise in system architecture, cloud platforms, and automation frameworks
- Deep knowledge of Kubernetes, service meshes, and distributed tracing
- Experience with monitoring and logging (Grafana, ELK stack, Splunk, etc.)
- Knowledge of containerization and orchestration (Docker, Kubernetes)
- Experience designing high-availability, fault-tolerant architectures
- Strong understanding of database reliability engineering (MySQL, PostgreSQL, NoSQL)
- Knowledge of networking, database, and storage architectures
- Excellent incident command and crisis management skills
- Experience setting team OKRs and aligning reliability goals with product and platform engineering strategies
Benefits
- health, dental, and vision insurance
- retirement savings plan
- paid time off
- health savings account
- flexible spending accounts
- life insurance
- disability insurance
- tuition reimbursement