We are seeking a capable, motivated generalist who thrives in a compliant, change-controlled environment and enjoys working across hybrid cloud and on-premises systems. The Staff Site Reliability Engineer will lead end-to-end delivery of complex technical initiatives; own the design, implementation, and reliability of systems; and partner with application architecture and peer teams.
Requirements
- Bachelor's degree in Computer Science or a related field with 7+ years of experience, or equivalent demonstrated impact, in SRE, DevOps, or Infrastructure Engineering
- Broad technical experience across infrastructure and distributed systems
- Strong understanding of distributed systems behavior
- Experience operating in regulated, compliant, or change-controlled environments
- Experience working in hybrid environments (AWS preferred; on-premises infrastructure required)
- Strong experience with Infrastructure as Code, configuration management, and orchestration tools (Terraform, Helm, Kustomize, Ansible)
- Experience with Kubernetes and virtualization technologies
- Experience with observability platforms (e.g., Datadog), including building monitoring and alerting integrations
- Experience with build and release systems (e.g., GitHub Actions, Makefiles, Python tooling)