Coalition is seeking a Senior Site Reliability Engineer to join its Platform SRE team. The successful candidate will design, build, and scale production environments using AWS and Terraform, and will be responsible for infrastructure automation, system reliability, developer enablement, and observability.
Requirements
- 6+ years of experience in SRE, DevOps, Cloud Engineering, or Software Development roles
- Hands-on experience operating production environments in AWS
- Proficiency in Go or Python, with experience building production-grade automation, tooling or libraries
- Strong experience with Terraform
- Experience with container orchestration platforms like ECS or Kubernetes
- Familiarity with CI/CD tools such as GitHub Actions
- Experience designing and implementing re-usable platform components based on team requirements
- Solid understanding of observability practices including system metrics, distributed tracing, and SLOs
- Exposure to failure-based testing approaches and automated recovery strategies
- Strong leadership and communication skills, both written and verbal
- Experience evangelizing reliability best practices
Benefits
- 100% medical, dental, and vision coverage
- Flexible PTO
- Annual home office stipend and WeWork access
- Mental & physical health wellness programs like Headspace, Lumino, and more!
- Competitive compensation and opportunity for advancement