Remote is solving modern organizations' biggest challenge – navigating global employment compliantly with ease. As a Staff SRE at Remote, you will own the technical direction of our SRE platform, shaping its architecture, reliability strategy, and long-term evolution.
Requirements
- 8+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering
- Deep expertise in Kubernetes: operating, designing, and scaling production clusters
- Proven experience designing and managing cloud infrastructure on AWS (or other cloud providers) at scale
- Strong infrastructure-as-code practice with Terraform
- Experience defining and operating reliability frameworks: SLOs, SLIs, error budgets, alerting strategies
- Solid observability background: Datadog, Grafana/Prometheus, or similar
- Proficiency with CI/CD platforms (GitLab CI, GitHub Actions, or similar) and deployment automation
- Comfortable with Bash and scripting for automation; broader programming skills are a plus
- Experience with container tooling (Docker) and the broader ecosystem around it
- Curiosity and practical experience applying AI tools to infrastructure, operations, or developer tooling
Benefits
- Flexible paid time off
- Flexible working hours (async)
- 16 weeks paid parental leave
- Mental health support services
- Stock options
- Learning budget
- Home office budget & IT equipment
- Budget for local in-person social events or co-working spaces