Coterie is seeking a Site Reliability Engineer to join their team. The ideal candidate will have 3+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure. They will be responsible for managing and maintaining cloud infrastructure, building and improving CI/CD pipelines, and collaborating with development teams to define and track SLIs, SLOs, and error budgets.
Requirements
- 3+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure
- Strong hands-on experience with Azure Cloud services and resource management
- Kubernetes and AKS administration, including deployments, networking, and troubleshooting
- GitHub Actions for CI/CD pipeline development and maintenance
- Solid experience with Grafana, including dashboard creation, alerting configuration, and incident management
- Hands-on experience with Prometheus, Loki, or other observability tools in the Grafana ecosystem
- Proficiency in at least one scripting or programming language such as Python or Bash
- Understanding of networking fundamentals, DNS, load balancing, and container orchestration concepts
- Strong analytical and communication skills; able to diagnose complex system issues and clearly communicate findings
- Demonstrated ability to collaborate across teams and contribute to a culture of reliability
- Experience working in an agile environment with modern DevOps practices
Benefits
- 100% remote
- Health insurance through Aetna (we pay 100% of premiums)
- Dental and vision insurance through Guardian (we pay 100% of premiums)
- Basic life insurance (we pay 100% of premiums)
- Access to a flexible spending account (FSA) or health savings account (HSA) (for those using HSA-eligible plans)
- 401(k) plan (up to 4% match with immediate vesting)
- Flexible PTO policy, with up to 3 weeks of time off during the first twelve months of employment to support onboarding and integration
- 12 company-paid holidays each year
- Continuing education annual stipend