We are seeking a Site Reliability Engineer to design and implement enterprise-grade monitoring and observability frameworks, establish and manage SLIs, SLOs, and error budgets, and develop and maintain real-time asset inventory systems across cloud, on-prem, and hybrid environments.
Requirements
- 6+ years of experience in DevOps/SRE roles with monitoring and observability tools
- 4+ years of hands-on Linux experience
- 4+ years of experience automating Infrastructure-as-Code (IaC) deployments
- Strong scripting skills (Python, Bash, PowerShell or similar)
- Proficiency in CI/CD and automation tools
- Cloud certifications is preferred
- Certifications in Grafana, Splunk, Docker, Kubernetes is preferred but optional
Benefits
- 100% Medical, Dental & Vision Coverage for Employees
- Paid Time Off and Paid Holidays
- 401K match up to 5%
- Educational Benefits for Career Growth
- Employee Referral Bonus
- Flexible Spending Accounts