Okta is seeking a highly technical Observability Site Reliability Engineer with expertise in Google Cloud Platform (GCP) to own the Observability ecosystem and expand it into GCP. The role involves designing, building, and maintaining scalable observability infrastructure and optimizing the collection, processing, and storage of observability data.
Requirements
- Automated Infrastructure: Design, build, and maintain scalable observability infrastructure using tools like Terraform.
- GCP Observability Engineering: Optimize the collection, processing, and storage of observability data to ensure high reliability and low latency for our Splunk and Grafana services.
- Incident Response: Participate in on-call rotations and lead post-incident reviews to drive systemic improvements and "observability-driven development."
- Automation: Eliminate "toil" by automating the deployment and scaling of observability agents and collectors.
Benefits
- Health, dental and vision insurance
- 401(k)
- Flexible spending account
- Paid leave (including PTO and parental leave)