We are seeking a Site Reliability Engineer to join the Observability group inside our Platform Engineering domain. You will build tools for monitoring and measuring infrastructure, microservices, and workloads, and contribute to understanding and preventing incidents through tooling, automation, and people-centered processes.
Requirements
- Understanding of the basic building blocks of observability: metrics, logs, and traces
- Experience using the right tools to extract data (think things like Prometheus, StatsD, OpenTelemetry libraries), transform data (tools like Vector, Beats, FluentBit), and load data (OpenSearch, Grafana, Datadog, etc.)
- Solid skills with at least one glue language like GoLang or Python
- Ability to build in a cloud-only environment with infrastructure as code tools like Terraform and AWS CDK
- Brain that runs on Linux
Benefits
- Competitive personal development budget
- Work from home budget
- Discounts to fitness & wellness memberships
- Language apps
- Public transportation
- Premium subscription on personal N26 bank account
- Subscriptions for friends and family members
- Additional day of annual leave for each year of service
- High degree of autonomy
- Access to cutting edge technologies