We are looking for a Senior Software Engineer to join our Observability team and help build the platform that gives Redpanda’s engineering organization deep visibility into the health, performance, and behavior of our systems.
Requirements
- 5+ years of experience in software engineering with a focus on observability, monitoring, or infrastructure
- Deep hands-on experience with the Grafana stack (Grafana, Mimir/Prometheus, Loki, Tempo) in production environments
- Strong understanding of metrics, logging, and distributed tracing paradigms and their trade-offs at scale
- Experience with OpenTelemetry (OTel) for instrumentation and telemetry collection
- Proficiency in at least one systems-level language (Go strongly preferred) and scripting languages (Python, Bash)
- Experience running and operating infrastructure on Kubernetes in public cloud environments (AWS, GCP, or Azure)
- Comfortable working on a 100% distributed engineering team and collaborating through tools such as GitHub
- Solid understanding of time-series databases, log aggregation systems, and query languages (PromQL, LogQL)