Airalo is looking for a Senior Site Reliability Engineer to join their growing engineering team. The successful candidate will be responsible for designing scalable, fault-tolerant, and self-healing systems in a multi-region AWS environment.
Requirements
- Bachelor’s degree in Computer Engineering or a similar discipline
- 5+ years of experience as a Site Reliability Engineer or in a similar role
- 3+ years of experience with AWS services, including strong knowledge of container orchestration
- 2+ years of Kubernetes experience
- Deep understanding of observability principles and tools such as Prometheus, Datadog, and OpenTelemetry
- Experience leading incident management and conducting complex postmortem analyses
- Experience and interest in managing infrastructure as code (Terraform)
- Experience with chaos engineering and other techniques for testing system resilience
- Experience with CI/CD tools such as GitHub Actions
- Proficiency in at least one programming language (Python, Go, Java, etc.) for building automation and internal tooling
- Experience with event-driven architecture (SNS, SQS, etc.)
- Ability to work independently and collaboratively in a fast-paced environment
- Team player and open to new ideas
- Good communication skills and fluency in English
Benefits
- Health Insurance
- Work-from-anywhere stipend
- Annual wellness & learning credits
- Annual all-expenses-paid company retreat in a gorgeous destination & other benefits