Playson is seeking an experienced Principal Site Reliability Engineer to join their dynamic Platform Tribe. The role involves managing day-to-day alerts, providing on-call support, and collaborating with other teams to provide top-notch support and assistance.
Requirements
- Proficiency in Kubernetes
- Experience with configuration management tools like FluxCD/ArgoCD
- Strong experience with issue processing (RCA, Postmortems)
- Familiarity with AWS, Terraform, Docker, CI/CD
- Experience with monitoring tools like DataDog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS CloudWatch
- Strong understanding of networking concepts and protocols
- Proficiency in at least one scripting language (e.g., Python, NodeJS, Go)
- Proficiency in Git or other version control systems
- Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps
- Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform
Benefits
- Competitive Salary
- Annual performance/salary reviews
- Realistic and transparent Bonus system (15-20%)
- Unlimited paid vacation leave & paid sick leave
- Flexible work schedule
- 100% Remote
- Financial Support for Life Events & Extended Parental Leave
- Paid professional development courses and trainings
- B2B contracts