At SiteMinder, we're seeking a Senior Site Reliability Engineer to contribute to the architecture of our data platform infrastructure. As a key member of the Infrastructure Platform team, you'll design, build, and maintain reliable, production-grade infrastructure using Terraform, and partner with Data Engineers and Data Scientists to translate their requirements into scalable, secure systems.
Requirements
- Extensive experience in infrastructure platform engineering (SRE, DevOps, Platform)
- Strong proficiency in at least one programming language (e.g. Python, Go)
- Strong Linux administration skills and security hardening experience
- Demonstrated deep experience with Terraform at scale
- Demonstrated deep experience with AWS — VPC and networking, IAM, EC2 / EKS / ECS, S3, monitoring and logging
- Demonstrated deep experience in Kubernetes cluster administration
- Understanding of MLOps and LLMOps principles
- Experience with big data and streaming technologies (e.g. Spark, Hadoop, Kafka) is highly desirable
- Industry certifications preferred — e.g. CKA, CKAD, AWS Solutions Architect Professional, AWS DevOps Engineer Professional
- Exposure to security and compliance frameworks such as PCI DSS, GDPR, and ISO 27001
Benefits
- Mental health and well-being initiatives
- Generous parental (including secondary) leave policy
- Flexibility to work in a hybrid model (2-3 days in the office)
- Paid birthday, study and volunteering leave every year
- Sponsored social clubs, team events, and celebrations
- Employee Resource Groups (ERGs) to help you connect and get involved
- Investment in your personal growth, with training to support your advancement