The Site Reliability Engineer Staff will design, build, and optimize cloud infrastructure and deployment systems, impacting scalability, security, and operational efficiency across platforms. Key responsibilities include enhancing Infrastructure as Code, optimizing cloud infrastructure, developing internal tools, and addressing container image vulnerabilities.

Requirements

Minimum of 6 years of hands-on experience in InfraOps, DevOps, or Site Reliability Engineering (SRE)
Proficiency with Linux systems, especially Debian-based distributions
Strong experience with cloud platforms such as AWS and GCP
Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible
Solid programming skills in Python and/or Golang
Deep understanding of containerization (Docker, containerd) and orchestration tools (AWS EKS, GCP GKE)
Experience with GitOps workflows
Proven track record in implementing and maintaining CI/CD pipelines
Strong background in security and familiarity with security programs
Experience with monitoring and logging tools (Prometheus, Grafana, ELK)
Knowledge of both relational (SQL) and non-relational databases
Excellent problem-solving and debugging skills with a strong sense of ownership
Experience managing distributed systems like Apache Kafka and Cassandra
Effective communicator and collaborative team player

Benefits

Health & Wellbeing
Personal & Professional Development
Unconditional Inclusion

Job Details

About Workday