As a Site Reliability Engineer Staff, you will play a key role in designing, building, and optimizing cloud infrastructure and deployment systems. You will enhance Infrastructure as Code (IAC) and enforce best practices, optimize cloud infrastructure for scalability, security, and cost-effectiveness, and troubleshoot complex production issues to ensure system reliability and customer satisfaction.

Requirements

Minimum of 6 years of hands-on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE).
Proficiency with Linux systems, especially Debian-based distributions.
Strong experience with cloud platforms such as AWS and GCP.
Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible.
Solid programming skills in Python and/or Golang.
Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE).
Experience with GitOps workflows.
Proven track record in implementing and maintaining CI/CD pipelines.
Strong background in security and familiarity with security programs.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK).
Knowledge of both relational (SQL) and non-relational databases.
Excellent problem-solving and debugging skills with a strong sense of ownership.
Experience managing distributed systems like Apache Kafka and Cassandra.
Effective communicator and collaborative team player.

Benefits

Health & Wellbeing
Personal & Professional Development
Unconditional Inclusion

Requirements

Minimum of 6 years of hands-on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE).
Proficiency with Linux systems, especially Debian-based distributions.
Strong experience with cloud platforms such as AWS and GCP.
Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible.
Solid programming skills in Python and/or Golang.
Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE).
Experience with GitOps workflows.
Proven track record in implementing and maintaining CI/CD pipelines.
Strong background in security and familiarity with security programs.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK).
Knowledge of both relational (SQL) and non-relational databases.
Excellent problem-solving and debugging skills with a strong sense of ownership.
Experience managing distributed systems like Apache Kafka and Cassandra.
Effective communicator and collaborative team player.

Benefits

Health & Wellbeing
Personal & Professional Development
Unconditional Inclusion

Site Reliability Engineer Staff

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Site Reliability Engineer Staff

Site Reliability Engineer Staff

Site Reliability Engineer Staff

Site Reliability Engineer Staff

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Site Reliability Engineer Staff

Site Reliability Engineer Staff

Site Reliability Engineer Staff

Job Details

About Hewlett Packard Enterprise