Hewlett Packard Enterprise is seeking a Site Reliability Engineer Sr. Staff to design, build, and optimize cloud infrastructure and deployment systems. The role requires a minimum of 10 years of experience in InfraOps, DevOps, or SRE, with expertise in Linux systems, cloud platforms, and Infrastructure as Code tools.
Requirements
- Minimum of 10 years of hands-on experience in InfraOps, DevOps, or Site Reliability Engineering (SRE)
- Proficiency with Linux systems, especially Debian-based distributions
- Strong experience with cloud platforms such as AWS and GCP
- Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible
- Solid programming skills in Python and/or Golang
- Deep understanding of containerization (Docker, containerd) and orchestration tools (AWS EKS, GCP GKE)
- Experience with GitOps workflows
- Proven track record in implementing and maintaining CI/CD pipelines
- Strong background in security and familiarity with security programs
- Experience with monitoring and logging tools (Prometheus, Grafana, ELK)
- Knowledge of both relational (SQL) and non-relational databases
- Excellent problem-solving and debugging skills with a strong sense of ownership
- Experience managing distributed systems like Apache Kafka and Cassandra
- Effective communicator and collaborative team player
Benefits
- Health & Wellbeing
- Personal & Professional Development
- Unconditional Inclusion