NTT DATA is seeking a Site Reliability Engineering (SRE) / Lead Engineer to join our team in Guadalajara, Jalisco (MX-JAL), Mexico (MX).
Requirements
- 8-10+ years of experience in SRE, Observability, or DevOps roles, with leadership responsibilities.
- Hands-on experience with OpenTelemetry for distributed tracing and observability instrumentation.
- Proven expertise with Application Performance Monitoring (APM) tools such as New Relic, Datadog, AppDynamics, or Dynatrace.
- Strong proficiency in Infrastructure as Code (IaC) using Terraform.
- Solid understanding of cloud platforms including AWS, GCP, or Azure.
- Experience with automation/configuration management tools like Ansible, Chef, or Puppet.
- Deep knowledge of CI/CD pipelines and tools such as GitHub Actions, Jenkins, or Azure DevOps.
- Experience managing Kubernetes and containerized environments (Docker, Helm).
- Familiarity with log aggregation and analysis platforms like ELK Stack or Splunk.
- Excellent leadership, communication, and collaboration skills.
Benefits
- Health insurance
- Retirement Plan
- Generous Paid Time Off
- 401k Matching