We are seeking an experienced Senior Infrastructure Engineer to join our team. The ideal candidate is passionate about building and maintaining complex systems and takes a holistic approach to architecture. You will play a key role in designing, implementing, and managing cloud infrastructure that is scalable, highly available, secure, and performant.
Responsibilities
- Design, architect, and implement scalable, highly available, and secure infrastructure solutions, primarily on Google Cloud Platform (GCP) and/or Amazon Web Services (AWS).
- Develop and maintain Infrastructure as Code (IaC) with Terraform so that enterprise-scale environments remain maintainable and repeatable.
- Use Kubernetes deployment tools such as Helm or Kustomize together with GitOps tools such as ArgoCD to deploy and manage containerized workloads.
- Design and implement CI/CD pipelines on platforms such as GitHub, GitLab, Bitbucket, Cloud Build, or Harness, with a focus on rolling, canary, and blue/green deployment strategies.
- Ensure auditability and observability of pipeline states.
- Implement security best practices and meet audit and compliance requirements across the infrastructure.
- Provide technical mentorship and training to engineering staff.
- Engage with clients to understand their technical and business requirements and propose solutions tailored to them.
- Troubleshoot and resolve complex infrastructure issues.
- Lead and participate in incident response, troubleshooting, and root cause analysis for production issues.
- Manage incident lifecycle activities including triage, escalation, communication, and post-incident reviews.
- Monitor application and infrastructure health using observability platforms and monitoring tools.
- Define and maintain SLIs, SLOs, and error budgets to improve service reliability.