We are seeking a highly skilled and experienced Manager of Cloud Operations with a strong focus on Site Reliability Engineering (SRE) to lead our team in ensuring the reliability, performance, and scalability of our cloud-based infrastructure.
Requirements
- Lead, mentor, and develop a team of DevOps and SRE engineers.
- Implement and promote SRE principles and practices across the organization.
- Define and monitor service level objectives (SLOs), service level indicators (SLIs), and service level agreements (SLAs).
- Develop and implement incident response and post-mortem processes.
- Drive automation of operational tasks and infrastructure management.
- Design, implement, and maintain scalable and resilient infrastructure on Azure and/or AWS.
- Implement infrastructure-as-code (IaC) using tools like Terraform.
- Ensure security and compliance of cloud environments.
- Manage CI/CD pipelines for automated deployments.
- Implement and maintain comprehensive monitoring and alerting systems.
- Communicate effectively with stakeholders at all levels.
- Responsible for hiring the right team for the product.
Benefits
- Professional growth and Development opportunities.
- Working within a team of friendly, skilled people where help is always within reach
- Flexible working hours
- 4 recharge days
- High-end laptop
- Competitive pay and bonus
- 18 vacation days in a year
- 15 days Sick Leave/ Casual leave per calendar year
- 16 hours of paid volunteer time off per year
- Wedding gift and newborn gift allowance for employees.
- 26 weeks of paid maternity leave and one week of paid paternity leave.
- 12 wellness leaves for women employees
- Health Insurance of up to 7 lacs
- Group Term Insurance coverage up to three times of their Annual CTC.
- Group Personal Accident coverage up to three times of Annual CTC.
- Provident fund contributions