PhonePe is seeking a Site Reliability Engineer 3 to join its team. The ideal candidate will have 7-13 years of experience in Linux/Unix system administration and expertise in managing and scaling proxy infrastructure. The role involves troubleshooting issues, improving system reliability and performance, and participating in the on-call rotation.
Requirements
- Troubleshoot issues across the entire stack - hardware, software, application, and network
- Work to improve the reliability and performance of the next generation of distributed systems and containerized deployments
- Diagnose and troubleshoot complex distributed systems handling millions of queries per second
- Participate in the on-call rotation
- Design, build, and maintain core infrastructure that enables PhonePe to scale to support hundreds of thousands of concurrent users
- Actively take part in analysis and system improvement planning
- Drive performance testing, capacity planning, and high-availability practices
- Own implementations of new technologies while ensuring proper testing and documentation
- Proactively monitor for, identify, and resolve issues that could affect our infrastructure
Benefits
- Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance
- Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System
- Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program
- Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy
- Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment
- Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy