PhonePe is looking for a Site Reliability Engineer 2 to troubleshoot issues across the entire stack, improve the reliability and performance of distributed systems, and design, build, and maintain core infrastructure. The ideal candidate will have strong hands-on experience in Linux/Unix system administration, expertise in managing and scaling proxy infrastructure, and knowledge of database technologies.
Requirements
- Troubleshoot issues across the entire stack - hardware, software, application, and network
- Work to improve the reliability and performance of the next generation of distributed systems and containerized deployments
- Diagnose and troubleshoot complex distributed systems handling millions of queries per second
- Design, build, and maintain core infrastructure that enables PhonePe to scale to support hundreds of thousands of concurrent users
- Participate in the on-call rotation
- Drive performance testing, capacity planning, and high-availability practices
- Own implementations of new technologies while ensuring proper testing and documentation
- Proactively monitor, identify, and resolve issues that could affect our infrastructure
Benefits
- Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance
- Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System
- Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program
- Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy
- Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment
- Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy