We are seeking a skilled and independent Site Reliability Engineer to join our client's engineering team for a project-based engagement focused on Production Engineering and Site Reliability Engineering (SRE). This role requires a proven ability to deliver technical solutions, triage and resolve complex production issues, and work independently while collaborating with engineering and infrastructure teams when necessary.
Requirements
- Design, develop, test, and deploy automation tools, scripts, and engineering solutions to improve the stability, performance, and efficiency of production systems.
- Identify opportunities to automate manual operational processes and reduce operational overhead.
- Support and improve the release and deployment lifecycle of applications, ensuring reliable and controlled production rollouts.
- Collaborate with software engineers and infrastructure teams to troubleshoot and resolve system issues.
- Contribute to system design discussions, platform management, and capacity planning.
- Create and maintain clear technical documentation for automation tools, operational procedures, and reliability improvements.
- Provide regular updates on progress and deliverables to engineering stakeholders.
Benefits
- Opportunity to work on production systems, solve complex technical challenges, and improve reliability through automation