Universal Music Group is seeking a Senior Observability Engineer to lead the architectural design and strategic roadmap for their observability stack, ensuring the reliability, performance, and scalability of their critical IT systems. The role requires hands-on experience in observability, site reliability engineering, or DevOps, with a proven track record of leading complex projects and mentoring technical teams.
Requirements
- 5-7+ years of hands-on experience in an Observability, Site Reliability Engineering (SRE), or DevOps role
- Demonstrated experience in architecting and designing large-scale monitoring and observability solutions
- Deep expertise with modern observability platforms
- Advanced knowledge of major cloud platforms, containerization, and Infrastructure as Code
- Strong programming and scripting skills with a focus on creating scalable automation and custom tooling
- Exceptional analytical and strategic problem-solving skills, with the ability to lead through complex technical challenges
- Expertise in analysing and visualising telemetry data into meaningful information to drive actions
- Hands-on engineering and coding experience, ability to deep-dive into existing and emerging technologies to identify opportunities and solutions
- Understanding of container technologies and container orchestration platforms to monitor and manage containerized applications
- Understanding of networking principles and protocols to effectively monitor and troubleshoot network-related issues
- Awareness of security best practices and the ability to integrate security monitoring into observability processes
- Excellent communication and interpersonal skills, capable of articulating a technical vision to diverse audiences and influencing senior stakeholders
Benefits
- Competitive salary
- Benefits package
- Opportunities for professional growth and development