We’re looking for an Observability Engineer to design and evolve intelligent monitoring solutions across hybrid and cloud-native environments. This role sits at the heart of platform reliability — turning telemetry into insight, noise into signal, and outages into opportunities for improvement.
Requirements
- Design, automate, and optimise enterprise observability platforms for logging, metrics, and tracing
- Build and enhance monitoring solutions using tools such as SolarWinds Observability, ManageEngine App Manager, Prometheus, Grafana and other open-source tooling, and Microsoft Azure monitoring services
- Consolidate and analyse application logs, system logs, and distributed traces at scale
- Integrate observability tooling with ITSM platforms to improve incident response and operational workflows
- Automate deployment and configuration of monitoring infrastructure using Terraform, Ansible, or similar
- Develop scripts using Python, Bash, or PowerShell to streamline operational tasks
- Implement telemetry collection using protocols and interfaces such as SNMP (polling and traps), WMI, SSH, and APIs
- Support observability for containerised and cloud-native workloads
- Promote consistent telemetry standards and interoperability across platforms
Benefits
- Variety of work
- Opportunity to interact with a wide range of experts
- Inclusive work environment
- Equal opportunities
- Flexible working
- 50% of working time can be spent in the office