Neuberger Berman's Technology team is seeking an Observability Engineer to lead and evolve our observability strategy across cloud and on-premise environments.
Requirements
- Partner closely with application, DevOps engineering, SRE/operations, infrastructure, and security teams to understand reliability goals and translate them into scalable monitoring/observability solutions
- Design, build, and maintain scalable observability architectures and platforms
- Develop automated processes to continuously scan and validate uptime/health for business-critical services
- Implement and optimize telemetry collection, alerting, dashboards, and service views
- Define and operationalize SLOs and implement actionable alerting strategies
- Implement and evolve APM capabilities and user experience monitoring
- Integrate observability tooling with incident/problem management processes and ITSM workflows
- Automate onboarding and configuration for telemetry, dashboards, monitors, and alerts using scripting and infrastructure-as-code
- Collaborate on platform evolution and cost/scale optimization
- Champion and evangelize observability practices and tooling adoption across technology teams
Benefits
- Paid time off
- Medical/dental/vision insurance
- Retirement
- Life insurance
- Other benefits