Join SimCorp, a leading provider of integrated investment management solutions, as a Senior Site Reliability Engineer. Work on Cloud Native Products & Services, taking ownership of monitoring, observability, release management, and more. Drive stability, continuous improvement, and operational excellence in Azure-based environments.
Requirements
- Support the operational and enhancement of mission-critical environments for both new and existing Cloud Native products & services
- Collaborate with product development teams to enhance monitoring, observability, reliability, and performance of these services
- Manage & improve our infrastructure deployment pipelines and troubleshoot onboarding and operational issues
- Drive capacity planning efforts to ensure our platform is resilient and scalable as we grow
- Build tools and automation to eliminate manual TOIL, improve engineering velocity, developer experience, and improve system reliability
- Define and manage SLOs and error budgets in partnership with Engineering teams
- Contribute to incidents, problems, and change management processes
- Execute disaster recovery, configuration management, and platform readiness tasks
- Collaborate with Agile teams and take part in design discussions with clients, vendors, and stakeholders
- Contribute to knowledge sharing across multiple Product Areas
Benefits
- Global hybrid work policy
- Inclusive and diverse company culture
- Work-life balance
- Empowerment
- Career & Growth