We are looking for a DevOps Engineer to instrument, maintain, and optimize monitoring platforms for one of our banking clients, with a focus on observability, incident detection, and remediation.
Requirements
- Experience with observability and instrumentation tools such as Dynatrace, Grafana, Zabbix, or similar
- Experience with metrics, logs, and traces for monitoring critical services (experience with SLOs/SLAs desirable)
- Experience designing log and metric ingestion pipelines (Fluent Bit, Beats, Kafka, or others)
- Scripting and automation with Python, Bash, Terraform, Ansible
- Solid knowledge of Kubernetes/EKS, cloud services (AWS, Azure, or GCP), and databases
- Knowledge of infrastructure and networking
- Experience in advanced troubleshooting and root cause analysis (RCA) using AIOps practices and distributed tracing
- Experience with agile methodologies (Scrum/Kanban) and tools such as Jira and Confluence