Capital Markets Gateway LLC is looking for a Site Reliability Engineer with a focus on monitoring, observability, and alerting to ensure the reliability, performance, and scalability of our infrastructure and applications. The successful candidate will design, implement, and maintain monitoring solutions to provide visibility into system health and performance, proactively detect anomalies, and reduce incident response time.
Requirements
- Must be based in Latin America
- English level - C1 or C2
- Proven experience as a Site Reliability Engineer or similar role
- Proficiency in logging, metrics, and tracing frameworks (DataDog, Loki, Prometheus, OpenTelemetry)
- Experience with cloud platforms (Azure preferred) and infrastructure-as-code tools (e.g., Terraform)
- Strong programming and scripting skills (Python, Bash)
- Proficiency in containerization technologies and orchestration tools (Docker, Kubernetes)
- Understanding of Linux-based systems, networking, and security principles related to containerized applications
- Strong problem-solving and troubleshooting skills, with a passion for identifying and resolving complex technical issues
- Excellent communication and collaboration abilities
- Ability to thrive in a fast-paced, constantly evolving environment
- Experience with PostgreSQL monitoring and optimization (Optional/Nice to have)
Benefits
- 2 year+ contract
- 15 business days of vacation
- Tech courses and conferences
- Top-of-the-line MacBook
- Flexible working hours