OptionMetrics is seeking an IT Operations Manager to lead and evolve our production environment with a strong sense of ownership, urgency, and accountability. The role requires someone who leans in during incidents, drives root cause resolution, and builds systems that prevent recurrence.
Requirements
- End-to-end ownership of production operations, including cloud infrastructure, batch processing, and deployment environments.
- Reliability and uptime of critical systems, with a strong focus on SLAs, monitoring, and incident response.
- Operational excellence across AWS environments, ensuring systems are scalable, secure, and cost-efficient.
- Batch processing architecture, including optimization, scheduling, and failure recovery of daily and intraday jobs.
- CI/CD pipelines and release processes, driving automation and reducing deployment risk.
- Monitoring, alerting, and observability frameworks, ensuring rapid detection and resolution of issues.
- Incident management and post-mortem culture, with a focus on accountability and continuous improvement.
- Automation-first mindset, reducing manual intervention across all operational workflows.
Benefits
- Paid time off: vacation, personal days, sick days, and holidays
- Pre-tax commuter benefits (NJT, MTA, etc.)
- 401(k) plan
- Full medical and dental insurance coverage