Abacus Insights is a mission-led technology company that is transforming how data works for health plans. The Principal Sustaining and Forward Deployed Engineer will be responsible for production operations, incident response, and post-launch system reliability across the company's platform.
Requirements
- 10+ years of experience in software engineering, SRE, sustaining engineering, or production operations
- Deep hands-on experience operating production systems in AWS
- Strong experience troubleshooting Databricks and large-scale data platforms
- Proficiency in Python and experience building production services or tooling
- Strong understanding of distributed systems, incident management and RCA practices, monitoring, alerting, and observability, CI/CD Pipelines that leverage Infrastructure as Code.
- Proven ability to own problems end-to-end, from detection to permanent resolution
- Excellent communication skills, especially during incidents and customer escalations
- Ability to work backward from customer impact to root cause across systems and codebases, delivering fixes in environments with minimal documentation.
Benefits
- Unlimited paid time off
- Work from anywhere
- Comprehensive health coverage
- Equity for every employee
- Growth-focused environment
- Home office setup allowance
- Monthly cell phone allowance