We're looking for an experienced ML Ops Engineer to join the ML/AI team at Newsela, responsible for taking models from prototype to production, building robust data pipelines, and keeping services running smoothly.
Responsibilities
- Design and maintain CI/CD pipelines for ML model training, packaging, and deployment across microservices.
- Manage containerized services on AWS ECS, optimizing for cost, latency, and availability.
- Automate infrastructure provisioning and service configuration with Terraform.
- Maintain and scale services that rely on third-party LLM providers.
- Build and improve data pipelines that feed data from BigQuery, S3, and DynamoDB into model training and inference workflows.
- Instrument services with observability tooling and establish SLOs for model-serving endpoints.
- Collaborate with ML engineers to productionize new models using BentoML, FastAPI, and container-based serving.
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Tuition Reimbursement
- Visa Sponsorship