We're hiring an AI Infrastructure Engineer to shape and scale the backend systems that power our AI platform.
Requirements
- Design and implement scalable backend architectures for AI workloads (inference, orchestration, monitoring).
- Own distributed job orchestration with Temporal and related systems.
- Improve data pipeline performance by designing smarter caching strategies to reduce redundant compute and API calls.
- Build observability, monitoring, retries, and fault tolerance into all workflows.
- Manage infrastructure reliability, incident response, and performance.
- Develop tooling and platform infrastructure to support rapid growth.
- Partner with ML engineers to bring models to production at scale.