We’re looking for a Software Engineer 3 to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native experiences in MongoDB Atlas.
Requirements
- Design and build components of a multi-tenant inference platform integrated directly with MongoDB Atlas
- Collaborate with AI engineers and researchers to productionize inference for embedding models and rerankers
- Contribute to platform capabilities such as latency-aware routing, model versioning, health monitoring, and observability
- Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native environment
- Work across product, infrastructure, and ML teams to ensure the inference platform meets the scale, reliability, and latency demands of Atlas users
- Gain hands-on experience with tools like vLLM and container orchestration with Kubernetes
Benefits
- Flexible paid time off
- 20 weeks fully-paid gender-neutral parental leave
- Fertility and adoption assistance
- 401(k) plan
- Mental health counseling
- Access to transgender-inclusive health insurance coverage
- Health benefits offerings