Scale AI is a data foundation for AI, helping organizations build and deploy reliable production AI applications. As a Senior/Staff Machine Learning Engineer on the General Agents team, you'll design, build, and deploy production-ready AI agents that solve high-impact enterprise problems.
Requirements
- Design and implement end-to-end agent systems that combine LLM reasoning, tool use, memory, and control logic to solve recurring enterprise use cases.
- Build scalable, reliable agent architectures that can be deployed across many customers with varying data, tools, and constraints.
- Develop evaluation frameworks, datasets, environments, and metrics to measure agent performance, reliability, and business impact in production settings.
- Collaborate closely with product managers, customers, data annotators, and other engineering teams to translate enterprise requirements into robust agent designs.
- Productionize frontier agent techniques (e.g., planning, multi-step reasoning and tool-use, multi-agent patterns) into maintainable, observable systems.
- Own deployment, monitoring, and iteration of agent systems, including failure analysis and continuous improvement based on real-world usage.
- Contribute to technical direction and architectural decisions for general agent development best practices and methods, with increasing scope and leadership at the Staff level.
Benefits
- Comprehensive health, dental and vision coverage
- Retirement benefits
- Learning and development stipend
- Generous PTO
- Commuter stipend (may be eligible)