Voltai is the leading AI company building agentic systems and frontier foundation models for semiconductor and electronics design. We are seeking a Machine Learning Operations professional to design, build, and maintain scalable ML pipelines for training, evaluation, and deployment of LLMs and retrieval-augmented systems.
Requirements
- Design, build, and maintain scalable ML pipelines for training, evaluation, and deployment of LLMs and retrieval-augmented systems
- Operationalize evaluation workflows using both synthetic and human-labeled datasets to monitor model quality at scale across multiple downstream tasks and customer deployments
- Automate the ML Developer lifecycle by implementing robust data versioning, model tracking, and CI/CD pipelines using modern ML Ops tooling
- Optimize model training and inference, focusing on reducing latency, maximizing throughput, and controlling cost across heterogeneous hardware environments
- Collaborate cross-functionally with research, infrastructure, and product teams to productionize foundation models and integrate them into customer-facing AI products
- Deploy and manage both open-source and proprietary models within stringent constraints on latency, security, and compliance—balancing reliability with innovation
- Implement real-time monitoring and alerting systems to detect model/data drift, quality regressions, and infrastructure bottlenecks in live environments
- Work directly with enterprise customers, supporting deployment strategies, ensuring production readiness, and creating tight feedback loops from real-world usage to continuous model improvement
Benefits
- Unlimited PTO
- Comprehensive Health Coverage
- Free Meals and Snacks
- Professional Growth
- Visa Sponsorship