Voltai is a leading AI company building agentic systems and frontier foundation models for semiconductor and electronics design. We're seeking a Machine Learning Systems Engineer to design and maintain high-performance ML pipelines for training, evaluation, and inference of LLMs and retrieval-augmented systems, with a focus on hardware efficiency and throughput.
Responsibilities
- Design and maintain high-performance ML pipelines for training, evaluation, and inference of LLMs and retrieval-augmented systems
- Optimize core transformer operations at the kernel level, designing and tuning custom kernels and low-level implementations for GPU-accelerated workloads
- Implement and integrate low-precision computation techniques to reduce memory footprint and accelerate inference with minimal accuracy degradation
- Build and maintain inference engines for on-premises deployments
- Architect distributed training and inference systems
- Collaborate closely with researchers and infra teams to bring cutting-edge model innovations into production
- Interface directly with enterprise hardware environments, tuning performance based on real-world deployment constraints
Benefits
- Unlimited PTO
- Comprehensive Health Coverage
- Free Meals and Snacks
- Professional Growth
- Visa Sponsorship