AssemblyAI builds the best-in-class Speech AI models powering the next generation of voice applications. We're looking for a Senior Research Engineer to join our streaming speech-to-text research team. The role will own end-to-end and integration-level model evaluation across accuracy, latency, and feature-specific metrics. You'll work directly with our research and engineering teams to ensure our evaluations reflect real-world integration scenarios.
Requirements
- ML fundamentals: You understand how ML models are trained and evaluated well enough to interpret results and debug issues.
- Strong Python skills: You can write clean evaluation scripts, work with data pipelines, and are comfortable with SQL and cloud infrastructure.
- Metric intuition: You understand what makes a good evaluation metric, when to use relative vs. absolute improvements, and how to ensure statistical rigor.
- Voice agent stack familiarity: You understand how the components of a voice agent system interact—VAD, ASR, turn detection, LLM, TTS—and can reason about how changes in one affect the others.
- Tinkerer mentality: You'd rather ship something rough and iterate than spend weeks perfecting it. You're energized by variety.
- Communication skills: You can explain technical results to researchers, summarize findings for leadership, and translate customer feedback into requirements.
- Ownership mindset: You don't wait to be told what to evaluate. You see gaps and fill them.
- Will need to work at least 3-4 hours overlapping with Eastern US Time Zone
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Visa Sponsorship
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement
- Relocation Assistance