AssemblyAI builds the best-in-class Speech AI models powering the next generation of voice applications. We're looking for a Senior Research Engineer to join our streaming speech-to-text research team. The role will own end-to-end and integration-level model evaluation across accuracy, latency, and feature-specific metrics. You'll work directly with our research and engineering teams to ensure our evaluations reflect real-world integration scenarios.

Requirements

ML fundamentals: You understand how ML models are trained and evaluated well enough to interpret results and debug issues.
Strong Python skills: You can write clean evaluation scripts, work with data pipelines, and are comfortable with SQL and cloud infrastructure.
Metric intuition: You understand what makes a good evaluation metric, when to use relative vs. absolute improvements, and how to ensure statistical rigor.
Voice agent stack familiarity: You understand how the components of a voice agent system interact—VAD, ASR, turn detection, LLM, TTS—and can reason about how changes in one affect the others.
Tinkerer mentality: You'd rather ship something rough and iterate than spend weeks perfecting it. You're energized by variety.
Communication skills: You can explain technical results to researchers, summarize findings for leadership, and translate customer feedback into requirements.
Ownership mindset: You don't wait to be told what to evaluate. You see gaps and fill them.
Will need to work at least 3-4 hours overlapping with Eastern US Time Zone

Benefits

Generous Paid Time Off
401k Matching
Retirement Plan
Visa Sponsorship
Four Day Work Week
Generous Parental Leave
Tuition Reimbursement
Relocation Assistance

Requirements

ML fundamentals: You understand how ML models are trained and evaluated well enough to interpret results and debug issues.

Strong Python skills: You can write clean evaluation scripts, work with data pipelines, and are comfortable with SQL and cloud infrastructure.

Metric intuition: You understand what makes a good evaluation metric, when to use relative vs. absolute improvements, and how to ensure statistical rigor.

Voice agent stack familiarity: You understand how the components of a voice agent system interact—VAD, ASR, turn detection, LLM, TTS—and can reason about how changes in one affect the others.

Tinkerer mentality: You'd rather ship something rough and iterate than spend weeks perfecting it. You're energized by variety.

Communication skills: You can explain technical results to researchers, summarize findings for leadership, and translate customer feedback into requirements.

Ownership mindset: You don't wait to be told what to evaluate. You see gaps and fill them.

Will need to work at least 3-4 hours overlapping with Eastern US Time Zone

Research Engineer, Evaluations

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Research Engineer, Evaluations

Research Engineer, Voice

Senior Research Engineer, Voice + Speech

Research Engineer, Evaluations

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Research Engineer, Evaluations

Research Engineer, Voice

Senior Research Engineer, Voice + Speech

Job Details

About AssemblyAI