Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems.
Requirements
- Have proven experience with flow matching, diffusion models, and autoregressive networks in the audio domain.
- Have experience training deep learning models, from medium-sized to large.
- Have experience building streaming text-to-speech or speech-to-speech models.
- Have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping.
- Know state-of-the-art architectures in representation learning (audio or image domain) and face animation, in addition to having a deep understanding of the core areas of expertise above.
- Have excellent programming skills and be fluent in PyTorch.
- Show evidence of original research, with publications in top-tier or solid second-tier venues (e.g., CVPR, NeurIPS, BMVC or equivalent).
- Be excited about building lifelike, expressive avatars for real-time applications.
- PhD or equivalent experience preferred
- Experience leading research teams
- Knowledge of best practices in software development
Benefits
- Flexible work schedule
- Unlimited PTO
- Competitive healthcare
- Gear stipends
- Fun