A text-to-speech (TTS) Research Scientist position is available at Cerence Inc. in Seoul. The role involves designing and optimizing text/NLP preprocessing pipelines, integrating language models, and developing solutions for emotion and style control in synthesized speech.
Responsibilities
- Design and optimize text/NLP preprocessing pipelines using deep learning or machine learning methods
- Integrate language models to improve contextual and semantic understanding for natural intonation
- Develop rule-based and neural solutions for emotion/style control in synthesized speech
- Build state-of-the-art acoustic models to map linguistic features to spectrograms or waveform parameters
- Optimize neural vocoders for high-fidelity, real-time speech synthesis
- Optimize inference latency for both edge devices and cloud platforms
- Enhance robustness through noise suppression, speaker adaptation, and multilingual, cross-language, and cross-gender voice cloning
Benefits
- Equal Employment Opportunity (EEO)
- Zero-tolerance policy for workplace violence