News

Text-to-speech models from ElevenLabs, Hume AI, and Descript are all pushing the limits of AI-generated voice technology.
Tens of trillions of language tokens: Unlike traditional text-to-speech systems that rely on limited speech datasets, Octave TTS is built on an LLM trained on tens of trillions of language tokens.
Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...
The next wave of innovation will likely center on personalization, letting readers adjust the voice, tempo ...
AI text-to-speech programs could “unlearn” how to imitate certain people: New research shows models can be directly edited to hide selected voices, even when users specifically ask for them.
Text-to-speech with feeling - this new AI model does everything but shed a tear: ElevenLabs' 'most expressive' v3 model can speak with a wide range of emotions in more than 70 languages.
Podcast recording and editing platform Podcastle is joining the AI-powered text-to-speech race with its own model, Asyncflow v1.0. An API for developers ...