Wednesday, January 28, 2026

Text To Speech and Voice Cloning

 There are seemingly a million Text to Speech and voice clone things being released. I'm going to note what I find, or my buddy Nate sends me here. (Also some other interesting tools will be noted too ;)). I'm noting the date of their first github commit on the repository too.

TTS (Text to Speech)

LuxTTS: https://github.com/ysharma3501/LuxTTS 1/23/2026

Qwen3TTS: https://github.com/QwenLM/Qwen3-TTS 1/22/2026 

PersonaPlex: https://research.nvidia.com/labs/adlr/personaplex/ 1/15/2026 

Chroma: https://github.com/FlashLabs-AI-Corp/FlashLabs-Chroma 11/13/2025

NeuTTS: https://github.com/neuphonic/neutts 10/2/2025 

Inworld TTS: https://github.com/inworld-ai/tts 7/29/2025

Pocket TTS: https://github.com/kyutai-labs/pocket-tts

Soprano TTS: https://github.com/ekwek1/soprano 

Sopro TTS: https://github.com/samuel-vitorino/sopro

VibeVoice: https://github.com/microsoft/VibeVoice

Liquid Audio: https://github.com/Liquid4All/liquid-audio

VoxTream TTS: https://github.com/herimor/voxtream 

Mimo Audio: https://github.com/XiaomiMiMo/MiMo-Audio 

VoxCPM TTS;  https://github.com/OpenBMB/VoxCPM

 

... There are plenty more, I'll update as I have time. 

STT (Speech to Text)

Whisper 

GLM ASR Nano: https://github.com/zai-org/GLM-ASR

Tools:

Qwen3-Audiobook-Converter: https://github.com/WhiskeyCoder/Qwen3-Audiobook-Converter

Meta Segment Anything Model Audio: (No link, I have facebook/meta blocked at my house, and don't feel like turning off my blocker).


 

 * Note I got hit with a rate limit checking the commit history of the repos soooo dunno when I'll have dates on some of these.