Wednesday, January 28, 2026

Text To Speech and Voice Cloning

 There are seemingly a million Text to Speech and voice clone things being released. I'm going to note what I find, or my buddy Nate sends me here. (Also some other interesting tools will be noted too ;)). I'm noting the date of their first github commit on the repository too.

TTS (Text to Speech)

TinyTTS: https://github.com/tronghieuit/tiny-tts.git

KittenTTS: https://github.com/KittenML/KittenTTS 8/4/2025

LuxTTS: https://github.com/ysharma3501/LuxTTS 1/23/2026

Qwen3TTS: https://github.com/QwenLM/Qwen3-TTS 1/22/2026 

PersonaPlex: https://research.nvidia.com/labs/adlr/personaplex/ 1/15/2026 

Chroma: https://github.com/FlashLabs-AI-Corp/FlashLabs-Chroma 11/13/2025

NeuTTS: https://github.com/neuphonic/neutts 10/2/2025 

Inworld TTS: https://github.com/inworld-ai/tts 7/29/2025

Pocket TTS: https://github.com/kyutai-labs/pocket-tts

Soprano TTS: https://github.com/ekwek1/soprano 

Sopro TTS: https://github.com/samuel-vitorino/sopro

VibeVoice: https://github.com/microsoft/VibeVoice

Liquid Audio: https://github.com/Liquid4All/liquid-audio

VoxTream TTS: https://github.com/herimor/voxtream 

Mimo Audio: https://github.com/XiaomiMiMo/MiMo-Audio 

VoxCPM TTS;  https://github.com/OpenBMB/VoxCPM

 

... There are plenty more, I'll update as I have time. 

STT (Speech to Text)

Whisper 

GLM ASR Nano: https://github.com/zai-org/GLM-ASR

Moonshine: https://github.com/moonshine-ai/moonshine

Tools:

Qwen3-Audiobook-Converter: https://github.com/WhiskeyCoder/Qwen3-Audiobook-Converter

Meta Segment Anything Model Audio: (No link, I have facebook/meta blocked at my house, and don't feel like turning off my blocker).

In browser TTS:

https://github.com/MbBrainz/ttslab 

 

 * Note I got hit with a rate limit checking the commit history of the repos soooo dunno when I'll have dates on some of these.