There are seemingly a million text-to-speech and voice-cloning projects being released. I'm going to note here what I find, or what my buddy Nate sends me. (Some other interesting tools will be noted too ;)). I'm also noting the date of each repository's first GitHub commit.
TTS (Text to Speech)
LuxTTS: https://github.com/ysharma3501/LuxTTS 1/23/2026
Qwen3-TTS: https://github.com/QwenLM/Qwen3-TTS 1/22/2026
PersonaPlex: https://research.nvidia.com/labs/adlr/personaplex/ 1/15/2026
Chroma: https://github.com/FlashLabs-AI-Corp/FlashLabs-Chroma 11/13/2025
NeuTTS: https://github.com/neuphonic/neutts 10/2/2025
Inworld TTS: https://github.com/inworld-ai/tts 7/29/2025
Pocket TTS: https://github.com/kyutai-labs/pocket-tts
Soprano TTS: https://github.com/ekwek1/soprano
Sopro TTS: https://github.com/samuel-vitorino/sopro
VibeVoice: https://github.com/microsoft/VibeVoice
Liquid Audio: https://github.com/Liquid4All/liquid-audio
VoXtream TTS: https://github.com/herimor/voxtream
MiMo Audio: https://github.com/XiaomiMiMo/MiMo-Audio
VoxCPM TTS: https://github.com/OpenBMB/VoxCPM
... There are plenty more; I'll update as I have time.
STT (Speech to Text)
Whisper
GLM ASR Nano: https://github.com/zai-org/GLM-ASR
Tools
Qwen3-Audiobook-Converter: https://github.com/WhiskeyCoder/Qwen3-Audiobook-Converter
Meta Segment Anything Model Audio: (No link; I have Facebook/Meta blocked at my house and don't feel like turning off my blocker.)
* Note: I got hit with a rate limit while checking the commit history of the repos, soooo dunno when I'll have dates on some of these.