KittenTTS now offers nano, micro, and mini text-to-speech models, with the smallest int8 build under 25 MB, designed for ONNX CPU inference. Creators can run local voice tools without a cloud round trip.

Posted by rohan_joshi
Kitten TTS is an open-source, lightweight text-to-speech library built on ONNX, with models from 15M to 80M parameters (25-80 MB). It supports CPU inference without a GPU and offers 8 built-in voices, adjustable speed, text preprocessing, and 24 kHz output. The latest release, v0.8.1 (Feb 2026), includes nano (15M/25MB int8), micro (40M), and mini (80M) models. Installable via pip, with a basic Python API for generation. 13k+ stars, Apache 2.0 license.
KittenTTS v0.8.1 packages three model sizes: nano at 15M parameters, micro at 40M, and mini at 80M, with the nano model quantized to roughly 25 MB in int8 form, per the project page. The library is open source, built on ONNX, installable via pip, and positioned for CPU-first use rather than a cloud API round trip.
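As a rough sketch of that pip workflow, assuming the `generate`-style API shown in the project README (the model id and voice name below are illustrative, and the snippet falls back to a second of silence so it runs even where the package is not installed):

```python
import numpy as np

SAMPLE_RATE = 24_000  # the library's documented output rate

try:
    # pip install kittentts -- per the project page
    from kittentts import KittenTTS

    model = KittenTTS("KittenML/kitten-tts-nano-0.1")  # model id is illustrative
    audio = model.generate("Local TTS without a GPU.", voice="expr-voice-2-f")
except ImportError:
    # Fallback so this sketch stays runnable without the package:
    # one second of silence at 24 kHz.
    audio = np.zeros(SAMPLE_RATE, dtype=np.float32)

print(f"{len(audio) / SAMPLE_RATE:.2f} s of audio")
```

The result is a plain sample array, so it can be written out with any WAV writer (e.g. `soundfile.write("out.wav", audio, 24000)`).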
For creative workflows, the concrete features are simple but useful: eight built-in voices, adjustable speed, text preprocessing, and 24 kHz output, per the launch thread. That makes it more relevant for local narration, character placeholders, interactive installs, and quick voice mockups than for fully directed studio voice performance.
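"Text preprocessing" in a TTS front end typically means expanding abbreviations and verbalizing digits before the acoustic model sees the text. A minimal, self-contained sketch of that idea (the rules here are illustrative, not KittenTTS's actual pipeline):

```python
import re

# Illustrative normalization tables; a real front end would be far larger.
ABBREVIATIONS = {"kHz": "kilohertz", "Dr.": "Doctor", "TTS": "text to speech"}
DIGIT_WORDS = "zero one two three four five six seven eight nine".split()


def normalize(text: str) -> str:
    """Expand abbreviations, then read digits aloud one at a time."""
    for abbrev, expansion in ABBREVIATIONS.items():
        text = text.replace(abbrev, expansion)
    # Naive digit-by-digit reading; a real front end would verbalize
    # whole numbers ("24" -> "twenty four").
    text = re.sub(r"\d", lambda m: f" {DIGIT_WORDS[int(m.group())]} ", text)
    return re.sub(r"\s+", " ", text).strip()


print(normalize("8 voices at 24 kHz"))
# -> "eight voices at two four kilohertz"
```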
Posted by rohan_joshi
Thread discussion highlights:
- tredre3 on dependency bloat: The package pulls in a chain of dependencies, including spacy and, via uv, torch/CUDA packages totaling several GB, which the commenter says undermines the appeal of a tiny edge model.
- baibai008989 on edge deployment and latency: A Raspberry Pi/home-automation use case is cited as exactly where a sub-25 MB model matters, but the commenter asks about first-chunk latency and whether the system supports streaming output for interactive use.
- bobokaytop on quality vs. latency on low-power hardware: The commenter says the real bottleneck for edge deployments is often inference latency and audio-streaming architecture, not just model size, and asks how it performs on a Raspberry Pi 4 in real time.
The early discussion is less about whether 25MB is impressive and more about what happens after install. In the thread summary, commenters say dependency chains can pull in far larger packages than the headline model size suggests, which undercuts the appeal for edge setups.
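The dependency-chain complaint is checkable: Python's standard `importlib.metadata` can sum the on-disk size of any installed distribution, so you can compare the headline model size against what pip actually put on disk. A sketch (the package names in the loop are just examples):

```python
from importlib import metadata
from pathlib import Path


def installed_size_bytes(dist_name: str) -> int:
    """Approximate on-disk size of an installed distribution's files."""
    dist = metadata.distribution(dist_name)  # raises PackageNotFoundError if absent
    total = 0
    for f in dist.files or []:
        p = Path(dist.locate_file(f))
        if p.is_file():
            total += p.stat().st_size
    return total


# Report a few of the heavyweight dependencies named in the thread.
for name in ("torch", "spacy", "numpy"):
    try:
        print(f"{name}: {installed_size_bytes(name) / 1_000_000:.0f} MB")
    except metadata.PackageNotFoundError:
        print(f"{name}: not installed")
```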
The other open questions are real-time behavior and control. Commenters ask about first-chunk latency, streaming output, Raspberry Pi performance, and whether creators get finer expressive controls such as pitch, volume, or explicit style tags.
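First-chunk latency is a measurable quantity: the delay before a streaming synthesizer yields its first audio buffer. A self-contained sketch with a stand-in generator (whether KittenTTS streams at all is exactly what the commenters are asking, so the synthesizer below is a dummy):

```python
import time
from typing import Iterator

SAMPLE_RATE = 24_000  # the library's documented output rate


def stream_tts(text: str, chunk_ms: int = 200) -> Iterator[bytes]:
    """Stand-in streaming synthesizer: yields fixed-size 16-bit PCM chunks.
    A real engine would yield audio as inference produces it."""
    n_chunks = max(1, len(text) // 20)
    for _ in range(n_chunks):
        time.sleep(0.005)  # simulated per-chunk inference cost
        yield b"\x00" * (SAMPLE_RATE * 2 * chunk_ms // 1000)


def first_chunk_latency(text: str) -> float:
    """Seconds until the first audio chunk arrives (time-to-first-chunk)."""
    start = time.perf_counter()
    next(iter(stream_tts(text)))
    return time.perf_counter() - start


print(f"{first_chunk_latency('Hello from the edge.') * 1000:.1f} ms")
```

For interactive use on a Pi, this number, not total synthesis time, is what determines whether a voice assistant feels responsive.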
Posted by rohan_joshi
Relevant for creator-facing voice workflows because it’s about compact speech synthesis with multiple voices, expressive quality, and whether a small local model can be good enough for production audio generation.
KittenML's latest open-source TTS release spans 15M to 80M models, with the smallest coming in under 25 MB and the larger models reportedly running faster than real time on CPU. Audio creators should test pronunciation and install overhead before betting on it for edge or local voice tools.
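"Faster than real time" has a standard yardstick, the real-time factor (RTF): wall-clock synthesis time divided by the duration of the audio produced, where values below 1.0 mean the engine keeps up. A minimal, engine-agnostic helper for benchmarking any synthesizer on your own hardware (the dummy engine below is illustrative):

```python
import time


def real_time_factor(synth_fn, text: str, sample_rate: int = 24_000) -> float:
    """RTF = synthesis wall-clock time / duration of audio produced.
    RTF < 1.0 means faster than real time. `synth_fn` is any callable
    returning a 1-D sequence of audio samples."""
    start = time.perf_counter()
    samples = synth_fn(text)
    elapsed = time.perf_counter() - start
    return elapsed / (len(samples) / sample_rate)


# Dummy engine: "synthesizes" 2 s of silence near-instantly.
rtf = real_time_factor(lambda t: [0.0] * 48_000, "hello")
print(f"RTF = {rtf:.4f}")
```

Running the same helper against a real model on a Raspberry Pi 4 would answer the thread's performance question directly.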