Is MOSS-TTS-Nano open source?

Yes — OpenMOSS/MOSS-TTS-Nano is open source, released under the Apache-2.0 license.

What language is MOSS-TTS-Nano written in?

OpenMOSS/MOSS-TTS-Nano is primarily written in Python.

How popular is MOSS-TTS-Nano?

OpenMOSS/MOSS-TTS-Nano has 4k stars on GitHub and is currently cooling off.

Where can I find MOSS-TTS-Nano?

OpenMOSS/MOSS-TTS-Nano is on GitHub at https://github.com/OpenMOSS/MOSS-TTS-Nano.

← all repositories

OpenMOSS/MOSS-TTS-Nano

A 100M-parameter voice cloner that runs on your laptop CPU

It exists to make voice cloning small enough to run on a four-core CPU without sounding like a 1990s answering machine.

★4k stars Python Image · Video · Audio

View on GitHub ↗ Homepage ↗

Velocity · 7d

+6.4

★ / day

Trend

↘cooling

star history

What it does MOSS-TTS-Nano is a 100-million-parameter text-to-speech model that generates speech from text and a short audio prompt for voice cloning. It outputs 48 kHz stereo audio across twenty languages, from Chinese and English to Arabic and Turkish. The project is explicitly aimed at deployment scenarios where model size and hardware requirements matter more than studio-grade fidelity.

The interesting bit The team ships a fully standalone ONNX version that drops the PyTorch dependency entirely during inference, which means it can run inside a browser extension or on a MacBook Air M4 using a single CPU core. That architecture choice—an autoregressive Audio Tokenizer feeding a small LLM—trades absolute quality for the ability to stream audio in real time on modest hardware.

Key highlights

Only 0.1B parameters; designed for CPU inference without a GPU
Supports 20 languages with automatic chunked processing for long inputs
ONNX runtime variant claims nearly 2× the processing efficiency of the original PyTorch version in the team’s own tests
Companion browser extension can run the ONNX model directly inside the browser without a separate local inference service
Native 48 kHz, 2-channel output with streaming decode

Caveats

Dependency setup can be finicky: the README warns that WeTextProcessing and pynini often fail to install cleanly and points to a GitHub issue for platform-specific wheel workarounds
The project describes its own quality target as “good enough for realtime products,” which suggests it is not aiming to compete with larger cloud TTS models on fidelity

Verdict Worth a look if you need embeddable, offline voice cloning for a desktop app, browser extension, or low-traffic web service. Skip it if you need broadcast-quality narration or heavy server-side batch processing where GPU clusters are already available.

Frequently asked

What is OpenMOSS/MOSS-TTS-Nano?: It exists to make voice cloning small enough to run on a four-core CPU without sounding like a 1990s answering machine.
Is MOSS-TTS-Nano open source?: Yes — OpenMOSS/MOSS-TTS-Nano is open source, released under the Apache-2.0 license.
What language is MOSS-TTS-Nano written in?: OpenMOSS/MOSS-TTS-Nano is primarily written in Python.
How popular is MOSS-TTS-Nano?: OpenMOSS/MOSS-TTS-Nano has 4k stars on GitHub and is currently cooling off.
Where can I find MOSS-TTS-Nano?: OpenMOSS/MOSS-TTS-Nano is on GitHub at https://github.com/OpenMOSS/MOSS-TTS-Nano.