← all repositories
myshell-ai/OpenVoice

Clone any voice, speak any language, keep the accent

A research-backed TTS system that separates tone color from style, letting you copy a voice's timbre while freely adjusting emotion, rhythm, and language.

36.6k stars Python Image · Video · Audio
OpenVoice
Velocity · 7d
+40
★ / day
Trend
steady
star history

What it does OpenVoice takes a short audio sample of someone’s voice and generates new speech in that voice. It handles multilingual output and lets you tweak style parameters—emotion, accent, rhythm, pauses, intonation—independently from the core vocal identity. V2 adds native support for English, Spanish, French, Chinese, Japanese, and Korean, plus better audio quality and an MIT license for commercial use.

The interesting bit The architecture decouples “tone color” (who it sounds like) from “style” (how they speak). This means you can clone a voice the model never saw during training, in a language the training data barely covered—zero-shot cross-lingual cloning without the usual massive multilingual speaker dataset requirement.

Key highlights

  • Accurate tone color cloning from short reference audio
  • Granular style control: emotion, accent, rhythm, pauses, intonation
  • Zero-shot cross-lingual: reference and target languages need not appear in training data
  • V2 trained with a different strategy for improved audio quality
  • MIT licensed since April 2024; free for commercial use
  • Powers myshell.ai’s voice cloning, reportedly used tens of millions of times by Nov 2023
  • Built on VITS/VITS2 and Coqui TTS foundations

Caveats

  • The README is high-level; actual setup complexity and hardware requirements live in separate docs (USAGE.md, QA.md) not shown here
  • “Better audio quality” in V2 is claimed but no objective metrics or comparisons are provided
  • Acknowledges heavy reliance on prior TTS/VITS work—this is evolutionary engineering, not from-scratch architecture

Verdict Worth a look if you need flexible, commercially usable voice cloning with style knobs. Skip if you want a polished one-click product; this is a research codebase with docs you actually have to read.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.