Is VideoLingo open source?

Yes — Huanshere/VideoLingo is open source, released under the Apache-2.0 license.

What language is VideoLingo written in?

Huanshere/VideoLingo is primarily written in Python.

How popular is VideoLingo?

Huanshere/VideoLingo has 17.8k stars on GitHub and is currently cooling off.

Where can I find VideoLingo?

Huanshere/VideoLingo is on GitHub at https://github.com/Huanshere/VideoLingo.

Huanshere/VideoLingo

The Pipeline That Refuses to Do Multi-Line Subtitles

It automates video transcription, translation, and dubbing while stubbornly insisting on single-line, human-readable subtitles.

★ 17.8k stars · Python · Domain: Apps · Image · Video · Audio

Velocity (7d): +14 ★ / day. Trend: cooling.

What it does

VideoLingo ingests a video, often pulled straight from YouTube via yt-dlp, and runs it through word-level transcription with WhisperX, a three-step translate, reflect, and adapt stage, and optional dubbing. It outputs Netflix-style single-line subtitles, paired with synthesized audio when dubbing is enabled, all wrapped in a Streamlit interface that lets you pause, resume, or swap models mid-process. You can run the stack without paid API keys by pairing Ollama with Edge-TTS (Edge-TTS itself calls Microsoft's online speech service, so that setup is not fully offline), or hand off the heavy lifting to commercial APIs.
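To make the pause-and-resume idea concrete, here is a minimal sketch of a staged pipeline that caches each stage's result on disk and skips finished stages on a rerun. It is not VideoLingo's code: run_pipeline, the Stage type, the JSON cache layout, and the stand-in transcribe and translate functions are all hypothetical names chosen for illustration.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable

# Hypothetical stage type: takes the previous stage's payload, returns a new one.
Stage = Callable[[dict], dict]


def run_pipeline(stages: list[tuple[str, Stage]], workdir: Path, payload: dict) -> dict:
    """Run stages in order, caching each result as <workdir>/<name>.json.

    A stage whose cache file already exists is skipped, so a rerun
    resumes from the first stage that has not finished.
    """
    workdir.mkdir(parents=True, exist_ok=True)
    for name, stage in stages:
        cache = workdir / f"{name}.json"
        if cache.exists():
            # Reuse the stored result instead of recomputing it.
            payload = json.loads(cache.read_text(encoding="utf-8"))
            continue
        payload = stage(payload)
        cache.write_text(json.dumps(payload, ensure_ascii=False), encoding="utf-8")
    return payload


# Stand-in stages (assumptions); a real setup would call a transcriber and an LLM.
calls: list[str] = []


def transcribe(p: dict) -> dict:
    calls.append("transcribe")
    return {**p, "text": "hello world"}


def translate(p: dict) -> dict:
    calls.append("translate")
    return {**p, "translation": p["text"].upper()}


demo_stages = [("transcribe", transcribe), ("translate", translate)]
demo_dir = Path(tempfile.mkdtemp())
first = run_pipeline(demo_stages, demo_dir, {"source": "demo.mp4"})
second = run_pipeline(demo_stages, demo_dir, {"source": "demo.mp4"})  # read from cache
```

The second call does no work of its own: every stage finds its cache file and returns the stored payload. Deleting one cache file would force that stage, and only that stage, to run again with the preceding stage's cached output.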

The interesting bit

The project treats subtitle formatting as a first-class problem, not an afterthought: it uses NLP segmentation and a custom terminology engine to prevent the usual machine-translation awkwardness, and it actively rejects multi-line subtitles as a matter of policy. That obsession with line breaks is oddly refreshing in a space that usually ignores readability.
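As a rough illustration of what a single-line policy implies, the sketch below splits a sentence into cues that each fit on one line. It is a plain word-boundary heuristic, not the NLP segmentation the project uses; split_single_line is a hypothetical helper, and the 42-character default is an assumption borrowed from common subtitle guidelines.

```python
import re


def split_single_line(text: str, max_chars: int = 42) -> list[str]:
    """Split text into cues that each fit on one line of at most max_chars.

    Breaks at the last word boundary that fits, and closes a cue early
    at punctuation once it is at least half full. A single word longer
    than max_chars is kept whole rather than cut mid-word.
    """
    cues: list[str] = []
    current: list[str] = []
    for word in text.split():
        candidate = " ".join(current + [word])
        if current and len(candidate) > max_chars:
            # The next word would overflow the line, so start a new cue.
            cues.append(" ".join(current))
            current = [word]
        else:
            current.append(word)
        # Prefer to end a cue at sentence or clause punctuation.
        if re.search(r"[.!?;,]$", word) and len(" ".join(current)) >= max_chars // 2:
            cues.append(" ".join(current))
            current = []
    if current:
        cues.append(" ".join(current))
    return cues


sample = (
    "The quick brown fox jumps over the lazy dog, "
    "and then it keeps running through the field."
)
cues = split_single_line(sample, max_chars=30)
```

Each cue is a single string with no embedded newline, and joining the cues with spaces gives back the original sentence, so nothing is dropped or reordered.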

Key highlights

Enforces single-line subtitles only, aiming for Netflix-grade readability
Three-step “Translate-Reflect-Adaptation” pipeline with AI-generated terminology glossaries
Word-level alignment via WhisperX, with voice separation for noisy sources
Supports multiple TTS backends including GPT-SoVITS, Azure, OpenAI, and custom integrations
Task-level pause/resume controls and progress resumption via detailed logging
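The three-step translation highlight can be sketched as three chained model calls, each one fed the output of the step before it. This is an assumption about the general shape, not the project's prompts or code: translate_reflect_adapt, the LLM callable type, and the prompt wording are all hypothetical, and a fake model stands in so the flow runs without an API key.

```python
from typing import Callable

# Hypothetical LLM interface: prompt in, text out.
LLM = Callable[[str], str]


def translate_reflect_adapt(source: str, glossary: dict[str, str], llm: LLM) -> dict[str, str]:
    """Three passes over one line: draft, critique, then final rewrite."""
    terms = "; ".join(f"{src} => {dst}" for src, dst in glossary.items())
    # Step 1: faithful draft, with the glossary pinned in the prompt.
    draft = llm(f"Translate faithfully. Glossary: {terms}\nText: {source}")
    # Step 2: ask the model to criticise its own draft.
    critique = llm(f"List problems with this translation.\nSource: {source}\nDraft: {draft}")
    # Step 3: rewrite using both the draft and the critique.
    final = llm(
        "Rewrite the draft so it reads naturally, applying the critique.\n"
        f"Source: {source}\nDraft: {draft}\nCritique: {critique}"
    )
    return {"draft": draft, "critique": critique, "final": final}


# A fake model that records prompts, so the chaining can be inspected.
prompts: list[str] = []


def fake_llm(prompt: str) -> str:
    prompts.append(prompt)
    return f"reply-{len(prompts)}"


result = translate_reflect_adapt("Ship the build", {"build": "构建"}, fake_llm)
```

Passing the model in as a plain callable keeps the sketch backend-agnostic: the same function would work with a local model or a hosted API, as long as it maps a prompt string to a reply string.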

Caveats

WhisperX struggles with background noise and can truncate subtitles ending in numbers or special characters
Multilingual source videos will lose non-dominant languages during transcription
Weaker LLMs may crash the pipeline with JSON parsing errors, requiring you to wipe the output folder and restart
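The JSON caveat above is a common failure mode with smaller models, which tend to wrap JSON in prose or Markdown fences. One general mitigation, sketched here with hypothetical helpers extract_json and ask_for_json (not functions from VideoLingo), is to pull the first object out of the reply and retry a bounded number of times before giving up.

```python
import json
import re
from typing import Callable


def extract_json(reply: str) -> dict:
    """Parse the {...} object embedded in a model reply.

    Tolerates surrounding prose or a Markdown fence. Raises ValueError
    if no object is present or the object is not valid JSON.
    """
    match = re.search(r"\{.*\}", reply, flags=re.DOTALL)
    if not match:
        raise ValueError("no JSON object in reply")
    # json.JSONDecodeError subclasses ValueError, so callers catch one type.
    return json.loads(match.group(0))


def ask_for_json(llm: Callable[[str], str], prompt: str, retries: int = 2) -> dict:
    """Call the model up to retries + 1 times until it returns parseable JSON."""
    last_error: Exception | None = None
    for _ in range(retries + 1):
        try:
            return extract_json(llm(prompt))
        except ValueError as exc:
            last_error = exc
    raise RuntimeError(f"model never returned valid JSON: {last_error}")


# Fake model: the first reply is chatter, the second wraps JSON in a fence.
replies = iter(["Sure! Here you go.", '```json\n{"ok": true}\n```'])
attempts = 0


def flaky_llm(prompt: str) -> str:
    global attempts
    attempts += 1
    return next(replies)


parsed = ask_for_json(flaky_llm, "Return JSON")
```

A retry loop like this trades a few extra model calls for not having to restart a long run; it does not help if the model never produces valid JSON, in which case the RuntimeError surfaces the last parse error.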

Verdict

Content creators and localization teams drowning in manual subtitle work should take a look. If you need reliable handling of multilingual source videos or flawless multi-character dubbing, VideoLingo isn’t there yet.
