
chenyme/Chenyme-AAVT

A fully automated pipeline that transcribes audio with Whisper, translates subtitles using GPT-4/4o, and merges them into a translated video.

3.1k stars · Python · Data Tooling · Language Models
Velocity (7d): +3.4 ★/day · Trend: steady

This project automates video translation by chaining three stages: it first uses faster-whisper to transcribe spoken audio into text, then sends the transcript to GPT-4 or GPT-4o for subtitle translation, and finally muxes the translated subtitles back into the original video. It supports batch processing and is distributed as a Python package, a Docker image, and a Google Colab notebook.
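The three-stage flow described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names `Segment`, `translate_segments`, `to_srt`, and `mux_command` are hypothetical, the translator is passed in as a plain callable standing in for a GPT-4/4o request, and the ffmpeg invocation assumes soft subtitles in an MP4 container (the project may instead burn subtitles into the frames).

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Segment:
    """One timed line of speech, as a transcriber might return it (hypothetical type)."""
    start: float  # seconds from the beginning of the audio
    end: float    # seconds from the beginning of the audio
    text: str


def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp of the form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def translate_segments(
    segments: Iterable[Segment], translate: Callable[[str], str]
) -> List[Segment]:
    """Stage 2: translate each segment's text while keeping its timing.

    `translate` is an assumption: any function mapping source text to
    translated text, for example a wrapper around an LLM API call.
    """
    return [Segment(s.start, s.end, translate(s.text)) for s in segments]


def to_srt(segments: Iterable[Segment]) -> str:
    """Serialize segments as SubRip (SRT) text, numbering cues from 1."""
    blocks = []
    for index, s in enumerate(segments, start=1):
        timing = f"{format_timestamp(s.start)} --> {format_timestamp(s.end)}"
        blocks.append(f"{index}\n{timing}\n{s.text.strip()}\n")
    return "\n".join(blocks)


def mux_command(video_in: str, srt_path: str, video_out: str) -> List[str]:
    """Stage 3: build an ffmpeg command that adds the SRT as a soft subtitle track.

    Assumes an MP4 output, which needs the mov_text subtitle codec.
    The command is only constructed here, not executed.
    """
    return [
        "ffmpeg", "-i", video_in, "-i", srt_path,
        "-c", "copy", "-c:s", "mov_text", video_out,
    ]
```

In a real run, stage 1 would populate the `Segment` list from the transcriber's output, and the list returned by `mux_command` could be handed to `subprocess.run`. Keeping the translator as an injected callable makes it possible to test the timing and SRT logic without any network access.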
