chenyme/Chenyme-AAVT
A fully automated pipeline that transcribes audio with Whisper, translates subtitles using GPT-4/4o, and merges them into a translated video.

Velocity · 7d
+3.4
★ / day
Trend
→steady
star history
This project automates video translation by chaining three AI stages: it first uses faster-whisper to transcribe spoken audio into text, then feeds the transcripts to GPT-4 or GPT-4o for subtitle translation, and finally muxes the translated subtitles back into the original video. It supports batch processing and is distributed as a Python package, Docker image, and Google Colab notebook.