← all repositories
rushindrasinha/youtube-shorts-pipeline

YouTube Shorts factory: one command, 11 cents, zero creativity required

Verticals v3 automates the entire pipeline from news research to uploaded video using niche-tuned AI profiles.

2k stars Python AgentsCreative · Design
youtube-shorts-pipeline
Velocity · 7d
+19
★ / day
Trend
steady
star history

What it does

Feed it a topic and a niche—python -m verticals run --topic "..." --niche tech—and it researches via DuckDuckGo, writes a script with hook patterns, generates B-roll images, records a voiceover, burns in animated captions, adds mood-matched music, creates a thumbnail, and uploads to YouTube. Wall clock time is about three minutes; API cost is roughly $0.11 per video at default settings. It can also run entirely free using local Ollama + Edge TTS.

The interesting bit

The “niche intelligence” system is the actual engine. Every stage reads from a YAML profile that shapes tone, visual style, caption fonts, music mood, and thumbnail strategy without you writing a single prompt. The tech niche forbids “what’s up guys” and demands contrarian hooks; the cooking niche generates food photography and warm handwritten captions. Fifteen niches ship built-in, and you can author your own by editing a YAML file.

Key highlights

  • Multi-provider LLM support: Claude, Gemini, GPT, Ollama local, or Claude CLI (free with Max subscription)
  • Three TTS tiers: free Edge TTS (300+ voices, recommended default), premium ElevenLabs, or macOS say fallback
  • Anti-hallucination gate: LLM is instructed to use only facts from live DuckDuckGo research, not training knowledge
  • ffmpeg assembly with Ken Burns motion, automatic voice ducking, and word-level Whisper captions in ASS + SRT
  • Resumable stages and individual stage CLI commands (draft, produce, upload)
  • YouTube upload is private-by-default; TikTok, Reels, and X export are v3.1 roadmap items

Caveats

  • Gradio UI, Docker, Colab, and Kokoro TTS are explicitly listed as “not shipped yet”—roadmap only
  • Image generation relies on Gemini Imagen with solid-color fallback frames if it fails; no Pexels or Replicate integration yet
  • The repo went through a “credibility cleanup” (v3.0.1) after a viral traffic spike, suggesting the docs previously oversold capabilities

Verdict

Worth a look if you run a faceless channel, test content hypotheses, or just want to see how far you can push automated ffmpeg pipelines. Skip it if you care about platform diversity today or find the idea of AI-generated “contrarian takes” existentially depressing.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.