← all repositories
Ceeon/videocut-skills

Claude Code becomes your video editor's unpaid intern

A Chinese-language talking-head video agent that actually understands when you flub a line, unlike Jianying's pattern-matching approach.

1.9k stars JavaScript AgentsData Tooling
videocut-skills
Velocity · 7d
+13
★ / day
Trend
steady
star history

What it does

Videocut Skills is a Claude Code Skill (think plugin) that automates the tedious cleanup of Chinese talking-head videos — the ums, repeated sentences, dead air, and verbal stumbles that editors usually hunt through manually. It transcribes via Volcano Engine ASR, runs semantic analysis through Claude, presents cuts in a browser for human approval, then fires FFmpeg to do the actual surgery.

The interesting bit

The author built this because Jianying’s “smart cut” only does pattern matching — it can’t tell when you’ve said something twice with slightly different wording, or when you started a sentence wrong and corrected yourself. The agent uses Claude to actually parse meaning, plus a custom dictionary file for tech terms like “MCP” that ASR routinely botches. There’s even a “self-update” skill where you tell it preferences (“keep some ums as transitions”) and it remembers.

Key highlights

  • Detects 8 distinct problem types: silence >0.3s, repeated sentence openings, intra-sentence repetition, filler words, stutters, corrections, and more
  • One documented run: 19-minute raw video → 608 detected issues → AI-assisted cutdown
  • Browser-based review UI with click-to-play, double-click toggle, shift-drag multi-select
  • Optional 2-pass sharpening export and Whisper-based subtitle burn-in with dictionary correction
  • Rules live as markdown files the LLM reads — no retraining, just editing text

Caveats

  • Requires Volcano Engine API key (Chinese cloud ASR), plus ~5GB of local models (FunASR + Whisper large-v3)
  • Audio uploads to uguu.se by default — you’ll want to check that privacy tradeoff
  • Port 8899 hardcoded for the review server; conflicts mean manual debugging

Verdict

Chinese-speaking developers who regularly edit their own talking-head content should try this — it genuinely reduces the cognitive load of finding “the good take.” Everyone else, or anyone allergic to cloud ASR dependencies, can wait for a more portable fork.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.