Image · Video · Audio

underdogs breaking out

+92% /wk +252 ★/day→steady

It gives Claude Code and Codex a motion-design vocabulary—106 shot recipes, 161 previews, and a Remotion template—so they can direct cinematic product videos instead of generic slideshows.

★ 1.9k TypeScript Agents · explained

dramaclaw/dramaclaw

+70% /wk +211 ★/day↗accelerating

It bundles the entire AI drama workflow—script, storyboard, voice, and final cut—into a single pipeline you can host yourself.

★ 2.1k TypeScript Image · Video · Audio · explained Feature

pyang5166/gbro-collage-broll

+61% /wk +70 ★/day→steady

An agent skill that forces you to approve the visual metaphor and static layout before it spends your Gemini credits on video generation.

★ 807 Python Image · Video · Audio · explained Feature

img2threejs/img2threejs

+59% /wk +482 ★/day→steady

It exists to rebuild objects from reference photos as token-efficient, animation-ready Three.js code, using agent vision to gate quality before every sculpting pass.

★ 5.8k Python Image · Video · Audio · explained Feature

Orkas-AI/Orkas-VideoStudio

+26% /wk +20 ★/day→steady

OrkasVideoStudio gives coding agents a deterministic, local-first toolkit for composing, editing, and generating video from plain-language prompts.

★ 530 TypeScript Agents · explained

basketikun/infinite-canvas

+22% /wk +121 ★/day↗accelerating

A self-hostable infinite canvas that wires AI image generation, reference editing, and chat into one collaborative workspace.

★ 3.9k TypeScript Creative · Design · explained

AutoArk/GPA

+18% /wk +45 ★/day↗accelerating

GPA aims to unify speech recognition, text-to-speech, and voice conversion in one compact autoregressive model so you can stop juggling separate audio pipelines.

★ 1.7k Python Image · Video · Audio · explained

moonshine-ai/moonshine

+17% /wk +256 ★/day↗accelerating

An on-device voice toolkit that ditches the 30-second window and redundant re-processing that makes Whisper feel sluggish for live speech.

★ 10.5k C++ Image · Video · Audio · explained

palmier-io/palmier-pro

+13% /wk +240 ★/day↗accelerating

Palmier Pro exists to turn a Swift-native video editor into a shared workspace where AI agents can read and write the timeline via MCP.

★ 12.5k Swift Agents · explained Feature

jatinkrmalik/vocalinux

+12% /wk +12 ★/day↗accelerating

Vocalinux is a fully offline, GPLv3 voice dictation app that pipes transcribed text into any Linux application on X11 or Wayland.

★ 674 Python Image · Video · Audio · explained

PurpleDoubleD/locally-uncensored

+11% /wk +16 ★/day↗accelerating

A Tauri desktop app that auto-detects a dozen local AI backends so you don't have to wrestle with Docker or API keys.

★ 966 TypeScript Chat Assistants · explained

lidge-jun/ima2-gen

+11% /wk +9.9 ★/day↗accelerating

It exists because cloud image generators deserve a local memory layer, a branching canvas, and a UI outside the chat thread.

★ 614 TypeScript Image · Video · Audio · explained

techjarves/Uncensored-Local-Studio

+11% /wk +12 ★/day→steady

It unifies Stable Diffusion, GGUF chat, Whisper, and Kokoro TTS into a single offline desktop GUI so you can skip cloud APIs, subscriptions, and censorship filters.

★ 734 JavaScript Inference · Serving · explained

Emily2040/seedance-2.0

+11% /wk +83 ★/day↘cooling

A modular agent OS that directs Seedance 2.0 video generation with film-production discipline—shot contracts, continuity rules, and retake budgets—instead of vague prompt dumps.

★ 5.4k Python Agents · explained

VigoZhao/AI-Visual-Prompt-Cookbook

+9.1% /wk +6.7 ★/day→steady

Most AI image prompts are one-off text blobs; this repo distills 96 visual styles into structured JSON templates so you can swap variables without losing style direction.

★ 512 Python Image · Video · Audio · explained

xuanyustudio/LocalMiniDrama

+9.0% /wk +13 ★/day→steady

LocalMiniDrama wires your API keys into a Vue+Electron pipeline that turns story outlines into short-form AI video without shipping data to anyone else's cloud.

★ 973 JavaScript Image · Video · Audio · explained

HBAI-Ltd/Toonflow-app

+8.0% /wk +145 ★/day↗accelerating

Toonflow exists to turn a manuscript into an animated short drama without juggling five different browser tabs.

★ 12.7k TypeScript Image · Video · Audio · explained

StarTrail-org/PixelRAG

+7.5% /wk +78 ★/day↗accelerating

PixelRAG renders documents into screenshot tiles and retrieves them visually, preserving tables and layout that HTML parsers strip away.

★ 7.3k Python RAG · Search · explained

lightningpixel/modly

+7.0% /wk +46 ★/day↗accelerating

It wraps open-source image-to-3D models in a desktop app so your snapshots never leave your GPU.

★ 4.6k TypeScript Image · Video · Audio · explained

altic-dev/FluidVoice

+6.5% /wk +83 ★/day↘cooling

FluidVoice brings fully offline speech-to-text and AI-enhanced formatting to any macOS text field, offering an open-source alternative to cloud dictation services.

★ 8.9k Swift Image · Video · Audio · explained

loading more…