← all repositories

mit-han-lab/radial-attention

A sparse attention mechanism with O(nlogn) complexity that accelerates diffusion-based video generation models while preserving output quality.

radial-attention
Velocity · 7d
+1.7
★ / day
Trend
steady
star history

Radial Attention implements a sparse attention pattern designed to reduce the computational overhead of transformer attention in video generation models. It achieves near-linear complexity scaling, enabling single-GPU video generation in 33-90 seconds on H100/4090 GPUs. The project integrates with multiple video generation frameworks including Wan2.1, HunyuanVideo, and Mochi-1, and is compatible with techniques like SageAttention and LoRA for further optimization.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.