← all repositories

chengzeyi/ParaAttention

Context-parallel attention library that accelerates diffusion transformer model inference with dynamic caching.

ParaAttention
Velocity · 7d
+0.7
★ / day
Trend
steady
star history

ParaAttention implements Ulysses and Ring style parallelism for attention computation to speed up DiT model inference. It supports dynamic caching strategies and integrates with popular diffusion libraries including Hugging Face diffusers. The library specifically targets image and video generation models such as FLUX-dev and Wan 2.1 for text-to-image, text-to-video, and image-to-video tasks.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.