chengzeyi/ParaAttention
Context-parallel attention library that accelerates diffusion transformer model inference with dynamic caching.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
ParaAttention implements Ulysses and Ring style parallelism for attention computation to speed up DiT model inference. It supports dynamic caching strategies and integrates with popular diffusion libraries including Hugging Face diffusers. The library specifically targets image and video generation models such as FLUX-dev and Wan 2.1 for text-to-image, text-to-video, and image-to-video tasks.