PixArt-alpha/PixArt-sigma
A diffusion transformer model that generates high-resolution 4K images from text prompts.

Velocity (7d): +2.3 ★ / day · Trend: steady
PixArt-Σ is a text-to-image generation model built on a diffusion transformer architecture. It is trained with a weak-to-strong methodology: rather than training from scratch, it starts from the weaker PixArt-α model and continues training to reach a stronger one. The repository provides PyTorch model definitions, pre-trained weights, and inference/sampling code for generating images at up to 4K resolution from text descriptions.
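
As a rough illustration of the inference path, the sketch below loads a PixArt-Σ checkpoint through the Hugging Face `diffusers` library instead of the repository's own sampling scripts. It rests on several assumptions: that `torch` and `diffusers` are installed, that the `PixArt-alpha/PixArt-Sigma-XL-2-1024-MS` checkpoint (a 1024px model, not the 4K one) is the one wanted, and that a CUDA GPU is available. The helper name `generate_image` and its default values are hypothetical, chosen only for this example.

```python
def generate_image(
    prompt,
    model_id="PixArt-alpha/PixArt-Sigma-XL-2-1024-MS",  # assumed checkpoint (1024px, not 4K)
    steps=20,       # illustrative default, not a recommendation from the repo
    seed=0,
    device="cuda",
):
    """Generate one image from a text prompt (hypothetical helper)."""
    # Heavy imports are kept inside the function so that defining it
    # does not require torch or diffusers to be installed.
    import torch
    from diffusers import PixArtSigmaPipeline

    # Half precision on GPU to reduce memory use; full precision otherwise.
    dtype = torch.float16 if device.startswith("cuda") else torch.float32

    # Downloads the weights on first use and moves the pipeline to the device.
    pipe = PixArtSigmaPipeline.from_pretrained(model_id, torch_dtype=dtype)
    pipe = pipe.to(device)

    # A seeded generator makes sampling repeatable on the same setup.
    generator = torch.Generator(device="cpu").manual_seed(seed)

    result = pipe(prompt, num_inference_steps=steps, generator=generator)
    return result.images[0]  # a PIL image


# Example call (downloads several GB of weights on first run):
#   image = generate_image("a small cactus wearing a straw hat in the desert")
#   image.save("pixart_sigma_sample.png")
```

Higher-resolution checkpoints, including the 4K one, may need the repository's own scripts or a different model ID; check the repository's documentation before relying on this sketch.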