360CVGroup/Qihoo-T2X
An efficient Diffusion Transformer model for text-to-any generation tasks, published at ICLR 2025.
★446 stars Image · Video · Audio

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
Qihoo-T2X is an efficient Diffusion Transformer architecture designed for text-to-any generation tasks, including text-to-image synthesis. The approach uses proxy-tokenized diffusion to improve computational efficiency while maintaining generation quality. The model was accepted for presentation at ICLR 2025.