miccunifi/ladi-vton
A latent diffusion model enhanced with textual inversion that generates virtual try-on images of people wearing garments.

Velocity · 7d
+0.4
★ / day
Trend
→steady
star history
LaDI-VTON is a virtual try-on system that synthesizes images of a target person wearing a given garment. It extends a latent diffusion model with a novel autoencoder module and learnable skip connections to enhance garment preservation and person fidelity. The approach uses textual inversion to enable garment-specific customization and was published at ACM Multimedia 2023.