← all repositories

miccunifi/ladi-vton

A latent diffusion model enhanced with textual inversion that generates virtual try-on images of people wearing garments.

464 stars Python Image · Video · Audio
ladi-vton
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

LaDI-VTON is a virtual try-on system that synthesizes images of a target person wearing a given garment. It extends a latent diffusion model with a novel autoencoder module and learnable skip connections to enhance garment preservation and person fidelity. The approach uses textual inversion to enable garment-specific customization and was published at ACM Multimedia 2023.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.