← all repositories

LTH14/JiT

A PyTorch implementation of JiT, a minimalist transformer-based diffusion model for high-resolution image generation.

2.4k stars Python Image · Video · Audio
JiT
Velocity · 7d
+11
★ / day
Trend
steady
star history

JiT is a pixel-space diffusion model that denoises images directly in pixel space rather than using latent compression. This PyTorch re-implementation reproduces the research paper “Back to Basics: Let Denoising Generative Models Denoise” by Li et al., originally implemented in JAX/TPU. The model supports training on ImageNet at 256x256 resolution with transformer architectures such as JiT-B/16.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.