Upscaling for mortals: no training data required
A practical super-resolution model that learns real-world degradation from pure synthetic data, with dedicated anime support.

What it does Real-ESRGAN upscales and restores images and video using a GAN-based approach. It ships with pre-trained models for general photos, anime stills, and anime video, plus optional face enhancement via GFPGAN integration. You can run it through Python, portable NCNN executables (no PyTorch/CUDA needed), or online demos.
The interesting bit The model is trained entirely on synthetic data that simulates real-world corruption—JPEG artifacts, noise, blur—rather than collecting expensive paired high/low-res datasets. There’s a dedicated, compact anime model (6B parameters vs the full-size version) and even a tiny “general-x4v3” model with adjustable denoising strength.
Key highlights
- Portable NCNN binaries for Windows/Linux/macOS that run on Intel/AMD/Nvidia GPUs without installing anything
- Tile-based inference for large images, plus support for alpha channels, grayscale, and 16-bit depth
- Arbitrary output scales via cheap LANCZOS4 resize after the GAN pass
- Training code and fine-tuning guides published; built on the BasicSR toolbox
- Face enhancement mode via GFPGAN integration
Caveats
- The NCNN executables may show block inconsistencies at tile boundaries and differ slightly from PyTorch outputs
- Portable executables lack some Python features like
outscale
Verdict Grab this if you need practical upscaling without managing a training pipeline—especially for anime content or batch photo restoration. Skip if you need pixel-perfect scientific upscaling or can’t tolerate occasional tile artifacts in the portable builds.