microsoft/StyleSwin
A Transformer-based GAN architecture for high-resolution image synthesis, with results reported as state-of-the-art on CelebA-HQ and competitive with StyleGAN on FFHQ.

This repository provides the official PyTorch implementation of the StyleSwin paper. StyleSwin is a generative adversarial network whose generator replaces the convolutional layers of StyleGAN with Swin Transformer blocks. Because Swin blocks compute attention only within local windows, the cost of attention stays manageable as resolution grows, and the style-based design retains StyleGAN's style-mixing capability. The model supports generation at multiple resolutions, including 256x256 and 1024x1024.
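
To make the idea of local (windowed) attention concrete, here is a minimal sketch in PyTorch. It is not the code from this repository: the function names window_partition, window_reverse, and local_window_attention are illustrative, the attention is single-head, and the query/key/value projections are replaced by the identity for brevity. It only shows that each position attends to positions inside its own window.

```python
import torch


def window_partition(x, window_size):
    """Split a (B, H, W, C) feature map into non-overlapping windows.

    Returns a tensor of shape (B * num_windows, window_size**2, C).
    Assumes H and W are divisible by window_size.
    """
    B, H, W, C = x.shape
    x = x.view(B, H // window_size, window_size, W // window_size, window_size, C)
    x = x.permute(0, 1, 3, 2, 4, 5).contiguous()
    return x.view(-1, window_size * window_size, C)


def window_reverse(windows, window_size, H, W):
    """Inverse of window_partition: rebuild the (B, H, W, C) feature map."""
    C = windows.shape[-1]
    B = windows.shape[0] // ((H // window_size) * (W // window_size))
    x = windows.view(B, H // window_size, W // window_size, window_size, window_size, C)
    x = x.permute(0, 1, 3, 2, 4, 5).contiguous()
    return x.view(B, H, W, C)


def local_window_attention(x, window_size):
    """Single-head self-attention restricted to each window.

    Simplification (assumption): Q, K, and V are the input itself,
    with no learned projections, relative position bias, or shifting.
    """
    B, H, W, C = x.shape
    windows = window_partition(x, window_size)            # (B*nW, N, C)
    scores = windows @ windows.transpose(-2, -1) / C ** 0.5
    attn = torch.softmax(scores, dim=-1)                  # (B*nW, N, N)
    out = attn @ windows                                  # (B*nW, N, C)
    return window_reverse(out, window_size, H, W)


if __name__ == "__main__":
    feats = torch.randn(2, 8, 8, 16)   # batch of 2, 8x8 map, 16 channels
    out = local_window_attention(feats, window_size=4)
    print(out.shape)                   # same shape as the input
```

With a fixed window size, the attention matrix per window has a constant size, so the total cost grows linearly with the number of pixels rather than quadratically, which is what makes this kind of attention practical at resolutions such as 1024x1024.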