← all repositories
akanimax/BMSG-GAN

Skip the progressive-growing song and dance

A PyTorch GAN that feeds multi-scale gradients directly from generator to discriminator, trading training choreography for stability.

BMSG-GAN
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

What it does

MSG-GAN synthesizes images by wiring intermediate generator layers straight into matching discriminator layers at multiple resolutions. Instead of the progressive-growing scheme from ProGAN—where layers fade in one by one—this architecture keeps all layers active from the start and lets gradients flow at every scale simultaneously. The discriminator concatenates downsampled real images with activation volumes from its main convolutional path.

The interesting bit

The README’s training time-lapse is the tell: early frames show solid color blocks at high resolutions that gradually synchronize brightness and detail across all scales. It’s a visual admission that the network starts chaotic and self-organizes, rather than the carefully staged growth of ProGAN. The authors claim this makes the model “robust” and less sensitive to hyperparameters; they recommend a single learning rate (0.003) for both generator and discriminator and default to relativistic-hinge loss.

Key highlights

  • Trained at resolutions from 32×32 (CIFAR-10) up to 1024×1024 (CelebA-HQ)
  • Supports standard GAN, WGAN-GP, LSGAN, hinge, and relativistic-hinge losses
  • Includes equalized learning rate and exponential moving average options
  • AWS SageMaker integration added
  • Not the official research repo—the authors direct paper readers to their TensorFlow StyleGAN-based implementation instead

Caveats

  • README notes this is not the official MSG-GAN paper code; that lives in a separate TensorFlow repository
  • Hardware requirements are steep: reported experiments used 2× Tesla V100 on a DGX-1
  • No pretrained model weights provided in this repo

Verdict

Worth a look if you want ProGAN-like multi-scale generation in PyTorch without managing progressive-growing schedules. Skip it if you need the official paper’s trained models or are GPU-poor.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.