← all repositories

LantaoYu/SeqGAN

A TensorFlow implementation of SeqGAN for generating sequences of discrete tokens using GANs with policy gradient reinforcement learning.

2.1k stars Python Language ModelsML Frameworks
SeqGAN
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

This repository implements Sequence Generative Adversarial Nets with Policy Gradient, applying GANs to discrete token sequence generation. The generator uses policy gradient where the discriminator provides reward signals passed back via Monte Carlo search. The code includes a two-stage training process: supervised pre-training with maximum likelihood estimation, followed by adversarial training to improve the generator.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.