← all repositories

kuleshov-group/mdlm

A research project introducing MDLM, a masked discrete diffusion language model with substitution-based parameterization published at NeurIPS 2024.

696 stars Python Language Models
mdlm
Velocity · 7d
+1.0
★ / day
Trend
steady
star history

The repository implements a novel masked diffusion approach for training language models. Instead of traditional autoregressive training, it uses a substitution-based parameterization that simplifies the absorbing state diffusion loss into a mixture of masked language modeling losses. The model achieves state-of-the-art perplexity results and offers an alternative to standard language model training approaches using diffusion-based denoising.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.