Is Megatron-LM open source?

Yes — NVIDIA/Megatron-LM is an open-source project tracked on heatdrop.

What language is Megatron-LM written in?

NVIDIA/Megatron-LM is primarily written in Python.

How popular is Megatron-LM?

NVIDIA/Megatron-LM has 17.2k stars on GitHub and is currently accelerating.

Where can I find Megatron-LM?

NVIDIA/Megatron-LM is on GitHub at https://github.com/NVIDIA/Megatron-LM.

← all repositories

NVIDIA/Megatron-LM

NVIDIA's kitchen-sink transformer trainer, now with a library inside

Megatron-LM splits into a reference training stack and a composable core for anyone who needs to squeeze every FLOP from a GPU cluster.

★17.2k stars Python Language Models ML Frameworks Inference · Serving

View on GitHub ↗ Homepage ↗

Velocity · 7d

+14

★ / day

Trend

↗accelerating

star history

What it does

Megatron-LM is NVIDIA’s research codebase for training very large transformers across thousands of GPUs. It recently reorganized into two pieces: Megatron-LM (reference scripts for researchers and quick experiments) and Megatron Core (a composable library of GPU-optimized building blocks for framework builders). The Core handles the usual parallelism alphabet soup—TP, PP, DP, EP, CP—plus mixed precision down to FP4, and now exports to TensorRT-LLM.

The interesting bit

The project is refreshingly candid about its own benchmarking: it reports end-to-end MFU including data loading, logging, and checkpointing, not just the happy kernel path. It also notes that performance results were “measured without training to convergence”—a small honesty that saves you from chasing phantom numbers.

Key highlights

Benchmarked up to 462B parameters on 6,144 H100 GPUs, hitting 47% MFU
Superlinear weak scaling (41% → 47-48% MFU) as larger GEMMs get more efficient
Strong scaling a ~175B model from 96 to 4,608 GPUs drops MFU from 47% to 42%—communication overhead exposed
Megatron Bridge provides bidirectional Hugging Face ↔ Megatron checkpoint conversion
Now developed openly on GitHub with community contributions accepted

Caveats

Python 3.10 support is being dropped in the upcoming 0.17.0 release; minimum will be 3.12
Building from source can exhaust memory without limiting parallel compilation jobs
The README’s “model table” image is referenced but not described in text—what it actually compares is unclear

Verdict

Worth studying if you’re building a training framework or running large-scale LLM experiments on NVIDIA hardware. If you’re fine-tuning 7B models on a single A100, this is overkill with a side of configuration headache.

Frequently asked

What is NVIDIA/Megatron-LM?: Megatron-LM splits into a reference training stack and a composable core for anyone who needs to squeeze every FLOP from a GPU cluster.
Is Megatron-LM open source?: Yes — NVIDIA/Megatron-LM is an open-source project tracked on heatdrop.
What language is Megatron-LM written in?: NVIDIA/Megatron-LM is primarily written in Python.
How popular is Megatron-LM?: NVIDIA/Megatron-LM has 17.2k stars on GitHub and is currently accelerating.
Where can I find Megatron-LM?: NVIDIA/Megatron-LM is on GitHub at https://github.com/NVIDIA/Megatron-LM.