← all repositories

lucidrains/PaLM-pytorch

A PyTorch implementation of the PaLM (Pathways Language Model) Transformer architecture for scaled language modeling.

824 stars Python Language ModelsML Frameworks
PaLM-pytorch
Velocity · 7d
+0.5
★ / day
Trend
steady
star history

This repository provides a clean implementation of the PaLM Transformer architecture in under 200 lines of code. It replicates the key components of Google’s Pathways Language Model including SwiGLU activation, parallel attention layers, and RoPE embeddings. The implementation supports configurable model dimensions up to the 540B parameter scale described in the original paper and includes training scripts for language modeling benchmarks like Enwik8.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.