← all repositories

conceptofmind/LaMDA-rlhf-pytorch

Open-source PyTorch implementation of Google's LaMDA language model with RLHF training.

468 stars Python Language Models
LaMDA-rlhf-pytorch
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

This repository provides a pre-training implementation of Google’s LaMDA model architecture in PyTorch. It replicates key architectural features including T5 relative positional bias, gated GELU activations, and a decoder-only autoregressive design. The project incorporates reinforcement learning from human feedback (RLHF) to enable training similar to ChatGPT, alongside HuggingFace dataset integration for efficient data loading and a GPT-2 compatible tokenizer.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.