hkproj/pytorch-transformer

A PyTorch implementation of the Transformer architecture introduced in the 2017 paper "Attention Is All You Need".

1.2k stars · Jupyter Notebook · Learning · Language Models
Velocity · 7d: +1.1 ★ / day
Trend: steady

This repository contains a step-by-step implementation of the Transformer model, the neural network architecture that powers modern large language models. Built in PyTorch as a Jupyter Notebook, it serves as a learning resource for understanding self-attention mechanisms, encoder-decoder structures, and positional encodings. The implementation is accompanied by a YouTube walkthrough video.
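As one illustration of the concepts listed above, here is a minimal sketch of the sinusoidal positional encoding described in the original paper. It is not taken from the repository: the function name `sinusoidal_positional_encoding` and its parameters are assumptions made for this example, and it uses plain Python rather than PyTorch so it runs without extra dependencies.

```python
import math


def sinusoidal_positional_encoding(seq_len, d_model):
    """Return a seq_len x d_model table (list of lists) of positional encodings.

    Hypothetical helper for illustration only; assumes d_model is even.
    """
    if d_model % 2 != 0:
        raise ValueError("this sketch assumes an even d_model")
    table = []
    for pos in range(seq_len):
        row = [0.0] * d_model
        # i walks over the even dimensions; each (i, i + 1) pair shares a frequency.
        for i in range(0, d_model, 2):
            angle = pos / (10000 ** (i / d_model))
            row[i] = math.sin(angle)      # even dimension: sine
            row[i + 1] = math.cos(angle)  # odd dimension: cosine
        table.append(row)
    return table


pe = sinusoidal_positional_encoding(seq_len=4, d_model=8)
```

At position 0 every angle is zero, so the first row alternates between 0.0 (sine) and 1.0 (cosine). In a Transformer, a table like this is added to the token embeddings so that the otherwise order-agnostic attention layers can distinguish positions.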