hkproj/pytorch-transformer
A PyTorch implementation of the Transformer architecture introduced in the 2017 'Attention is all you need' paper.

Velocity · 7d
+1.1
★ / day
Trend
→steady
star history
This repository contains a step-by-step implementation of the Transformer model, the neural network architecture that powers modern large language models. Built in PyTorch as a Jupyter Notebook, it serves as a learning resource for understanding self-attention mechanisms, encoder-decoder structures, and positional encodings. The implementation is accompanied by a YouTube walkthrough video.