tunz/transformer-pytorch
A PyTorch implementation of the Transformer neural network architecture for sequence-to-sequence tasks like machine translation.

Velocity · 7d
+0.2
★ / day
Trend
→steady
star history
This repository provides a reference implementation of the Transformer model in PyTorch, matching the original “Attention is All You Need” paper architecture. It supports training on standard NLP benchmarks including WMT32k (German-English translation) and lm1b datasets. Users can train the model and use a decoder script to translate sentences with a trained model.