
tunz/transformer-pytorch

A PyTorch implementation of the Transformer neural network architecture for sequence-to-sequence tasks like machine translation.

492 stars · Python · Language Models · ML Frameworks
Star velocity (7d): +0.2 ★ / day · Trend: steady

This repository provides a reference implementation of the Transformer model in PyTorch, following the architecture of the original “Attention Is All You Need” paper. It supports training on standard NLP benchmarks, including WMT32k (German-English translation) and lm1b (language modeling). Users can train a model and then use the decoder script to translate sentences with it.
