Kyubyong/transformer
TensorFlow implementation of the Transformer architecture for German-to-English machine translation.

Velocity (7d): +1.4 ★ / day
Trend: steady
This repository provides a reference implementation of the Transformer model introduced in the 2017 paper "Attention Is All You Need". It implements the encoder-decoder architecture, built on self-attention, for sequence-to-sequence tasks, specifically machine translation. The code targets TensorFlow 1.12 and trains on the IWSLT 2016 German-English parallel corpus. It includes data preprocessing with BPE tokenization and exposes configurable hyperparameters.
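As a rough illustration of the self-attention mechanism mentioned above, here is a minimal sketch of scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V, as defined in the paper. It is not taken from the repository: it uses plain Python instead of TensorFlow, the function names `softmax` and `scaled_dot_product_attention` are hypothetical, and it omits the learned projections, multiple heads, and masking that a full model would need.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of vectors.

    Hypothetical helper for illustration only; not the repository's API.
    Returns (outputs, weights), one row per query.
    """
    d_k = len(keys[0])
    scale = math.sqrt(d_k)
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        w = softmax(scores)
        weights.append(w)
        # Weighted average of the value vectors.
        outputs.append(
            [sum(wj * v[i] for wj, v in zip(w, values))
             for i in range(len(values[0]))]
        )
    return outputs, weights


# Toy self-attention: the same sequence serves as queries, keys, and values.
# A real model would first apply learned linear projections to each role.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = scaled_dot_product_attention(tokens, tokens, tokens)
```

Each row of `weights` sums to 1 and describes how much one position attends to every other position; `outputs` holds the corresponding weighted averages of the value vectors.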