← all repositories
awslabs/sockeye

AWS's NMT toolkit is now in maintenance-only mode

A battle-tested PyTorch seq2seq framework that once powered Amazon Translate, now frozen in place.

1.2k stars Python ML FrameworksLanguage Models
sockeye
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

What it does Sockeye is a sequence-to-sequence framework built specifically for Neural Machine Translation on PyTorch. It handles distributed training and optimized inference for transformer-based models, and it has been the engine behind Amazon Translate and other production MT systems.

The interesting bit The project just completed a long, awkward migration from MXNet to PyTorch. Version 3.0 ran both frameworks side-by-side with a conversion CLI (sockeye.mx_to_pt); version 3.1.x dropped MXNet entirely. The catch: models converted from MXNet via 3.0’s tool are incompatible with 3.1.x. So much for backward compatibility.

Key highlights

  • Powers production workloads (Amazon Translate) and a long list of academic papers
  • Supports distributed training and GPU-optimized inference
  • Includes a WMT 2014 English-German tutorial for quickstarts
  • Optional NVIDIA Apex integration for faster GPU training
  • Extensive documentation and developer guidelines available

Caveats

  • Maintenance mode only: no new features are being added
  • MXNet-to-PyTorch migration path is broken between 3.0.x and 3.1.x for converted models
  • Continued training of converted MXNet models was never supported (optimizer states don’t transfer)

Verdict Worth studying if you’re maintaining legacy NMT infrastructure or need a stable, well-documented PyTorch baseline. Skip it if you want active development or modern LLM-based approaches; this is a finished monument, not a moving train.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.