Open-source MT that admits its own test scores lie
A ready-to-run neural translation stack built on Marian-NMT, with pre-trained models for hundreds of language pairs and a Tornado web UI that gets you translating in minutes.

What it does
Opus-MT packages neural machine translation into a runnable service: download models, point a JSON config at them, and serve translations through a browser UI or API. It wraps Marian-NMT with either a Tornado web app (with a Wikimedia pedigree) or an experimental WebSocket server, and ships hundreds of CC-BY 4.0 pre-trained models covering an impressive sprawl of language pairs.
The interesting bit
The authors are refreshingly blunt about quality. The README lists five known issues unprompted: test scores are inflated by simple Tatoeba sentences, some training data leaks into test sets, there’s no filtering or backtranslation, many models hit a 72-hour GPU wall before converging, and validation data often doesn’t match real use. That’s unusual transparency in an MT project.
Key highlights
- Pre-trained models for hundreds of language pairs, CC-BY 4.0 licensed, downloadable from a matrix view
- Two service setups: Tornado web app with UI/API, or lighter WebSocket server with experimental extensions
- Docker and Docker+GPU support for quick deployment; manual setup needs Marian-NMT compiled with
-DCOMPILE_SERVER=ON - SentencePiece segmentation and eflomal word alignments as standard pipeline
- Training scripts exist but are currently hardcoded to University of Helsinki/CSC infrastructure
Caveats
- Training your own models is basically non-portable right now; the Makefile assumes their specific environment
- The authors explicitly warn that automatic evaluation scores are “too optimistic” on realistic data
- No quality control beyond automatic tests; no domain adaptation, data augmentation, or filtering in current models
Verdict
Good fit if you need quick, no-strings-attached translation for many language pairs and can tolerate the quality caveats. Skip it if you need production-grade MT with reliability guarantees, or if you want to train custom models without access to Finnish supercomputing infrastructure.