← all repositories
Helsinki-NLP/Opus-MT

Open-source MT that admits its own test scores lie

A ready-to-run neural translation stack built on Marian-NMT, with pre-trained models for hundreds of language pairs and a Tornado web UI that gets you translating in minutes.

Opus-MT
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

What it does

Opus-MT packages neural machine translation into a runnable service: download models, point a JSON config at them, and serve translations through a browser UI or API. It wraps Marian-NMT with either a Tornado web app (with a Wikimedia pedigree) or an experimental WebSocket server, and ships hundreds of CC-BY 4.0 pre-trained models covering an impressive sprawl of language pairs.

The interesting bit

The authors are refreshingly blunt about quality. The README lists five known issues unprompted: test scores are inflated by simple Tatoeba sentences, some training data leaks into test sets, there’s no filtering or backtranslation, many models hit a 72-hour GPU wall before converging, and validation data often doesn’t match real use. That’s unusual transparency in an MT project.

Key highlights

  • Pre-trained models for hundreds of language pairs, CC-BY 4.0 licensed, downloadable from a matrix view
  • Two service setups: Tornado web app with UI/API, or lighter WebSocket server with experimental extensions
  • Docker and Docker+GPU support for quick deployment; manual setup needs Marian-NMT compiled with -DCOMPILE_SERVER=ON
  • SentencePiece segmentation and eflomal word alignments as standard pipeline
  • Training scripts exist but are currently hardcoded to University of Helsinki/CSC infrastructure

Caveats

  • Training your own models is basically non-portable right now; the Makefile assumes their specific environment
  • The authors explicitly warn that automatic evaluation scores are “too optimistic” on realistic data
  • No quality control beyond automatic tests; no domain adaptation, data augmentation, or filtering in current models

Verdict

Good fit if you need quick, no-strings-attached translation for many language pairs and can tolerate the quality caveats. Skip it if you need production-grade MT with reliability guarantees, or if you want to train custom models without access to Finnish supercomputing infrastructure.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.