Is Opus-MT open source?

Yes — Helsinki-NLP/Opus-MT is open source, released under the MIT license.

What language is Opus-MT written in?

Helsinki-NLP/Opus-MT is primarily written in Python.

How popular is Opus-MT?

Helsinki-NLP/Opus-MT has 836 stars on GitHub.

Where can I find Opus-MT?

Helsinki-NLP/Opus-MT is on GitHub at https://github.com/Helsinki-NLP/Opus-MT.

← all repositories

Helsinki-NLP/Opus-MT

Open-source MT that admits its own test scores lie

A ready-to-run neural translation stack built on Marian-NMT, with pre-trained models for hundreds of language pairs and a Tornado web UI that gets you translating in minutes.

★836 stars Python Language Models Inference · Serving

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does

Opus-MT packages neural machine translation into a runnable service: download models, point a JSON config at them, and serve translations through a browser UI or API. It wraps Marian-NMT with either a Tornado web app (with a Wikimedia pedigree) or an experimental WebSocket server, and ships hundreds of CC-BY 4.0 pre-trained models covering an impressive sprawl of language pairs.

The interesting bit

The authors are refreshingly blunt about quality. The README lists five known issues unprompted: test scores are inflated by simple Tatoeba sentences, some training data leaks into test sets, there’s no filtering or backtranslation, many models hit a 72-hour GPU wall before converging, and validation data often doesn’t match real use. That’s unusual transparency in an MT project.

Key highlights

Pre-trained models for hundreds of language pairs, CC-BY 4.0 licensed, downloadable from a matrix view
Two service setups: Tornado web app with UI/API, or lighter WebSocket server with experimental extensions
Docker and Docker+GPU support for quick deployment; manual setup needs Marian-NMT compiled with -DCOMPILE_SERVER=ON
SentencePiece segmentation and eflomal word alignments as standard pipeline
Training scripts exist but are currently hardcoded to University of Helsinki/CSC infrastructure

Caveats

Training your own models is basically non-portable right now; the Makefile assumes their specific environment
The authors explicitly warn that automatic evaluation scores are “too optimistic” on realistic data
No quality control beyond automatic tests; no domain adaptation, data augmentation, or filtering in current models

Verdict

Good fit if you need quick, no-strings-attached translation for many language pairs and can tolerate the quality caveats. Skip it if you need production-grade MT with reliability guarantees, or if you want to train custom models without access to Finnish supercomputing infrastructure.

Frequently asked

What is Helsinki-NLP/Opus-MT?: A ready-to-run neural translation stack built on Marian-NMT, with pre-trained models for hundreds of language pairs and a Tornado web UI that gets you translating in minutes.
Is Opus-MT open source?: Yes — Helsinki-NLP/Opus-MT is open source, released under the MIT license.
What language is Opus-MT written in?: Helsinki-NLP/Opus-MT is primarily written in Python.
How popular is Opus-MT?: Helsinki-NLP/Opus-MT has 836 stars on GitHub.
Where can I find Opus-MT?: Helsinki-NLP/Opus-MT is on GitHub at https://github.com/Helsinki-NLP/Opus-MT.