An open-source ML platform that moved to GitLab
MLReef tried to be the GitLab for machine learning, then became exactly that — on GitLab.

What it does
MLReef is an open-source ML-Ops platform that wraps data management, experiment tracking, and pipeline orchestration into one web-based workspace. It versions datasets through Git LFS, containerizes Python scripts via argparse parameters, and runs jobs on Kubernetes, cloud, or bare metal. Think of it as a attempt to glue together the scattered toolchain of modern ML work.
The interesting bit
The project auto-containerizes your scripts just because you used argparse — no Dockerfile hand-crafting required. It also captures “non-committed local changes” in experiment logs, which is the kind of paranoid reproducibility measure that suggests the authors have been burned by “it worked on my laptop” more than once.
Key highlights
- Data hosting with Git/Git LFS versioning and external storage connectors
- Auto-containerization of Python scripts based on argparse parameters
- Experiment tracking with environment snapshots, metrics comparison, and artifact storage
- Pipeline orchestration for K8s, cloud, and bare-metal deployments
- Supports PyTorch, TensorFlow, Keras, and Scikit-Learn
Caveats
- This GitHub repository is abandoned. The README explicitly states: “We are no longer supporting and updating this repository.” Active development moved to GitLab.
- The “thousands of other users” claim in the README is unsupported by the 1,460 stars and lacks any usage metrics in the sources.
- Several documentation links (docs.mlreef.com, developer_guide.md) are referenced but not verified working in the provided sources.
Verdict
Worth a look if you want a self-hosted, Git-centric ML platform and don’t mind chasing the project to GitLab. Skip it if you need a actively maintained community on GitHub or want something that doesn’t require running your own infrastructure.