Kaggle, but you can actually see the wiring
An open-source platform for hosting AI competitions that keeps evaluation honest by forcing everyone onto the same track.

What it does EvalAI is a self-hostable platform for running machine-learning competitions: participants submit models, organizers define evaluation protocols, and the system ranks everything on a public or private leaderboard. It handles the boring but critical work—standardizing dataset splits, metrics, and compute environments—so paper results become reproducible rather than aspirational.
The interesting bit The platform lets organizers plug in their own worker clusters for heavy jobs, and it can evaluate submissions inside Docker containers so your code runs in the same environment it was tested in. The “faster evaluation” trick is pleasantly mechanical: preload the dataset into memory and shard it across cores at startup. No magic, just warm caches.
Key highlights
- Supports arbitrary evaluation phases and dataset splits, in any language.
- Remote worker nodes let organizers bring their own compute for large-scale challenges.
- Docker-based evaluation runs submissions inside isolated environments.
- Companion CLI (
evalai-cli) for terminal-driven workflows. - Built on standard open-source stack: Django, Node.js, PostgreSQL, Docker.
- Published academic project with a SOSP 2019 paper and active maintenance.
Caveats
- The README claims setup is “really easy” via Docker Compose, but warns it “might take a while” and requires separate profiles for worker services.
- Default local credentials are literally
admin/password,host/password,participant/password—convenient for demos, alarming if forgotten.
Verdict Academic labs and challenge organizers who need transparency and control should look here; casual Kaggle competitors will find the self-hosting overhead tedious. If you just want to run a quick leaderboard for a class project, this might be more forklift than wheelbarrow.