← all repositories
facebookresearch/CommAI-env

Teaching AI like a toddler — one task at a time

A research environment that tests whether language models can learn incrementally from a stream of simple tasks, the way humans do.

1.3k stars AgentsML Frameworks
CommAI-env
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

What it does

CommAI-env is a task environment for training and evaluating AI systems on a curriculum of small, communicative challenges. Tasks are fed to the learner one by one through a text interface; the system must figure out what is being asked and respond correctly. It is built to match the incremental, task-by-task learning framework proposed in Facebook AI Research’s 2015 “Roadmap towards Machine Intelligence” paper.

The interesting bit

The environment is deliberately minimal — just a text channel between teacher and learner — because the authors believe general intelligence emerges from solving many simple problems in sequence, not from one giant pretrained model. The roadmap itself is the more substantial contribution; this repo is the reference implementation.

Key highlights

  • Tasks are composable and can be mixed into curricula of varying difficulty
  • Learners interact through a single ASCII stream, so any text-in/text-out model can plug in
  • Includes a set of baseline tasks covering counting, list manipulation, and simple Q&A
  • Designed for incremental learning: new tasks are introduced without retraining from scratch
  • The companion paper (arXiv:1511.08130) lays out the full theoretical framework

Caveats

  • The repository has not seen commits in several years; the code targets Python 2/early 3
  • 1326 stars but minimal recent activity suggests the community moved on to other benchmarks
  • The README is sparse on setup details and expects familiarity with the paper

Verdict

Worth reading for the roadmap paper’s arguments about incremental, communication-based learning. The code itself is a thin scaffold — useful if you are replicating the exact setup, but modern researchers will likely prefer more active environments like BabyAI or ProcGen.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.