
JackHopkins/factorio-learning-environment

An open-source framework for evaluating LLM agents through gameplay in the Factorio video game.

1k stars · Python · LLMOps · EvalAgents
Velocity (7d): +0.5 ★/day · Trend: steady

The Factorio Learning Environment provides a benchmark and development toolkit for testing large language model agents in Factorio, a complex, open-ended game. It offers a Docker-based sandbox environment with a Python SDK, MCP server integration, and PostgreSQL support for tracking evaluation trajectories. The framework also includes built-in metrics and a public leaderboard for comparing LLM agent performance.
