mees/calvin
A simulated benchmark for training and evaluating deep-learning policies that control robots to perform language-specified manipulation tasks.

Velocity · 7d
+0.5
★ / day
Trend
→steady
star history
CALVIN provides a PyTorch-based simulated environment for long-horizon robot manipulation tasks conditioned on natural language instructions. The benchmark evaluates agents on composing actions from language and vision, supporting flexible sensor suites. It won the 2022 IEEE RA-L Best Paper Award and serves as a testbed for vision-and-language policy learning research.