octo-models/octo
A transformer-based generalist robotic policy trained on 800k robot trajectories for multimodal robot arm control.

Velocity · 7d
+1.8
★ / day
Trend
→steady
star history
Octo is a transformer-based diffusion policy that serves as a generalist robot policy. It processes multiple RGB camera inputs and can be controlled via language commands or goal images to generate actions for various robot arms. The model uses a modular attention structure in its transformer backbone, enabling effective finetuning to new robot setups with different sensory inputs, action spaces, and morphologies using limited target domain data.