google-research/robotics_transformer
RT-1 is a transformer model that uses image tokenization and learned token compression for end-to-end robotic control.

Velocity (7d): +1.3 ★/day · Trend: → steady
Robotics Transformer (RT-1) is a Google Research project that implements a transformer architecture for robot control. A FiLM-conditioned EfficientNet image tokenizer processes visual inputs, a TokenLearner module compresses the resulting tokens, and a transformer decoder produces robot actions. The repository includes pre-trained checkpoints for several robot control scenarios, including multi-robot and sim-to-real deployment.
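To make the token compression step more concrete, here is a minimal, dependency-free sketch of the general idea: each output token is an attention-weighted average of the input tokens. It is not the repository's code. The function names (`softmax`, `compress_tokens`), the random `score_weights` standing in for a learned scoring network, and the sizes `N`, `K`, and `D` are all assumptions made for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def compress_tokens(tokens, score_weights):
    """Reduce N input tokens to K output tokens.

    tokens:        N vectors of length D (e.g. image patch features).
    score_weights: K vectors of length D; a hypothetical stand-in for a
                   learned scoring network, one vector per output token.
    """
    dim = len(tokens[0])
    compressed = []
    for w in score_weights:
        # Score every input token, then normalize across the N tokens.
        scores = [sum(wi * ti for wi, ti in zip(w, tok)) for tok in tokens]
        attn = softmax(scores)
        # Each output token is an attention-weighted average of the inputs.
        pooled = [sum(a * tok[d] for a, tok in zip(attn, tokens))
                  for d in range(dim)]
        compressed.append(pooled)
    return compressed


rng = random.Random(0)
N, K, D = 81, 8, 16  # illustrative sizes, chosen for this sketch
tokens = [[rng.gauss(0, 1) for _ in range(D)] for _ in range(N)]
score_weights = [[rng.gauss(0, 1) for _ in range(D)] for _ in range(K)]

compressed = compress_tokens(tokens, score_weights)
print(len(compressed), len(compressed[0]))  # 8 16
```

In a trained model the scoring weights would be learned jointly with the rest of the network, so each output token can specialize in a different part of the image. The transformer then attends over `K` tokens per image instead of `N`, which shortens the sequence it has to process.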