taco-group/OpenEMMA
OpenEMMA is an open-source end-to-end autonomous driving framework that uses pretrained Vision Language Models to predict ego waypoints from camera and text inputs.

Velocity · 7d
+1.6
★ / day
Trend
→steady
star history
OpenEMMA is an open-source reproduction of Waymo’s EMMA model for autonomous driving. It leverages pretrained VLMs such as GPT-4 and LLaVA to integrate front-view camera images and text inputs, enabling real-time prediction of future ego waypoints along with decision rationales. The project provides a PyPI-installable package and an end-to-end pipeline for motion planning in autonomous vehicles.