
bradyz/cross_view_transformers

A transformer-based model that fuses multi-view camera images into map-view semantic segmentation at 45 FPS for autonomous driving.

575 stars · Python · Computer Vision · Domain Apps
Velocity (7d): +0.4 ★/day · Trend: steady

This repository implements Cross-view Transformers (CVPR 2022), a model that takes images from multiple cameras (e.g., front, back, and sides) and predicts semantic segmentation in a top-down map coordinate frame. Cross-view attention learns the correspondence between image pixels and map locations. The repository supports the nuScenes and KITTI datasets. The model runs in real time at 45 FPS, and its predictions can be fused over time using vehicle pose to build a map.
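To make the idea of attention between map locations and image pixels concrete, here is a minimal sketch of generic scaled dot-product cross-attention in NumPy. It is not the repository's code: the function name `cross_view_attention`, the tensor sizes, and the random features are all hypothetical, and the sketch leaves out learned projections, multiple heads, and any positional or camera-geometry embeddings the real model may use. Each map-view cell acts as a query, and the pixels from all cameras, flattened together, act as keys and values.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_view_attention(map_queries, image_keys, image_values):
    """Generic scaled dot-product cross-attention (illustrative only).

    map_queries:  (Q, d) one query per map-view cell
    image_keys:   (N, d) one key per image pixel, all cameras flattened
    image_values: (N, c) one feature vector per image pixel
    Returns the attended map features (Q, c) and the weights (Q, N).
    """
    d = map_queries.shape[-1]
    scores = map_queries @ image_keys.T / np.sqrt(d)  # (Q, N)
    weights = softmax(scores, axis=-1)                # each row sums to 1
    return weights @ image_values, weights


# Hypothetical sizes, chosen only to keep the example small.
rng = np.random.default_rng(0)
num_cameras, pixels_per_camera, d, c = 6, 20, 16, 8
map_h, map_w = 5, 5

image_keys = rng.normal(size=(num_cameras * pixels_per_camera, d))
image_values = rng.normal(size=(num_cameras * pixels_per_camera, c))
map_queries = rng.normal(size=(map_h * map_w, d))

map_features, weights = cross_view_attention(map_queries, image_keys, image_values)
map_features = map_features.reshape(map_h, map_w, c)  # top-down feature grid
```

In this sketch every map cell can draw on pixels from any camera, which is the property that lets a single top-down grid combine several views. A segmentation head, not shown here, would then turn the feature grid into per-cell class scores.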
