← all repositories
MarvinTeichmann/MultiNet

One network, three driving tasks, 2016-era TensorFlow

A research prototype that squeezes road segmentation, car detection, and street classification into a single real-time encoder-decoder model.

556 stars Python Computer VisionDomain Apps
MultiNet
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

What it does

MultiNet runs three computer-vision tasks for autonomous driving through one shared VGG encoder, then splits into separate decoders for each job. The repo glues together three submodules—KittiSeg for road segmentation, KittiBox for car detection, and KittiClass for street classification—into a single TensorFlow network orchestrated by the TensorVision backend.

The interesting bit

The architecture bets that one encoder can learn features good enough for all three tasks, saving compute at inference time. The project also treats experiment hygiene as a first-class concern: every training run auto-generates a rundir with code snapshots, TensorBoard events, and checkpointing, which is rarer than it should be in research code.

Key highlights

  • Joint training of detection + segmentation works out of the box; classification training requires your own private data (the public dataset is missing)
  • Achieved state-of-the-art KITTI road segmentation results at the time, per the linked leaderboard
  • Heavy use of git submodules—forget git submodule update and you’ll run new MultiNet code against stale submodule code
  • Built for Python 2.7 and TensorFlow 1.0, with Cython compilation steps in the setup
  • Demo script runs standalone; full training needs KITTI data download and additional evaluation code builds

Caveats

  • The README explicitly warns that training the full three-task “MultiNet3” is not possible with public data alone
  • Dependency stack is frozen in 2016: Python 2.7 and TensorFlow 1.0 are both long past end-of-life

Verdict

Worth studying if you’re building multi-task perception architectures or care about reproducible experiment organization. Skip it if you need something production-ready today—the tooling debt is substantial and the upstream dependencies are abandoned.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.