
fudan-zvg/SETR

A semantic segmentation model that rethinks segmentation as a sequence-to-sequence problem using a transformer encoder architecture.

1.1k stars · Python · Computer Vision
Velocity · 7d: +0.6 ★ / day
Trend: steady

SETR applies Vision Transformers to semantic segmentation: it treats image patches as a sequence and uses a transformer encoder for dense prediction. The project provides model implementations and configurations for Cityscapes and other segmentation benchmarks, including the SETR-Naive and SETR-MLA variants, with pretrained weights.
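
To make the "image patches as sequences" idea concrete, here is a minimal NumPy sketch of the reshaping involved. It is not taken from the SETR codebase: the function names `image_to_patch_sequence` and `sequence_to_feature_map` are hypothetical, and the transformer encoder itself is omitted. It only shows how an image might become a token sequence and how per-token outputs could be folded back into a 2D grid for dense prediction.

```python
import numpy as np


def image_to_patch_sequence(image, patch_size):
    """Split an (H, W, C) image into non-overlapping square patches.

    Returns a (num_patches, patch_size * patch_size * C) array, with
    patches in row-major order, plus the (rows, cols) patch grid shape.
    Hypothetical helper for illustration, not the repository's API.
    """
    h, w, c = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("height and width must be divisible by patch_size")
    gh, gw = h // patch_size, w // patch_size
    # (gh, p, gw, p, c) -> (gh, gw, p, p, c): group pixels by patch
    patches = image.reshape(gh, patch_size, gw, patch_size, c)
    patches = patches.transpose(0, 2, 1, 3, 4)
    return patches.reshape(gh * gw, patch_size * patch_size * c), (gh, gw)


def sequence_to_feature_map(tokens, grid):
    """Fold a (num_patches, D) token sequence back into a (rows, cols, D) map."""
    gh, gw = grid
    return tokens.reshape(gh, gw, tokens.shape[-1])


if __name__ == "__main__":
    image = np.zeros((32, 64, 3), dtype=np.float32)
    seq, grid = image_to_patch_sequence(image, patch_size=16)
    print(seq.shape, grid)  # sequence length is rows * cols of the patch grid
    fmap = sequence_to_feature_map(seq, grid)
    print(fmap.shape)
```

In a full model, the sequence would pass through a learned linear projection and the transformer encoder before being folded back, and a decoder head would then upsample the resulting feature map to per-pixel class scores.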
