meta-pytorch/segment-anything-fast
A batched offline inference-oriented fork of Meta's Segment Anything model optimized with PyTorch torch.compile and custom Triton kernels.

Velocity · 7d
+1.3
★ / day
Trend
→steady
star history
This repository provides a drop-in replacement for Meta’s Segment Anything model with a focus on fast inference. It automatically applies eval mode, bfloat16 precision, and torch.compile with max-autotune. The key optimization is a custom Triton kernel implementing efficient self-distributed pattern attention (SDPA) for relative positional encodings, tuned for A100 GPUs with automatic autotuning for other devices.