hkchengrex/XMem
A PyTorch implementation of XMem, a deep learning video object segmentation system using an Atkinson-Shiffrin memory model to track objects across very long videos.

XMem addresses video object segmentation by framing it as a memory problem, using an Atkinson-Shiffrin-inspired architecture to maintain object identity across frames over extended video sequences. The system handles occlusion and long-term dependencies through multi-level memory stores. Implemented in PyTorch, it includes a GUI for interactive video segmentation and achieves approximately 20 FPS on GPU.