hkchengrex/MiVOS
A CVPR 2021 research project for interactive video object segmentation combining user interactions with mask propagation and difference-aware fusion.

MiVOS implements modular interactive video object segmentation: users provide sparse interactions, these are converted to masks, and the masks are propagated across the video frames. The system combines three modules: an interaction-to-mask module, a mask propagation module, and a difference-aware fusion module. It supports both semi-supervised VOS evaluation on DAVIS and YouTube-VOS and interactive evaluation through a user GUI. The project is implemented in Python using PyTorch.
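
To make the three-module structure concrete, here is a minimal toy sketch of one interaction round in plain Python. It is not the MiVOS code or API: every name in it (`interaction_to_mask`, `propagate`, `fuse`, `run_round`) is hypothetical, the "modules" are trivial stand-ins rather than neural networks, and a simple distance-weighted linear blend is substituted for the learned difference-aware fusion. It only illustrates how data might flow between the modules.

```python
from typing import List, Tuple

Mask = List[List[float]]  # soft mask for one frame, values in [0, 1]


def interaction_to_mask(height: int, width: int,
                        clicks: List[Tuple[int, int]]) -> Mask:
    """Hypothetical stand-in for the interaction-to-mask module:
    mark each clicked (row, col) pixel as foreground."""
    mask = [[0.0] * width for _ in range(height)]
    for row, col in clicks:
        mask[row][col] = 1.0
    return mask


def propagate(mask: Mask, num_frames: int) -> List[Mask]:
    """Hypothetical stand-in for the propagation module:
    copy the mask unchanged to every frame."""
    return [[row[:] for row in mask] for _ in range(num_frames)]


def fuse(old: Mask, new: Mask, weight: float) -> Mask:
    """Stand-in for fusion: per-pixel linear blend (assumption;
    not the learned difference-aware fusion described above)."""
    return [[(1.0 - weight) * o + weight * n for o, n in zip(ro, rn)]
            for ro, rn in zip(old, new)]


def run_round(previous: List[Mask], frame: int,
              clicks: List[Tuple[int, int]]) -> List[Mask]:
    """One round: interaction -> mask -> propagation -> fusion."""
    num_frames = len(previous)
    height, width = len(previous[0]), len(previous[0][0])
    new_mask = interaction_to_mask(height, width, clicks)
    propagated = propagate(new_mask, num_frames)
    fused = []
    for t in range(num_frames):
        # Trust the new result most near the interacted frame.
        weight = 1.0 - abs(t - frame) / num_frames
        fused.append(fuse(previous[t], propagated[t], weight))
    return fused


if __name__ == "__main__":
    # Three empty 2x2 frames; the user clicks pixel (0, 1) on frame 1.
    empty = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(3)]
    result = run_round(empty, frame=1, clicks=[(0, 1)])
    for t, mask in enumerate(result):
        print(t, mask)
```

Keeping each stage behind its own function mirrors the modular design: in this sketch, any stage could be replaced with a stronger model without touching the others.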