| Collaborative Video Object Segmentation by Foreground-Background Integration | Mar 18, 2020 | ObjectOne-shot visual object segmentation | CodeCode Available | 1 | 5 |
| Learning Video Object Segmentation from Unlabeled Videos | Mar 10, 2020 | ObjectRepresentation Learning | CodeCode Available | 1 | 5 |
| Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration | Oct 13, 2020 | ObjectOne-shot visual object segmentation | CodeCode Available | 1 | 5 |
| Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation | Jul 27, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective | Mar 31, 2021 | Contrastive LearningObject | CodeCode Available | 1 | 5 |
| FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical Flow | Nov 20, 2021 | Optical Flow EstimationSegmentation | CodeCode Available | 1 | 5 |
| Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation | Jun 9, 2021 | Semantic SegmentationSemi-Supervised Video Object Segmentation | CodeCode Available | 1 | 5 |
| Lester: rotoscope animation through video object segmentation and tracking | Feb 15, 2024 | 3D Human Pose EstimationObject | CodeCode Available | 1 | 5 |
| Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation | Jun 8, 2022 | DenoisingReferring Video Object Segmentation | CodeCode Available | 1 | 5 |
| Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic Perspective | Nov 2, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Full-Duplex Strategy for Video Object Segmentation | Aug 6, 2021 | ObjectObject Detection | CodeCode Available | 1 | 5 |
| 1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation | Dec 27, 2022 | ObjectReferring Video Object Segmentation | CodeCode Available | 1 | 5 |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Mar 23, 2023 | ClusteringOnline Clustering | CodeCode Available | 1 | 5 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Jun 15, 2025 | ObjectSemantic Segmentation | CodeCode Available | 1 | 5 |
| Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation | Mar 25, 2023 | Semantic SegmentationVideo Object Segmentation | CodeCode Available | 1 | 5 |
| Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation | Mar 18, 2024 | Referring Video Object SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Reliable Propagation-Correction Modulation for Video Object Segmentation | Dec 6, 2021 | ObjectSemantic Segmentation | CodeCode Available | 1 | 5 |
| SwiftNet: Real-time Video Object Segmentation | Feb 9, 2021 | ObjectSegmentation | CodeCode Available | 1 | 5 |
| Global Spectral Filter Memory Network for Video Object Segmentation | Oct 11, 2022 | AttributeDecoder | CodeCode Available | 1 | 5 |
| Learning Fast and Robust Target Models for Video Object Segmentation | Feb 27, 2020 | One-shot visual object segmentationSegmentation | CodeCode Available | 1 | 5 |
| D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps | Apr 21, 2021 | Interactive SegmentationInteractive Video Object Segmentation | CodeCode Available | 1 | 5 |
| Guided Slot Attention for Unsupervised Video Object Segmentation | Mar 15, 2023 | ObjectSemantic Segmentation | CodeCode Available | 1 | 5 |
| D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | Multi-Object Tracking and SegmentationSegmentation | CodeCode Available | 1 | 5 |
| Event-assisted Low-Light Video Object Segmentation | Apr 2, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 | 5 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Dec 2, 2024 | ObjectObject Tracking | CodeCode Available | 1 | 5 |
| EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Sep 26, 2022 | ObjectSegmentation | CodeCode Available | 1 | 5 |
| Hierarchical Memory Matching Network for Video Object Segmentation | Sep 23, 2021 | ObjectRetrieval | CodeCode Available | 1 | 5 |
| Joint Inductive and Transductive Learning for Video Object Segmentation | Aug 8, 2021 | Inductive LearningObject | CodeCode Available | 1 | 5 |
| Kernelized Memory Network for Video Object Segmentation | Jul 16, 2020 | ObjectSemantic Segmentation | CodeCode Available | 1 | 5 |
| Motion-Attentive Transition for Zero-Shot Video Object Segmentation | Mar 9, 2020 | DecoderObject | CodeCode Available | 1 | 5 |
| RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation | Jul 3, 2023 | Image SegmentationReferring Expression | CodeCode Available | 1 | 5 |
| End-to-End Video Matting With Trimap Propagation | Jan 1, 2023 | Image MattingSegmentation | CodeCode Available | 1 | 5 |
| End-to-End Semi-Supervised Learning for Video Action Detection | Mar 8, 2022 | Action DetectionClassification Consistency | CodeCode Available | 1 | 5 |
| BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video | Sep 25, 2022 | Long-tail Video Object SegmentationMulti-Object Tracking | CodeCode Available | 1 | 5 |
| Associating Objects with Transformers for Video Object Segmentation | Jun 4, 2021 | ObjectOne-shot visual object segmentation | CodeCode Available | 1 | 5 |
| End-to-End Referring Video Object Segmentation with Multimodal Transformers | Nov 29, 2021 | Inductive BiasInstance Segmentation | CodeCode Available | 1 | 5 |
| Multi-Attention Network for Compressed Video Referring Object Segmentation | Jul 26, 2022 | ObjectReferring Expression Segmentation | CodeCode Available | 1 | 5 |
| A Transductive Approach for Video Object Segmentation | Apr 15, 2020 | Instance SegmentationObject | CodeCode Available | 1 | 5 |
| Emerging Properties in Self-Supervised Vision Transformers | Apr 29, 2021 | Copy DetectionImage Classification | CodeCode Available | 1 | 5 |
| Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation | Aug 13, 2023 | Semantic SegmentationVideo Object Segmentation | CodeCode Available | 1 | 5 |
| RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation | Oct 1, 2020 | Image SegmentationReferring Expression Segmentation | CodeCode Available | 1 | 5 |
| See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks | Jan 19, 2020 | Semantic SegmentationUnsupervised Video Object Segmentation | CodeCode Available | 1 | 5 |
| SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation | Jan 21, 2021 | Inductive BiasMotion Segmentation | CodeCode Available | 1 | 5 |
| Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation | Aug 25, 2023 | ObjectObject Tracking | CodeCode Available | 1 | 5 |
| Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation | Oct 23, 2020 | ObjectOne-shot visual object segmentation | CodeCode Available | 1 | 5 |
| Interactive Video Object Segmentation Using Global and Local Transfer Modules | Jul 16, 2020 | DecoderInteractive Video Object Segmentation | CodeCode Available | 1 | 5 |
| Dense Unsupervised Learning for Video Segmentation | Nov 11, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Mar 7, 2024 | Depth EstimationDepth Prediction | CodeCode Available | 1 | 5 |
| UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking | Jan 15, 2020 | ObjectSegmentation | CodeCode Available | 1 | 5 |