| End-to-End Semi-Supervised Learning for Video Action Detection | Mar 8, 2022 | Action Detection, Classification Consistency | Code Available | 1 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object Segmentation, Segmentation | Code Available | 1 |
| Accelerating Video Object Segmentation with Compressed Video | Jul 26, 2021 | Object, Segmentation | Code Available | 1 |
| LVOS: A Benchmark for Long-term Video Object Segmentation | Nov 18, 2022 | Object, Semantic Segmentation | Code Available | 1 |
| Making a Case for 3D Convolutions for Object Segmentation in Videos | Aug 26, 2020 | Decoder, Segmentation | Code Available | 1 |
| Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation | Mar 18, 2023 | Object, Optical Flow Estimation | Code Available | 1 |
| Boosting Video Object Segmentation via Space-time Correspondence Learning | Apr 13, 2023 | Object, Segmentation | Code Available | 1 |
| Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping | Apr 17, 2023 | Motion Segmentation, Object | Code Available | 1 |
| End-to-End Referring Video Object Segmentation with Multimodal Transformers | Nov 29, 2021 | Inductive Bias, Instance Segmentation | Code Available | 1 |
| Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning | May 12, 2021 | Landmark Tracking, Pose Tracking | Code Available | 1 |
| A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information | Jun 6, 2022 | Action Recognition, Semantic Segmentation | Code Available | 1 |
| Emerging Properties in Self-Supervised Vision Transformers | Apr 29, 2021 | Copy Detection, Image Classification | Code Available | 1 |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Nov 29, 2023 | Clustering, Object | Code Available | 1 |
| Local-Global Context Aware Transformer for Language-Guided Video Segmentation | Mar 18, 2022 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| Lester: rotoscope animation through video object segmentation and tracking | Feb 15, 2024 | 3D Human Pose Estimation, Object | Code Available | 1 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Nov 5, 2024 | GPU, Semantic Segmentation | Code Available | 1 |
| LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation | Jun 14, 2023 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation | Dec 21, 2020 | One-shot visual object segmentation, Segmentation | Code Available | 1 |
| Autoencoder-based background reconstruction and foreground segmentation with background noise estimation | Dec 15, 2021 | Foreground Segmentation, Segmentation | Code Available | 1 |
| Learning Video Object Segmentation from Unlabeled Videos | Mar 10, 2020 | Object, Representation Learning | Code Available | 1 |
| Active Boundary Loss for Semantic Segmentation | Feb 4, 2021 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Attention-guided Temporally Coherent Video Object Matting | May 24, 2021 | Image Matting, Object | Code Available | 1 |
| End-to-End Video Matting With Trimap Propagation | Jan 1, 2023 | Image Matting, Segmentation | Code Available | 1 |
| Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild | Mar 18, 2021 | Deep Reinforcement Learning, Interactive Video Object Segmentation | Code Available | 1 |
| Learning What to Learn for Video Object Segmentation | Mar 25, 2020 | Few-Shot Learning, Object | Code Available | 1 |
| MAST: A Memory-Augmented Self-supervised Tracker | Feb 18, 2020 | Semantic Segmentation, Semi-Supervised Video Object Segmentation | Code Available | 1 |
| A Transductive Approach for Video Object Segmentation | Apr 15, 2020 | Instance Segmentation, Object | Code Available | 1 |
| ActionVOS: Actions as Prompts for Video Object Segmentation | Jul 10, 2024 | Object, Referring Video Object Segmentation | Code Available | 1 |
| Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation | Jan 1, 2021 | Semantic Segmentation, Unsupervised Video Object Segmentation | Code Available | 1 |
| Learning Fast and Robust Target Models for Video Object Segmentation | Feb 27, 2020 | One-shot visual object segmentation, Segmentation | Code Available | 1 |
| Joint Inductive and Transductive Learning for Video Object Segmentation | Aug 8, 2021 | Inductive Learning, Object | Code Available | 1 |
| Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Jan 14, 2025 | Object, object-detection | Code Available | 1 |
| Learning Object Depth from Camera Motion and Video Object Segmentation | Jul 11, 2020 | Object, Segmentation | Code Available | 1 |
| D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | Multi-Object Tracking and Segmentation, Segmentation | Code Available | 1 |
| D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | Segmentation, Semantic Segmentation | Code Available | 1 |
| A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation | Sep 19, 2022 | Clustering, global-optimization | Code Available | 1 |
| DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks | Apr 2, 2023 | Diversity, Object Tracking | Code Available | 1 |
| Kernelized Memory Network for Video Object Segmentation | Jul 16, 2020 | Object, Semantic Segmentation | Code Available | 1 |
| Accelerating Volumetric Medical Image Annotation via Short-Long Memory SAM 2 | May 3, 2025 | Computed Tomography (CT), Semantic Segmentation | Code Available | 1 |
| Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation | Jun 8, 2022 | Denoising, Referring Video Object Segmentation | Code Available | 1 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot Learning, Interactive Segmentation | Code Available | 1 |
| Associating Objects with Transformers for Video Object Segmentation | Jun 4, 2021 | Object, One-shot visual object segmentation | Code Available | 1 |
| Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation | Aug 25, 2023 | Object, Object Tracking | Code Available | 1 |
| Interactive Video Object Segmentation Using Global and Local Transfer Modules | Jul 16, 2020 | Decoder, Interactive Video Object Segmentation | Code Available | 1 |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Mar 23, 2023 | Clustering, Online Clustering | Code Available | 1 |
| Delving Deep Into Many-to-Many Attention for Few-Shot Video Object Segmentation | Jun 19, 2021 | Meta-Learning, Semantic Segmentation | Code Available | 1 |
| Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation | Oct 23, 2020 | Object, One-shot visual object segmentation | Code Available | 1 |
| Dense Unsupervised Learning for Video Segmentation | Nov 11, 2021 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Mar 7, 2024 | Depth Estimation, Depth Prediction | Code Available | 1 |
| 1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation | Dec 27, 2022 | Object, Referring Video Object Segmentation | Code Available | 1 |