| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot Learning, Interactive Segmentation | Code Available | 1 |
| MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Jan 23, 2025 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Jan 14, 2025 | Object, object-detection | Code Available | 1 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Dec 18, 2024 | Object, Semantic Segmentation | Code Available | 1 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Dec 2, 2024 | Object, Object Tracking | Code Available | 1 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | Object, Segmentation | Code Available | 1 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Nov 5, 2024 | GPU, Semantic Segmentation | Code Available | 1 |
| X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation | Sep 28, 2024 | Semantic Segmentation, Video Object Segmentation | Code Available | 1 |
| ActionVOS: Actions as Prompts for Video Object Segmentation | Jul 10, 2024 | Object, Referring Video Object Segmentation | Code Available | 1 |
| Video Inpainting Localization with Contrastive Learning | Jun 25, 2024 | Contrastive Learning, Decoder | Code Available | 1 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object Segmentation, Segmentation | Code Available | 1 |
| Event-assisted Low-Light Video Object Segmentation | Apr 2, 2024 | Object, Semantic Segmentation | Code Available | 1 |
| Temporally Consistent Referring Video Object Segmentation with Hybrid Memory | Mar 28, 2024 | HTR, Object | Code Available | 1 |
| Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation | Mar 18, 2024 | Referring Video Object Segmentation, Semantic Segmentation | Code Available | 1 |
| Video Object Segmentation with Dynamic Query Modulation | Mar 18, 2024 | Object, Segmentation | Code Available | 1 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Mar 12, 2024 | GPU, Point Tracking | Code Available | 1 |
| Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Mar 7, 2024 | Depth Estimation, Depth Prediction | Code Available | 1 |
| VideoMAC: Video Masked Autoencoders Meet ConvNets | Feb 29, 2024 | Pose Tracking, Representation Learning | Code Available | 1 |
| Lester: rotoscope animation through video object segmentation and tracking | Feb 15, 2024 | 3D Human Pose Estimation, Object | Code Available | 1 |
| 1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation | Jan 1, 2024 | Object, Referring Video Object Segmentation | Code Available | 1 |
| Tracking with Human-Intent Reasoning | Dec 29, 2023 | Language Modelling, Object | Code Available | 1 |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Nov 29, 2023 | Clustering, Object | Code Available | 1 |
| SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation | Nov 24, 2023 | Meta-Learning, One-Shot Segmentation | Code Available | 1 |
| Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation | Sep 26, 2023 | Object, Optical Flow Estimation | Code Available | 1 |
| PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Sep 21, 2023 | Autonomous Driving, Segmentation | Code Available | 1 |
| Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation | Aug 25, 2023 | Object, Object Tracking | Code Available | 1 |
| Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation | Aug 13, 2023 | Semantic Segmentation, Video Object Segmentation | Code Available | 1 |
| Spectrum-guided Multi-granularity Referring Video Object Segmentation | Jul 25, 2023 | Object, Referring Expression Segmentation | Code Available | 1 |
| OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation | Jul 18, 2023 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation | Jul 3, 2023 | Image Segmentation, Referring Expression | Code Available | 1 |
| LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation | Jun 14, 2023 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | May 26, 2023 | cross-modal alignment, Object | Code Available | 1 |
| Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | May 25, 2023 | Object, Referring Expression Segmentation | Code Available | 1 |
| UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model | May 22, 2023 | Image Segmentation, Object | Code Available | 1 |
| Event-Free Moving Object Segmentation from Moving Ego Vehicle | Apr 28, 2023 | Autonomous Driving, Benchmarking | Code Available | 1 |
| Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping | Apr 17, 2023 | Motion Segmentation, Object | Code Available | 1 |
| Segment Everything Everywhere All at Once | Apr 13, 2023 | All, Decoder | Code Available | 1 |
| Boosting Video Object Segmentation via Space-time Correspondence Learning | Apr 13, 2023 | Object, Segmentation | Code Available | 1 |
| DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks | Apr 2, 2023 | Diversity, Object Tracking | Code Available | 1 |
| Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation | Mar 25, 2023 | Semantic Segmentation, Video Object Segmentation | Code Available | 1 |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Mar 23, 2023 | Clustering, Online Clustering | Code Available | 1 |
| Two-shot Video Object Segmentation | Mar 21, 2023 | Object, Pseudo Label | Code Available | 1 |
| Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation | Mar 18, 2023 | Object, Optical Flow Estimation | Code Available | 1 |
| Guided Slot Attention for Unsupervised Video Object Segmentation | Mar 15, 2023 | Object, Semantic Segmentation | Code Available | 1 |
| Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction | Feb 7, 2023 | Instance Segmentation, Multi-Object Tracking | Code Available | 1 |
| TarViS: A Unified Approach for Target-based Video Segmentation | Jan 6, 2023 | Instance Segmentation, Panoptic Segmentation | Code Available | 1 |
| End-to-End Video Matting With Trimap Propagation | Jan 1, 2023 | Image Matting, Segmentation | Code Available | 1 |
| Video Object Segmentation-aware Video Frame Interpolation | Jan 1, 2023 | Object, Pose Estimation | Code Available | 1 |
| 1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation | Dec 27, 2022 | Object, Referring Video Object Segmentation | Code Available | 1 |
| Learning to Learn Better for Video Object Segmentation | Dec 5, 2022 | Inductive Learning, Object | Code Available | 1 |