| 3D-Aware Instance Segmentation and Tracking in Egocentric Videos | Aug 19, 2024 | 3D Object Reconstruction, Instance Segmentation | Unverified | 0 |
| UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track | Aug 19, 2024 | Referring Video Object Segmentation, Semantic Segmentation | Unverified | 0 |
| Fast Sprite Decomposition from Animated Graphics | Aug 7, 2024 | Semantic Segmentation, Video Object Segmentation | Unverified | 0 |
| Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Aug 6, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 0 |
| Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation | Jul 31, 2024 | Object, Segmentation | Code Available | 0 |
| Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video | Jul 22, 2024 | Disentanglement, Knowledge Distillation | Code Available | 0 |
| Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Jul 16, 2024 | Object, object-detection | Unverified | 0 |
| Learning Spatial-Semantic Features for Robust Video Object Segmentation | Jul 10, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| Rethinking Image-to-Video Adaptation: An Object-centric Perspective | Jul 9, 2024 | Action Recognition, Object | Unverified | 0 |
| Submodular video object proposal selection for semantic object segmentation | Jul 8, 2024 | Object, Segmentation | Unverified | 0 |
| Non-parametric Contextual Relationship Learning for Semantic Video Object Segmentation | Jul 8, 2024 | Semantic Segmentation, Video Object Segmentation | Unverified | 0 |
| Context Propagation from Proposals for Semantic Video Object Segmentation | Jul 8, 2024 | Object, Segmentation | Unverified | 0 |
| 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 20, 2024 | Instance Segmentation, Referring Video Object Segmentation | Unverified | 0 |
| Trusted Video Inpainting Localization via Deep Attentive Noise Learning | Jun 19, 2024 | Semantic Segmentation, Video Inpainting | Code Available | 0 |
| ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection | Jun 18, 2024 | object-detection, Object Detection | Code Available | 0 |
| GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation | Jun 18, 2024 | Contrastive Learning, Object | Unverified | 0 |
| RMem: Restricted Memory Banks Improve Video Object Segmentation | Jun 12, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance Segmentation, Semantic Segmentation | Unverified | 0 |
| Training-Free Robust Interactive Video Object Segmentation | Jun 8, 2024 | Interactive Video Object Segmentation, Object | Unverified | 0 |
| 3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation | Jun 7, 2024 | Referring Video Object Segmentation, Semantic Segmentation | Unverified | 0 |
| A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation | Jun 7, 2024 | Multi-Task Learning, Object | Unverified | 0 |
| 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation | Jun 7, 2024 | Object, Segmentation | Unverified | 0 |
| 3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 6, 2024 | Object, Position | Unverified | 0 |
| Lifelong Learning Using a Dynamically Growing Tree of Sub-networks for Domain Generalization in Video Object Segmentation | May 29, 2024 | Domain Generalization, Lifelong learning | Unverified | 0 |
| One-shot Training for Video Object Segmentation | May 22, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation | May 17, 2024 | Referring Expression Segmentation, Referring Video Object Segmentation | Unverified | 0 |
| Global Motion Understanding in Large-Scale Video Object Segmentation | May 11, 2024 | Instance Segmentation, Optical Flow Estimation | Unverified | 0 |
| DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation | May 11, 2024 | Optical Flow Estimation, Semantic Segmentation | Unverified | 0 |
| Space-time Reinforcement Network for Video Object Segmentation | May 7, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| 360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos | Apr 22, 2024 | Object, Object Tracking | Unverified | 0 |
| Spatial-Temporal Multi-level Association for Video Object Segmentation | Apr 9, 2024 | Object, Segmentation | Unverified | 0 |
| Annolid: Annotate, Segment, and Track Anything You Need | Mar 27, 2024 | Instance Segmentation, Segmentation | Code Available | 0 |
| OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework | Mar 13, 2024 | All, Management | Unverified | 0 |
| ClickVOS: Click Video Object Segmentation | Mar 10, 2024 | Object, Segmentation | Code Available | 0 |
| Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation | Feb 14, 2024 | Decoder, Object | Unverified | 0 |
| Point-VOS: Pointing Up Video Object Segmentation | Feb 8, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound | Feb 7, 2024 | All, Lesion Segmentation | Unverified | 0 |
| Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention | Jan 25, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation | Jan 23, 2024 | Interactive Video Object Segmentation, Semantic Segmentation | Unverified | 0 |
| Understanding Video Transformers via Universal Concept Discovery | Jan 19, 2024 | Action Recognition, Decision Making | Unverified | 0 |
| Learning to Segment Referred Objects from Narrated Egocentric Videos | Jan 1, 2024 | Object, Segmentation | Unverified | 0 |
| No More Shortcuts: Realizing the Potential of Temporal Self-Supervision | Dec 20, 2023 | Action Classification, Attribute | Unverified | 0 |
| Hierarchical Graph Pattern Understanding for Zero-Shot VOS | Dec 15, 2023 | Decoder, Graph Neural Network | Code Available | 0 |
| TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking | Dec 13, 2023 | Semantic Segmentation, Video Object Segmentation | Unverified | 0 |
| Semi-supervised Active Learning for Video Action Detection | Dec 12, 2023 | Action Detection, Active Learning | Code Available | 0 |
| Flexible visual prompts for in-context learning in computer vision | Dec 11, 2023 | Image Segmentation, In-Context Learning | Code Available | 0 |
| VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Nov 30, 2023 | Semantic Segmentation, Video Editing | Unverified | 0 |
| SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation | Nov 30, 2023 | Object, object-detection | Unverified | 0 |
| Sketch-based Video Object Segmentation: Benchmark and Analysis | Nov 13, 2023 | Object, Segmentation | Unverified | 0 |
| Learning the What and How of Annotation in Video Object Segmentation | Nov 8, 2023 | Segmentation, Semantic Segmentation | Unverified | 0 |