4th PVUW MeViS 3rd Place Report: Sa2VA Apr 1, 2025 Language Modeling Language Modelling
Code Code Available 5Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Jan 7, 2025 2k Language Modeling
Code Code Available 5The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation Apr 7, 2025 Inference Optimization Referring Video Object Segmentation
Code Code Available 5LISA: Reasoning Segmentation via Large Language Model Aug 1, 2023 Language Modeling Language Modelling
Code Code Available 4Tracking Anything with Decoupled Video Segmentation Sep 7, 2023 Open-Vocabulary Video Segmentation Open-World Video Segmentation
Code Code Available 3Universal Instance Perception as Object Discovery and Retrieval Mar 12, 2023 Described Object Detection Generalized Referring Expression Comprehension
Code Code Available 3UniVS: Unified and Universal Video Segmentation with Prompts as Queries Feb 28, 2024 Decoder Referring Expression Segmentation
Code Code Available 3SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation Nov 26, 2024 Natural Language Understanding Referring Video Object Segmentation
Code Code Available 3VISA: Reasoning Video Object Segmentation via Large Language Models Jul 16, 2024 Decoder Object
Code Code Available 3General Object Foundation Model for Images and Videos at Scale Dec 14, 2023 Instance Segmentation Long-tail Video Object Segmentation
Code Code Available 3Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation Apr 4, 2024 Contrastive Learning Referring Expression
Code Code Available 2Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation Mar 5, 2025 Object Referring Video Object Segmentation
Code Code Available 2GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Apr 10, 2025 Contrastive Learning Language Modeling
Code Code Available 2HyperSeg: Towards Universal Visual Segmentation with Large Language Model Nov 26, 2024 Language Modeling Large Language Model
Code Code Available 2Language as Queries for Referring Video Object Segmentation Jan 3, 2022 Object Object Tracking
Code Code Available 2MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions Aug 16, 2023 Motion Expressions Guided Video Segmentation Object
Code Code Available 2One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Sep 29, 2024 All Image Segmentation
Code Code Available 2The Devil is in Temporal Token: High Quality Video Reasoning Segmentation Jan 15, 2025 Reasoning Segmentation Referring Expression Segmentation
Code Code Available 2UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces Dec 25, 2023 Image Segmentation Object
Code Code Available 2VideoMolmo: Spatio-Temporal Grounding Meets Pointing Jun 5, 2025 Autonomous Driving Autonomous Navigation
Code Code Available 2VLT: Vision-Language Transformer and Query Generation for Referring Segmentation Oct 28, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 2Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus Jul 4, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark Aug 1, 2020 Object One-shot visual object segmentation
Code Code Available 1Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation Mar 18, 2024 Referring Video Object Segmentation Semantic Segmentation
Code Code Available 1Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation May 25, 2023 Object Referring Expression Segmentation
Code Code Available 1Referring Video Object Segmentation via Language-aligned Track Selection Dec 2, 2024 Object Object Tracking
Code Code Available 11st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation Jan 1, 2024 Object Referring Video Object Segmentation
Code Code Available 1RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation Jul 3, 2023 Image Segmentation Referring Expression
Code Code Available 1Local-Global Context Aware Transformer for Language-Guided Video Segmentation Mar 18, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1Tracking with Human-Intent Reasoning Dec 29, 2023 Language Modelling Object
Code Code Available 1LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation Jun 14, 2023 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1ActionVOS: Actions as Prompts for Video Object Segmentation Jul 10, 2024 Object Referring Video Object Segmentation
Code Code Available 11st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Jun 11, 2024 Referring Video Object Segmentation Segmentation
Code Code Available 1Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation Jun 8, 2022 Denoising Referring Video Object Segmentation
Code Code Available 1SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation May 26, 2023 cross-modal alignment Object
Code Code Available 1Spectrum-guided Multi-granularity Referring Video Object Segmentation Jul 25, 2023 Object Referring Expression Segmentation
Code Code Available 1MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation Jan 23, 2025 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1Multi-Attention Network for Compressed Video Referring Object Segmentation Jul 26, 2022 Object Referring Expression Segmentation
Code Code Available 1Temporally Consistent Referring Video Object Segmentation with Hybrid Memory Mar 28, 2024 HTR Object
Code Code Available 1End-to-End Referring Video Object Segmentation with Multimodal Transformers Nov 29, 2021 Inductive Bias Instance Segmentation
Code Code Available 11st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation Dec 27, 2022 Object Referring Video Object Segmentation
Code Code Available 1OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation Jul 18, 2023 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation Jun 18, 2024 Contrastive Learning Object
— Unverified 0UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track Aug 19, 2024 Referring Video Object Segmentation Semantic Segmentation
— Unverified 0InterRVOS: Interaction-aware Referring Video Object Segmentation Jun 3, 2025 8k Object
— Unverified 0Learning Referring Video Object Segmentation from Weak Annotation Aug 4, 2023 Contrastive Learning Object
— Unverified 0Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation May 19, 2025 Referring Video Object Segmentation Semantic Segmentation
— Unverified 0LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation Sep 9, 2024 Object Referring Video Object Segmentation
— Unverified 02nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Jun 20, 2024 Instance Segmentation Referring Video Object Segmentation
— Unverified 0Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation Jan 1, 2022 Object Referring Expression Segmentation
— Unverified 0