The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation Apr 7, 2025 Inference Optimization Referring Video Object Segmentation
Code Code Available 54th PVUW MeViS 3rd Place Report: Sa2VA Apr 1, 2025 Language Modeling Language Modelling
Code Code Available 5Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Jan 7, 2025 2k Language Modeling
Code Code Available 5LISA: Reasoning Segmentation via Large Language Model Aug 1, 2023 Language Modeling Language Modelling
Code Code Available 4SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation Nov 26, 2024 Natural Language Understanding Referring Video Object Segmentation
Code Code Available 3VISA: Reasoning Video Object Segmentation via Large Language Models Jul 16, 2024 Decoder Object
Code Code Available 3UniVS: Unified and Universal Video Segmentation with Prompts as Queries Feb 28, 2024 Decoder Referring Expression Segmentation
Code Code Available 3General Object Foundation Model for Images and Videos at Scale Dec 14, 2023 Instance Segmentation Long-tail Video Object Segmentation
Code Code Available 3Tracking Anything with Decoupled Video Segmentation Sep 7, 2023 Open-Vocabulary Video Segmentation Open-World Video Segmentation
Code Code Available 3Universal Instance Perception as Object Discovery and Retrieval Mar 12, 2023 Described Object Detection Generalized Referring Expression Comprehension
Code Code Available 3VideoMolmo: Spatio-Temporal Grounding Meets Pointing Jun 5, 2025 Autonomous Driving Autonomous Navigation
Code Code Available 2GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Apr 10, 2025 Contrastive Learning Language Modeling
Code Code Available 2Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation Mar 5, 2025 Object Referring Video Object Segmentation
Code Code Available 2The Devil is in Temporal Token: High Quality Video Reasoning Segmentation Jan 15, 2025 Reasoning Segmentation Referring Expression Segmentation
Code Code Available 2HyperSeg: Towards Universal Visual Segmentation with Large Language Model Nov 26, 2024 Language Modeling Large Language Model
Code Code Available 2One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Sep 29, 2024 All Image Segmentation
Code Code Available 2Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation Apr 4, 2024 Contrastive Learning Referring Expression
Code Code Available 2UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces Dec 25, 2023 Image Segmentation Object
Code Code Available 2MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions Aug 16, 2023 Motion Expressions Guided Video Segmentation Object
Code Code Available 2VLT: Vision-Language Transformer and Query Generation for Referring Segmentation Oct 28, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 2Language as Queries for Referring Video Object Segmentation Jan 3, 2022 Object Object Tracking
Code Code Available 2MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation Jan 23, 2025 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1Referring Video Object Segmentation via Language-aligned Track Selection Dec 2, 2024 Object Object Tracking
Code Code Available 1ActionVOS: Actions as Prompts for Video Object Segmentation Jul 10, 2024 Object Referring Video Object Segmentation
Code Code Available 11st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation Jun 11, 2024 Referring Video Object Segmentation Segmentation
Code Code Available 1Temporally Consistent Referring Video Object Segmentation with Hybrid Memory Mar 28, 2024 HTR Object
Code Code Available 1Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation Mar 18, 2024 Referring Video Object Segmentation Semantic Segmentation
Code Code Available 11st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation Jan 1, 2024 Object Referring Video Object Segmentation
Code Code Available 1Tracking with Human-Intent Reasoning Dec 29, 2023 Language Modelling Object
Code Code Available 1Spectrum-guided Multi-granularity Referring Video Object Segmentation Jul 25, 2023 Object Referring Expression Segmentation
Code Code Available 1OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation Jul 18, 2023 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation Jul 3, 2023 Image Segmentation Referring Expression
Code Code Available 1LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation Jun 14, 2023 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation May 26, 2023 cross-modal alignment Object
Code Code Available 1Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation May 25, 2023 Object Referring Expression Segmentation
Code Code Available 11st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation Dec 27, 2022 Object Referring Video Object Segmentation
Code Code Available 1Multi-Attention Network for Compressed Video Referring Object Segmentation Jul 26, 2022 Object Referring Expression Segmentation
Code Code Available 1Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus Jul 4, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation Jun 8, 2022 Denoising Referring Video Object Segmentation
Code Code Available 1Local-Global Context Aware Transformer for Language-Guided Video Segmentation Mar 18, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1End-to-End Referring Video Object Segmentation with Multimodal Transformers Nov 29, 2021 Inductive Bias Instance Segmentation
Code Code Available 1URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark Aug 1, 2020 Object One-shot visual object segmentation
Code Code Available 1InterRVOS: Interaction-aware Referring Video Object Segmentation Jun 3, 2025 8k Object
— Unverified 0Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation May 19, 2025 Referring Video Object Segmentation Semantic Segmentation
— Unverified 0Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching Apr 18, 2025 Object Referring Video Object Segmentation
Code Code Available 0ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025 Mar 30, 2025 Object Referring Video Object Segmentation
Code Code Available 0ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Jan 24, 2025 Decoder Object
— Unverified 0InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Jan 21, 2025 Object Tracking Referring Expression Segmentation
Code Code Available 0Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation Jan 9, 2025 Referring Video Object Segmentation Semantic Segmentation
Code Code Available 0DTOS: Dynamic Time Object Sensing with Large Multimodal Model Jan 1, 2025 Moment Retrieval Referring Video Object Segmentation
Code Code Available 0