Referring Video Object Segmentation

Referring video object segmentation aims to segment an object in a video given a language expression that describes it. Unlike conventional video object segmentation, the task relies on a different type of supervision: the language expression is used to identify and segment the referred object in the video.

Papers

Showing 11-20 of 74 papers

Title | Status | Hype
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Code | 1
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling | Code | 0
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation | Code | 2
Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation | Code | 0
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Code | 5
DTOS: Dynamic Time Object Sensing with Large Multimodal Model | Code | 0
Semantic and Sequential Alignment for Referring Video Object Segmentation | - | 0
Decoupled Motion Expression Video Segmentation | - | 0
Referring Video Object Segmentation via Language-aligned Track Selection | Code | 1
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation | Code | 3

No leaderboard results yet.