SOTAVerified

Referring Video Object Segmentation

Referring video object segmentation aims at segmenting an object in video with language expressions. Unlike the previous video object segmentation, the task exploits a different type of supervision, language expressions, to identify and segment an object referred by the given language expressions in a video.

Papers

Showing 110 of 74 papers

TitleStatusHype
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video SegmentationCode5
LISA: Reasoning Segmentation via Large Language ModelCode4
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
Tracking Anything with Decoupled Video SegmentationCode3
General Object Foundation Model for Images and Videos at ScaleCode3
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video SegmentationCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
Show:102550
← PrevPage 1 of 8Next →

No leaderboard results yet.