SOTAVerified

Referring Video Object Segmentation

Referring video object segmentation aims at segmenting an object in video with language expressions. Unlike the previous video object segmentation, the task exploits a different type of supervision, language expressions, to identify and segment an object referred by the given language expressions in a video.

Papers

Showing 110 of 74 papers

TitleStatusHype
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video SegmentationCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
LISA: Reasoning Segmentation via Large Language ModelCode4
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video SegmentationCode3
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
General Object Foundation Model for Images and Videos at ScaleCode3
Tracking Anything with Decoupled Video SegmentationCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
Show:102550
← PrevPage 1 of 8Next →

No leaderboard results yet.