SOTAVerified

Referring Video Object Segmentation

Referring video object segmentation aims at segmenting an object in video with language expressions. Unlike the previous video object segmentation, the task exploits a different type of supervision, language expressions, to identify and segment an object referred by the given language expressions in a video.

Papers

Showing 4150 of 74 papers

TitleStatusHype
UniRef++: Segment Every Reference Object in Spatial and Temporal SpacesCode2
General Object Foundation Model for Images and Videos at ScaleCode3
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation0
Temporal Collection and Distribution for Referring Video Object Segmentation0
Tracking Anything with Decoupled Video SegmentationCode3
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited SamplesCode0
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsCode2
Expression Prompt Collaboration Transformer for Universal Referring Video Object SegmentationCode0
Learning Referring Video Object Segmentation from Weak Annotation0
LISA: Reasoning Segmentation via Large Language ModelCode4
Show:102550
← PrevPage 5 of 8Next →

No leaderboard results yet.