Referring Video Object Segmentation

Referring video object segmentation (RVOS) aims to segment an object in a video given a natural-language expression that describes it. Unlike conventional video object segmentation, the task relies on a different type of supervision, language expressions, to identify and segment the referred object in the video.

Papers

Showing 51–60 of 74 papers

| Title | Status | Hype |
| --- | --- | --- |
| Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation | | 0 |
| ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations | | 0 |
| Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation | | 0 |
| Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation | | 0 |
| Robust Referring Video Object Segmentation with Cyclic Structural Consensus | | 0 |
| Decoupled Motion Expression Video Segmentation | | 0 |
| Segment Every Reference Object in Spatial and Temporal Spaces | | 0 |
| Semantic and Sequential Alignment for Referring Video Object Segmentation | | 0 |
| Temporal Collection and Distribution for Referring Video Object Segmentation | | 0 |
| The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation | | 0 |

No leaderboard results yet.