SOTAVerified

Referring Expression Comprehension

Papers

Showing 41–50 of 167 papers

Title | Status | Hype
Described Object Detection: Liberating Object Detection with Flexible Expressions | Code | 1
Kosmos-2: Grounding Multimodal Large Language Models to the World | Code | 1
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations | Code | 1
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | Code | 1
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding | Code | 1
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation | Code | 1
VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment | Code | 1
Learning to Evaluate Performance of Multi-modal Semantic Localization | Code | 1
Correspondence Matters for Video Referring Expression Comprehension | Code | 1
Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations | Code | 1

No leaderboard results yet.