SOTAVerified

Referring Expression Comprehension

Papers

Showing 61-70 of 167 papers

Title | Status | Hype
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding | Code | 1
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations | Code | 1
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Code | 1
Large-Scale Adversarial Training for Vision-and-Language Representation Learning | Code | 1
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding | Code | 1
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection | Code | 1
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension | Code | 1
Learning to Evaluate Performance of Multi-modal Semantic Localization | Code | 1
SeqTR: A Simple yet Universal Network for Visual Grounding | Code | 1
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions | Code | 1

No leaderboard results yet.