SOTAVerified

Referring Expression Comprehension

Papers

Showing 21-30 of 167 papers

Title | Status | Hype
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion | Code | 2
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Code | 1
Described Object Detection: Liberating Object Detection with Flexible Expressions | Code | 1
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension | Code | 1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Code | 1
Large-Scale Adversarial Training for Vision-and-Language Representation Learning | Code | 1
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Code | 1
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs | Code | 1
Learning to Evaluate Performance of Multi-modal Semantic Localization | Code | 1
Page 3 of 17

No leaderboard results yet.