SOTAVerified

Referring Expression Comprehension

Papers

Showing 41–50 of 167 papers

Title | Status | Hype
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Code | 7
PropTest: Automatic Property Testing for Improved Visual Programming | | 0
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Code | 1
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar | | 0
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training | | 0
Efficient Multimodal Learning from Data-centric Perspective | Code | 5
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Code | 1
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection | Code | 1
Revisiting Counterfactual Problems in Referring Expression Comprehension | Code | 0

No leaderboard results yet.