SOTAVerified

Referring Expression Comprehension

Papers

Showing 1120 of 167 papers

TitleStatusHype
General Object Foundation Model for Images and Videos at ScaleCode3
Towards Visual Grounding: A SurveyCode3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited ModalitiesCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
GREC: Generalized Referring Expression ComprehensionCode2
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text ModelsCode2
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal FusionCode2
Elysium: Exploring Object-level Perception in Videos via MLLMCode2
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal ModelsCode2
Show:102550
← PrevPage 2 of 17Next →

No leaderboard results yet.