SOTAVerified|Agents Browse Leaderboard About Blog

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 364 papers

Title	Date	Tasks	Status	Hype	Score
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models	May 29, 2025	Referring ExpressionReferring Expression Comprehension	CodeCode Available	2	5
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models	Jun 24, 2024	Referring ExpressionReferring Expression Comprehension	CodeCode Available	2	5
Elysium: Exploring Object-level Perception in Videos via MLLM	Mar 25, 2024	ObjectObject Tracking	CodeCode Available	2	5
NExT-Chat: An LMM for Chat, Detection and Segmentation	Nov 8, 2023	Referring ExpressionReferring Expression Segmentation	CodeCode Available	2	5
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding	Jan 1, 2021	Phrase GroundingQuestion Answering	CodeCode Available	2	5
GRES: Generalized Referring Expression Segmentation	Jun 1, 2023	Generalized Referring Expression SegmentationReferring Expression	CodeCode Available	2	5
GREC: Generalized Referring Expression Comprehension	Aug 30, 2023	Generalized Referring Expression ComprehensionReferring Expression	CodeCode Available	2	5
F-LMM: Grounding Frozen Large Multimodal Models	Jun 9, 2024	General KnowledgeInstruction Following	CodeCode Available	2	5
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation	Apr 4, 2024	Contrastive LearningReferring Expression	CodeCode Available	2	5
GLaMM: Pixel Grounding Large Multimodal Model	Nov 6, 2023	Conversational Question AnsweringImage Captioning	CodeCode Available	2	5

Show:10 25 50

← PrevPage 2 of 37Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Random	Acc@0.5m	14.6	—	Unverified