Referring Expression

Referring expression comprehension places a bounding box around the instance in an image that corresponds to the provided description.

Papers

Showing 101–125 of 364 papers

Title	Date	Tasks	Status	Hype
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation	Jan 1, 2024	Descriptive, Object	Code Available	2
Viewpoint-Aware Visual Grounding in 3D Scenes	Jan 1, 2024	3D visual grounding, Referring Expression	Unverified	0
Referring Expression Counting	Jan 1, 2024	8k, object-detection	Code Available	1
Tune-An-Ellipse: CLIP Has Potential to Find What You Want	Jan 1, 2024	Object, Referring Expression	Code Available	1
Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction	Dec 21, 2023	16k, Attribute	Unverified	0
GSVA: Generalized Segmentation via Multimodal Large Language Models	Dec 15, 2023	Decoder, Generalized Referring Expression Segmentation	Code Available	1
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation	Dec 13, 2023	Descriptive, Object	Code Available	1
Localized Symbolic Knowledge Distillation for Visual Commonsense Models	Dec 8, 2023	Image Description, Instruction Following	Code Available	0
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection	Dec 4, 2023	Image to text, object-detection	Unverified	0
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation	Nov 30, 2023	Image Captioning, Referring Expression	Code Available	0
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions	Nov 28, 2023	Disentanglement, Referring Expression	Code Available	1
Continual Referring Expression Comprehension via Dual Modular Memorization	Nov 25, 2023	Memorization, Referring Expression	Code Available	0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models	Nov 24, 2023	All, Referring Expression	Unverified	0
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models	Nov 21, 2023	Image Segmentation, Language Modelling	Code Available	0
NExT-Chat: An LMM for Chat, Detection and Segmentation	Nov 8, 2023	Referring Expression, Referring Expression Segmentation	Code Available	2
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs	Nov 8, 2023	Question Answering, Referring Expression	Code Available	1
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding	Nov 6, 2023	CoLA, Question Answering	Unverified	0
GLaMM: Pixel Grounding Large Multimodal Model	Nov 6, 2023	Conversational Question Answering, Image Captioning	Code Available	2
Towards Omni-supervised Referring Expression Segmentation	Nov 1, 2023	Referring Expression, Referring Expression Segmentation	Code Available	0
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation	Oct 27, 2023	Image Segmentation, Referring Expression	Unverified	0
Video Referring Expression Comprehension via Transformer with Content-conditioned Query	Oct 25, 2023	cross-modal alignment, Referring Expression	Unverified	0
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V	Oct 17, 2023	Interactive Segmentation, Referring Expression	Code Available	4
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs	Oct 1, 2023	Referring Expression	Code Available	1
Multi-modal Domain Adaptation for REG via Relation Transfer	Sep 23, 2023	Domain Adaptation, image-classification	Unverified	0
CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation	Sep 17, 2023	Decoder, Referring Expression	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Random	Acc@0.5m	14.6	—	Unverified