SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 101125 of 364 papers

TitleStatusHype
VL-BERT: Pre-training of Generic Visual-Linguistic RepresentationsCode1
A Fast and Accurate One-Stage Approach to Visual GroundingCode1
Relationship-Embedded Representation Learning for Grounding Referring ExpressionsCode1
Generating Easy-to-Understand Referring Expressions for Target IdentificationsCode1
Colors in Context: A Pragmatic Neural Model for Grounded Language UnderstandingCode1
Modeling Context in Referring ExpressionsCode1
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval0
Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language ModelsCode0
Referring Expression Instance Retrieval and A Strong End-to-End Baseline0
Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation0
Synthetic Visual Genome0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
Refer to Anything with Vision-Language Prompts0
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning0
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions0
Improving Contrastive Learning for Referring Expression CountingCode0
Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model0
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and SegmentationCode0
Learning to Reason and Navigate: Parameter Efficient Action Planning with Large Language Models0
RESAnything: Attribute Prompting for Arbitrary Referring Segmentation0
Vision-Language Models Are Not Pragmatically Competent in Referring Expression GenerationCode0
LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation0
3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation0
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target GranularitiesCode0
MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote SensingCode0
Show:102550
← PrevPage 5 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified