SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 151200 of 364 papers

TitleStatusHype
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos0
Combining Referring Expression Generation and Surface Realization: A Corpus-Based Investigation of Architectures0
Evaluating and Improving Interactions with Hazy Oracles0
Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction0
Comprehension-guided referring expressions0
Computational Interpretations of Recency for the Choice of Referring Expressions in Discourse0
CoNAN: A Complementary Neighboring-based Attention Network for Referring Expression Generation0
Constructing Distributions of Variation in Referring Expression Type from Corpora for Model Evaluation0
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training0
Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension0
Corpus-based Referring Expressions Generation0
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding0
Creating Training Corpora for NLG Micro-Planners0
Decoding Strategies for Neural Referring Expression Generation0
Decoupling Pragmatics: Discriminative Decoding for Referring Expression Generation0
Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model0
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension0
DisCLIP: Open-Vocabulary Referring Expression Generation0
Discovering User Groups for Natural Language Generation0
Dual Convolutional LSTM Network for Referring Image Segmentation0
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension0
Dynamic Graph Attention for Referring Expression Comprehension0
Dynamic Inference With Grounding Based Vision and Language Models0
Easy Things First: Installments Improve Referring Expression Generation for Objects in Photographs0
End-to-End Neural Context Reconstruction in Chinese Dialogue0
Event versus entity co-reference: Effects of context and form of referring expression0
Exploring Spatial Language Grounding Through Referring Expressions0
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images0
FindIt: Generalized Localization with Natural Language Queries0
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning0
Fuzzy Logic for Vagueness Management in Referring Expression Generation0
Generalizable Entity Grounding via Assistance of Large Language Model0
Generating Quantified Referring Expressions through Attention-Driven Incremental Perception0
Generating Texts with Integer Linear Programming0
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing0
Gera \~ao de Express\~oes de Refer\^encia usando Rela \~oes Espaciais (Referring Expression Generation Using Spatial Relations) [in Portuguese]0
Getting to ``Hearer-old'': Charting Referring Expressions Across Time0
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge0
Goal-driven text descriptions for images0
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane0
Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models0
G-TUNA: a corpus of referring expressions in German, including duration information0
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension0
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension0
HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes0
Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities0
Show:102550
← PrevPage 4 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified