SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 151200 of 364 papers

TitleStatusHype
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training0
Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension0
Corpus-based Referring Expressions Generation0
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding0
Creating Training Corpora for NLG Micro-Planners0
Decoding Strategies for Neural Referring Expression Generation0
Decoupling Pragmatics: Discriminative Decoding for Referring Expression Generation0
Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model0
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension0
DisCLIP: Open-Vocabulary Referring Expression Generation0
Discovering User Groups for Natural Language Generation0
Dual Convolutional LSTM Network for Referring Image Segmentation0
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension0
Dynamic Graph Attention for Referring Expression Comprehension0
Dynamic Inference With Grounding Based Vision and Language Models0
Easy Things First: Installments Improve Referring Expression Generation for Objects in Photographs0
End-to-End Neural Context Reconstruction in Chinese Dialogue0
Event versus entity co-reference: Effects of context and form of referring expression0
Exploring Spatial Language Grounding Through Referring Expressions0
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images0
FindIt: Generalized Localization with Natural Language Queries0
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning0
Fuzzy Logic for Vagueness Management in Referring Expression Generation0
Generalizable Entity Grounding via Assistance of Large Language Model0
Generating Quantified Referring Expressions through Attention-Driven Incremental Perception0
Generating Texts with Integer Linear Programming0
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing0
Gera \~ao de Express\~oes de Refer\^encia usando Rela \~oes Espaciais (Referring Expression Generation Using Spatial Relations) [in Portuguese]0
Getting to ``Hearer-old'': Charting Referring Expressions Across Time0
Goal-driven text descriptions for images0
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane0
Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation0
G-TUNA: a corpus of referring expressions in German, including duration information0
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension0
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension0
HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes0
Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities0
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing0
Improving the generation of personalised descriptions0
Improving the Naturalness and Diversity of Referring Expression Generation models using Minimum Risk Training0
Informativity in Image Captions vs. Referring Expressions0
Instance-Aware Generalized Referring Expression Segmentation0
Intrinsic Task-based Evaluation for Referring Expression Generation0
Justifying Corpus-Based Choices in Referring Expression Generation0
Key-Word-Aware Network for Referring Expression Image Segmentation0
Language Controls More Than Top-Down Attention: Modulating Bottom-Up Visual Processing with Referring Expressions0
Language-Guided 3D Object Detection in Point Cloud for Autonomous Driving0
Language-Mediated, Object-Centric Representation Learning0
Show:102550
← PrevPage 4 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified