SOTAVerified

Referring Expression

The referring expression task places a bounding box around the instance in the provided image that matches the provided description.

Papers

Showing 151-200 of 364 papers

Title | Status | Hype
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Code | 0
Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation | Code | 0
Cross-Modal Self-Attention Network for Referring Image Segmentation | Code | 0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos | Code | 0
Visual Referring Expression Recognition: What Do Systems Actually Learn? | Code | 0
Adversarial Robustness for Visual Grounding of Multimodal Large Language Models | Code | 0
Single-Stream Multi-Level Alignment for Vision-Language Pretraining | Code | 0
Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers | Code | 0
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Code | 0
Learning To Segment Every Referring Object Point by Point | Code | 0
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions | Code | 0
Continual Referring Expression Comprehension via Dual Modular Memorization | Code | 0
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Code | 0
MAttNet: Modular Attention Network for Referring Expression Comprehension | Code | 0
Reasoning About Pragmatics with Neural Listeners and Speakers | Code | 0
Deconfounded Visual Grounding | Code | 0
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training | Code | 0
Grounding Referring Expressions in Images by Variational Context | Code | 0
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding | | 0
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation | | 0
Text-driven Affordance Learning from Egocentric Vision | | 0
The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts | | 0
The Pipeline Model for Resolution of Anaphoric Reference and Resolution of Entity Reference | | 0
The Solution for the 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge | | 0
The WebNLG Challenge: Generating Text from RDF Data | | 0
Toward Forgetting-Sensitive Referring Expression Generation for Integrated Robot Architectures | | 0
Towards Situated Dialogue: Revisiting Referring Expression Generation | | 0
Transcrib3D: 3D Referring Expression Resolution through Large Language Models | | 0
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks | | 0
UNITER: Learning UNiversal Image-TExt Representations | | 0
Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching | | 0
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos | | 0
Using Lexical Alignment and Referring Ability to Address Data Sparsity in Situated Dialog Reference Resolution | | 0
Using Referring Expression Generation to Model Literary Style | | 0
Utilizing Every Image Object for Semi-supervised Phrase Grounding | | 0
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions | | 0
Video Referring Expression Comprehension via Transformer with Content-aware Query | | 0
Video Referring Expression Comprehension via Transformer with Content-conditioned Query | | 0
Viewpoint-Aware Visual Grounding in 3D Scenes | | 0
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation | | 0
VLN BERT: A Recurrent Vision-and-Language BERT for Navigation | | 0
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching | | 0
VQD: Visual Query Detection in Natural Scenes | | 0
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar | | 0
Weakly-supervised segmentation of referring expressions | | 0
What can Neural Referential Form Selectors Learn? | | 0
Trainable Referring Expression Generation using Overspecification Preferences | | 0
3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation | | 0
A case study on context-bound referring expression generation | | 0
A Commercial Perspective on Reference | | 0
Page 4 of 8

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Random | Acc@0.5m | 14.6 | | Unverified