SOTAVerified

Referring Expression

The referring expression task places a bounding box around the instance in the provided image that corresponds to the provided description.
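As a rough illustration of how a predicted box can be scored against the annotated one, the sketch below computes intersection over union (IoU) and counts a prediction as correct when IoU reaches a threshold. The box format `(x_min, y_min, x_max, y_max)`, the function names, and the 0.5 threshold are assumptions made for this example; they are not taken from this page, and individual benchmarks may define their metrics differently.

```python
from typing import Sequence, Tuple

# Assumed box format for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def accuracy_at_iou(
    preds: Sequence[Box], targets: Sequence[Box], threshold: float = 0.5
) -> float:
    """Fraction of predictions whose IoU with the target meets the threshold."""
    if len(preds) != len(targets):
        raise ValueError("preds and targets must have the same length")
    if not preds:
        return 0.0
    hits = sum(iou(p, t) >= threshold for p, t in zip(preds, targets))
    return hits / len(preds)
```

Here `accuracy_at_iou` expects one predicted box and one annotated box per (image, description) pair, in the same order.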

Papers

Showing 201-225 of 364 papers

| Title | Status | Hype |
| --- | --- | --- |
| Scene-Text Oriented Referring Expression Comprehension | Code | 0 |
| Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset | | 0 |
| Video Referring Expression Comprehension via Transformer with Content-aware Query | | 0 |
| Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach | Code | 0 |
| Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos | Code | 0 |
| One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning | | 0 |
| Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Code | 0 |
| Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks | | 0 |
| RefCrowd: Grounding the Target in Crowd with Referring Expressions | | 0 |
| Constructing Distributions of Variation in Referring Expression Type from Corpora for Model Evaluation | | 0 |
| Referring Expressions with Rational Speech Act Framework: A Probabilistic Approach | | 0 |
| Weakly-supervised segmentation of referring expressions | | 0 |
| HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes | | 0 |
| Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension | | 0 |
| FindIt: Generalized Localization with Natural Language Queries | | 0 |
| Single-Stream Multi-Level Alignment for Vision-Language Pretraining | Code | 0 |
| Non-neural Models Matter: A Re-evaluation of Neural Referring Expression Generation Systems | | 0 |
| Differentiated Relevances Embedding for Group-based Referring Expression Comprehension | | 0 |
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Code | 0 |
| Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching | | 0 |
| Lite-MDETR: A Lightweight Multi-Modal Detector | | 0 |
| Deconfounded Visual Grounding | Code | 0 |
| Robust Visual Reasoning via Language Guided Neural Module Networks | | 0 |
| Using Referring Expression Generation to Model Literary Style | | 0 |
| ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Random | Acc@0.5m | 14.6 | | Unverified |