SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 201250 of 364 papers

TitleStatusHype
Scene-Text Oriented Reffering Expression ComprehensionCode0
Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset0
Video Referring Expression Comprehension via Transformer with Content-aware Query0
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic ApproachCode0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in VideosCode0
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning0
Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression GroundingCode0
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks0
RefCrowd: Grounding the Target in Crowd with Referring Expressions0
Constructing Distributions of Variation in Referring Expression Type from Corpora for Model Evaluation0
Referring Expressions with Rational Speech Act Framework: A Probabilistic Approach0
Weakly-supervised segmentation of referring expressions0
HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes0
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension0
FindIt: Generalized Localization with Natural Language Queries0
Single-Stream Multi-Level Alignment for Vision-Language PretrainingCode0
Non-neural Models Matter: A Re-evaluation of Neural Referring Expression Generation Systems0
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkCode0
Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching0
Lite-MDETR: A Lightweight Multi-Modal Detector0
Deconfounded Visual GroundingCode0
Robust Visual Reasoning via Language Guided Neural Module Networks0
Using Referring Expression Generation to Model Literary Style0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension0
The Pipeline Model for Resolution of Anaphoric Reference and Resolution of Entity Reference0
Evaluating and Improving Interactions with Hazy Oracles0
Towards Language-guided Visual Recognition via Dynamic ConvolutionsCode0
Decoupling Pragmatics: Discriminative Decoding for Referring Expression Generation0
Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolutionCode0
Goal-driven text descriptions for images0
What can Neural Referential Form Selectors Learn?0
Enriching the E2E datasetCode0
VLN BERT: A Recurrent Vision-and-Language BERT for Navigation0
Bridging the Gap Between Object Detection and User Intent via Query-Modulation0
Giving Commands to a Self-Driving Car: How to Deal with Uncertain Situations?Code0
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic RepresentationCode0
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching0
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention0
Playing Lottery Tickets with Vision and Language0
Understanding Synonymous Referring Expressions via Contrastive FeaturesCode0
Perspective-corrected Spatial Referring Expression Generation for Human-Robot Interaction0
Scene-Intuitive Agent for Remote Embodied Visual Grounding0
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos0
Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network0
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation0
Language Controls More Than Top-Down Attention: Modulating Bottom-Up Visual Processing with Referring Expressions0
Language-Mediated, Object-Centric Representation Learning0
PPGN: Phrase-Guided Proposal Generation Network For Referring Expression Comprehension0
CoNAN: A Complementary Neighboring-based Attention Network for Referring Expression Generation0
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified