SOTAVerified

Referring Expression Comprehension

Papers

Showing 76–100 of 167 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dynamic Graph Attention for Referring Expression Comprehension | | 0 |
| Dynamic Inference With Grounding Based Vision and Language Models | | 0 |
| DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension | | 0 |
| Differentiated Relevances Embedding for Group-based Referring Expression Comprehension | | 0 |
| ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph | | 0 |
| Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input | | 0 |
| Exploring Spatial Language Grounding Through Referring Expressions | | 0 |
| FindIt: Generalized Localization with Natural Language Queries | | 0 |
| Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks | | 0 |
| FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | | 0 |
| Deep Fragment Embeddings for Bidirectional Image Sentence Mapping | | 0 |
| CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding | | 0 |
| Synthetic Visual Genome | | 0 |
| GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | | 0 |
| Giving Commands to a Self-driving Car: A Multimodal Reasoner for Visual Grounding | | 0 |
| Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension | | 0 |
| Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension | | 0 |
| Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension | | 0 |
| Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training | | 0 |
| Video Referring Expression Comprehension via Transformer with Content-conditioned Query | | 0 |
| Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding | | 0 |
| Language-Guided 3D Object Detection in Point Cloud for Autonomous Driving | | 0 |
| Language-Mediated, Object-Centric Representation Learning | | 0 |
| Text-driven Affordance Learning from Egocentric Vision | | 0 |
| Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection | | 0 |

No leaderboard results yet.