SOTAVerified

Referring Expression Comprehension

Papers

Showing 76–100 of 167 papers

Title | Status | Hype
Collecting Visually-Grounded Dialogue with A Game Of Sorts | Code | 0
Scene-Text Oriented Referring Expression Comprehension | Code | 0
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge | Code | 0
MAttNet: Modular Attention Network for Referring Expression Comprehension | Code | 0
Cosine meets Softmax: A tough-to-beat baseline for visual grounding | Code | 0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | Code | 0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models | Code | 0
Adversarial Robustness for Visual Grounding of Multimodal Large Language Models | Code | 0
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension | Code | 0
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training | Code | 0
HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks | Code | 0
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions | Code | 0
Whether you can locate or not? Interactive Referring Expression Generation | Code | 0
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models | Code | 0
A Real-time Global Inference Network for One-stage Referring Expression Comprehension | Code | 0
Language Adaptive Weight Generation for Multi-task Visual Grounding | Code | 0
Language-Conditioned Feature Pyramids for Visual Selection Tasks | Code | 0
Language-Conditioned Graph Networks for Relational Reasoning | Code | 0
Understanding Synonymous Referring Expressions via Contrastive Features | Code | 0
Referring Expression Comprehension Using Language Adaptive Inference | Code | 0
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation | Code | 0
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation | Code | 0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos | Code | 0
AttnGrounder: Talking to Cars with Attention | Code | 0
Natural Language Object Retrieval | Code | 0

No leaderboard results yet.