SOTAVerified

Referring Expression Comprehension

Papers

Showing 101150 of 167 papers

TitleStatusHype
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection0
Continual Referring Expression Comprehension via Dual Modular MemorizationCode0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language ModelsCode0
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language ModelsCode0
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding0
Video Referring Expression Comprehension via Transformer with Content-conditioned Query0
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasksCode0
Whether you can locate or not? Interactive Referring Expression GenerationCode0
Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks0
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input0
Language Adaptive Weight Generation for Multi-task Visual GroundingCode0
Referring Expression Comprehension Using Language Adaptive InferenceCode0
Language-Guided 3D Object Detection in Point Cloud for Autonomous Driving0
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression ComprehensionCode0
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension0
Dynamic Inference With Grounding Based Vision and Language Models0
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension0
Scene-Text Oriented Reffering Expression ComprehensionCode0
Video Referring Expression Comprehension via Transformer with Content-aware Query0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in VideosCode0
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning0
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks0
RefCrowd: Grounding the Target in Crowd with Referring Expressions0
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension0
FindIt: Generalized Localization with Natural Language Queries0
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkCode0
Webly Supervised Concept Expansion for General Purpose Vision Models0
Lite-MDETR: A Lightweight Multi-Modal Detector0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension0
Evaluating and Improving Interactions with Hazy Oracles0
Towards Language-guided Visual Recognition via Dynamic ConvolutionsCode0
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic RepresentationCode0
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention0
Playing Lottery Tickets with Vision and Language0
Understanding Synonymous Referring Expressions via Contrastive FeaturesCode0
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos0
Language-Mediated, Object-Centric Representation Learning0
PPGN: Phrase-Guided Proposal Generation Network For Referring Expression Comprehension0
Modular Graph Attention Network for Complex Visual Relational Reasoning0
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments0
Language-Conditioned Feature Pyramids for Visual Selection TasksCode0
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary0
Cosine meets Softmax: A tough-to-beat baseline for visual groundingCode0
AttnGrounder: Talking to Cars with AttentionCode0
Referring Expression Comprehension: A Survey of Methods and Datasets0
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph0
Give Me Something to Eat: Referring Expression Comprehension with Commonsense KnowledgeCode0
Leveraging Non-Specialists for Accurate and Time Efficient AMR Annotation0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.