SOTAVerified

Referring Expression Comprehension

Papers

Showing 101150 of 167 papers

TitleStatusHype
Leveraging Non-Specialists for Accurate and Time Efficient AMR Annotation0
Lite-MDETR: A Lightweight Multi-Modal Detector0
The Solution for the 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge0
Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects0
M^2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension0
Make Graph-based Referring Expression Comprehension Great Again through Expression-guided Dynamic Gating and Regression0
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments0
MaskInversion: Localized Embeddings via Optimization of Explainability Maps0
A Real-Time Cross-modality Correlation Filtering Method for Referring Expression Comprehension0
Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction0
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary0
Evaluating and Improving Interactions with Hazy Oracles0
Modular Graph Attention Network for Complex Visual Relational Reasoning0
Webly Supervised Concept Expansion for General Purpose Vision Models0
MUTATT: Visual-Textual Mutual Guidance for Referring Expression Comprehension0
Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks0
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks0
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks0
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning0
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos0
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries0
Playing Lottery Tickets with Vision and Language0
VQD: Visual Query Detection in Natural Scenes0
PPGN: Phrase-Guided Proposal Generation Network For Referring Expression Comprehension0
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention0
PropTest: Automatic Property Testing for Improved Visual Programming0
Real-Time Referring Expression Comprehension by Single-Stage Grounding Network0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension0
UNITER: Learning UNiversal Image-TExt Representations0
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension0
RefCrowd: Grounding the Target in Crowd with Referring Expressions0
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar0
Referring Expression Comprehension: A Survey of Methods and Datasets0
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension0
Referring Expression Instance Retrieval and A Strong End-to-End Baseline0
Video Referring Expression Comprehension via Transformer with Content-aware Query0
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension0
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection TrainingCode0
Whether you can locate or not? Interactive Referring Expression GenerationCode0
Continual Referring Expression Comprehension via Dual Modular MemorizationCode0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language ModelsCode0
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and SegmentationCode0
Towards Language-guided Visual Recognition via Dynamic ConvolutionsCode0
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring ExpressionsCode0
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic RepresentationCode0
Scene-Text Oriented Reffering Expression ComprehensionCode0
Language Adaptive Weight Generation for Multi-task Visual GroundingCode0
Adversarial Robustness for Visual Grounding of Multimodal Large Language ModelsCode0
MAttNet: Modular Attention Network for Referring Expression ComprehensionCode0
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression ComprehensionCode0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.