SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 151200 of 364 papers

TitleStatusHype
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object DetectionCode5
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression ComprehensionCode0
Learning To Segment Every Referring Object Point by PointCode0
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension0
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension0
Dynamic Inference With Grounding Based Vision and Language Models0
Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning0
Layout-aware Dreamer for Embodied Referring Expression GroundingCode1
A Unified Mutual Supervision Framework for Referring Expression Segmentation and Generation0
Scene-Text Oriented Reffering Expression ComprehensionCode0
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun DistillationCode1
SQA3D: Situated Question Answering in 3D ScenesCode1
Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset0
VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature AlignmentCode1
Video Referring Expression Comprehension via Transformer with Content-aware Query0
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic ApproachCode0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in VideosCode0
Learning to Evaluate Performance of Multi-modal Semantic LocalizationCode1
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning0
Correspondence Matters for Video Referring Expression ComprehensionCode1
Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression GroundingCode0
Improving Visual Grounding by Encouraging Consistent Gradient-based ExplanationsCode1
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks0
RefCrowd: Grounding the Target in Crowd with Referring Expressions0
Constructing Distributions of Variation in Referring Expression Type from Corpora for Model Evaluation0
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language ModelsCode1
Referring Expressions with Rational Speech Act Framework: A Probabilistic Approach0
Weakly-supervised segmentation of referring expressions0
HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes0
GRIT: General Robust Image Task BenchmarkCode1
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension0
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression ComprehensionCode1
The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary TextsCode1
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression ComprehensionCode1
FindIt: Generalized Localization with Natural Language Queries0
SeqTR: A Simple yet Universal Network for Visual GroundingCode1
Single-Stream Multi-Level Alignment for Vision-Language PretrainingCode0
Non-neural Models Matter: A Re-evaluation of Neural Referring Expression Generation Systems0
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkCode0
Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching0
Lite-MDETR: A Lightweight Multi-Modal Detector0
Deconfounded Visual GroundingCode0
Image Segmentation Using Text and Image PromptsCode1
LAVT: Language-Aware Vision Transformer for Referring Image SegmentationCode1
Using Referring Expression Generation to Model Literary Style0
Robust Visual Reasoning via Language Guided Neural Module Networks0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension0
The Pipeline Model for Resolution of Anaphoric Reference and Resolution of Entity Reference0
Evaluating and Improving Interactions with Hazy Oracles0
Show:102550
← PrevPage 4 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified