SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 161170 of 364 papers

TitleStatusHype
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun DistillationCode1
SQA3D: Situated Question Answering in 3D ScenesCode1
Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset0
VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature AlignmentCode1
Video Referring Expression Comprehension via Transformer with Content-aware Query0
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic ApproachCode0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in VideosCode0
Learning to Evaluate Performance of Multi-modal Semantic LocalizationCode1
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning0
Correspondence Matters for Video Referring Expression ComprehensionCode1
Show:102550
← PrevPage 17 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified