SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 7180 of 364 papers

TitleStatusHype
Correspondence Matters for Video Referring Expression ComprehensionCode1
Improving Visual Grounding by Encouraging Consistent Gradient-based ExplanationsCode1
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language ModelsCode1
GRIT: General Robust Image Task BenchmarkCode1
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression ComprehensionCode1
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression ComprehensionCode1
The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary TextsCode1
SeqTR: A Simple yet Universal Network for Visual GroundingCode1
Image Segmentation Using Text and Image PromptsCode1
LAVT: Language-Aware Vision Transformer for Referring Image SegmentationCode1
Show:102550
← PrevPage 8 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified