SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 3140 of 364 papers

TitleStatusHype
GSVA: Generalized Segmentation via Multimodal Large Language ModelsCode1
A Fast and Accurate One-Stage Approach to Visual GroundingCode1
A Recurrent Vision-and-Language BERT for NavigationCode1
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression SegmentationCode1
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word EmphasisCode1
Described Object Detection: Liberating Object Detection with Flexible ExpressionsCode1
Advancing Referring Expression Segmentation Beyond Single ImageCode1
GRIT: General Robust Image Task BenchmarkCode1
An Open and Comprehensive Pipeline for Unified Object Grounding and DetectionCode1
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image SegmentationCode1
Show:102550
← PrevPage 4 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified