SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 6170 of 364 papers

TitleStatusHype
Layout-aware Dreamer for Embodied Referring Expression GroundingCode1
MDETR -- Modulated Detection for End-to-End Multi-Modal UnderstandingCode1
Improving Visual Grounding by Encouraging Consistent Gradient-based ExplanationsCode1
Human-centric Spatio-Temporal Video Grounding With Visual TransformersCode1
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression SegmentationCode1
GSVA: Generalized Segmentation via Multimodal Large Language ModelsCode1
3D-GRES: Generalized 3D Referring Expression SegmentationCode1
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression SegmentationCode1
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement LearningCode1
GRIT: General Robust Image Task BenchmarkCode1
Show:102550
← PrevPage 7 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified