SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 5160 of 364 papers

TitleStatusHype
Learning to Evaluate Performance of Multi-modal Semantic LocalizationCode1
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity RecognitionCode1
Correspondence Matters for Video Referring Expression ComprehensionCode1
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression ComprehensionCode1
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement LearningCode1
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image SegmentationCode1
Relationship-Embedded Representation Learning for Grounding Referring ExpressionsCode1
Airbert: In-domain Pretraining for Vision-and-Language NavigationCode1
Multi-task Collaborative Network for Joint Referring Expression Comprehension and SegmentationCode1
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEsCode1
Show:102550
← PrevPage 6 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified