SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 4150 of 364 papers

TitleStatusHype
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension0
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding0
Towards Visual Grounding: A SurveyCode3
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression SegmentationCode1
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension0
Instance-Aware Generalized Referring Expression Segmentation0
Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation0
SegLLM: Multi-round Reasoning Segmentation0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal ModelsCode0
Text4Seg: Reimagining Image Segmentation as Text GenerationCode2
Show:102550
← PrevPage 5 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified