SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 111120 of 364 papers

TitleStatusHype
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and CaptionsCode1
Continual Referring Expression Comprehension via Dual Modular MemorizationCode0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models0
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language ModelsCode0
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEsCode1
NExT-Chat: An LMM for Chat, Detection and SegmentationCode2
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding0
GLaMM: Pixel Grounding Large Multimodal ModelCode2
Towards Omni-supervised Referring Expression SegmentationCode0
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation0
Show:102550
← PrevPage 12 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified