SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 101110 of 364 papers

TitleStatusHype
Multi-modal Instruction Tuned LLMs with Fine-grained Visual PerceptionCode1
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEsCode1
NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic ReasoningCode1
Multi-task Visual Grounding with Coarse-to-Fine Consistency ConstraintsCode1
UNITER: UNiversal Image-TExt Representation LearningCode1
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM CollaborationCode1
Deconfounded Visual GroundingCode0
A Joint Speaker-Listener-Reinforcer Model for Referring ExpressionsCode0
Reasoning About Pragmatics with Neural Listeners and SpeakersCode0
Cross-Modal Self-Attention Network for Referring Image SegmentationCode0
Show:102550
← PrevPage 11 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified