Referring Expression

The referring expression task takes an image and a natural-language description and places a bounding box around the instance in the image that the description refers to.
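As a rough illustration of how a predicted box can be scored against a ground-truth box, the sketch below computes intersection-over-union (IoU) and the fraction of predictions whose IoU meets a threshold. This is an assumption about evaluation, not something stated on this page: the `(x_min, y_min, x_max, y_max)` box layout, the function names `iou` and `accuracy_at_iou`, and the default threshold of 0.5 are all hypothetical choices made for the example, and individual benchmarks may define their metrics differently.

```python
from typing import Sequence, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    # Width and height of the overlap, clamped at zero when the boxes are disjoint.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def accuracy_at_iou(
    preds: Sequence[Box], targets: Sequence[Box], threshold: float = 0.5
) -> float:
    """Fraction of predictions whose IoU with the target is >= threshold."""
    if len(preds) != len(targets):
        raise ValueError("preds and targets must have the same length")
    if not preds:
        return 0.0
    hits = sum(iou(p, t) >= threshold for p, t in zip(preds, targets))
    return hits / len(preds)
```

For example, `accuracy_at_iou([(0, 0, 2, 2)], [(0, 0, 2, 2)])` counts the single prediction as correct because the two boxes coincide. Whether the comparison should be `>=` or strictly `>` varies between evaluation scripts, so treat that detail as adjustable.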

Papers

Showing 261-270 of 364 papers

| Title | Status | Hype |
| --- | --- | --- |
| Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation | | 0 |
| Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | | 0 |
| Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models | | 0 |
| G-TUNA: a corpus of referring expressions in German, including duration information | | 0 |
| Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension | | 0 |
| Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension | | 0 |
| HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed Scenes | | 0 |
| Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities | | 0 |
| Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing | | 0 |
| Improving the generation of personalised descriptions | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Random | Acc@0.5m | 14.6 | | Unverified |