Person-centric Visual Grounding

Person-centric visual grounding is the problem of linking people named in a caption to the people pictured in an image. The task was introduced in "Who's Waldo? Linking People Across Text and Images" (Cui et al., ICCV 2021).

Papers

Showing 1-4 of 4 papers

Title | Status | Hype
TubeDETR: Spatio-Temporal Video Grounding with Transformers | Code | 1
Who's Waldo? Linking People Across Text and Images | Code | 1
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo | Code | 0
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo | Code | 0

No leaderboard results yet.