SOTAVerified

Image Description

Papers

Showing 1120 of 154 papers

TitleStatusHype
SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline ModelsCode1
A skeletonization algorithm for gradient-based optimizationCode1
Grounded Video DescriptionCode1
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue DatasetCode1
Can Large Multimodal Models Uncover Deep Semantics Behind Images?Code1
Chatting Makes Perfect: Chat-based Image RetrievalCode1
CIDEr: Consensus-based Image Description EvaluationCode1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingCode1
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence ModelsCode1
Text-Visual Semantic Constrained AI-Generated Image Quality AssessmentCode1
Show:102550
← PrevPage 2 of 16Next →

No leaderboard results yet.