SOTAVerified

Image to text

Papers

Showing 141150 of 246 papers

TitleStatusHype
DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models0
DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding0
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning0
Doc2Im: document to image conversion through self-attentive embedding0
DOCCI: Descriptions of Connected and Contrasting Images0
Do DALL-E and Flamingo Understand Each Other?0
Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection0
Dynamic Traceback Learning for Medical Report Generation0
Efficient End-to-End Visual Document Understanding with Rationale Distillation0
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval0
Show:102550
← PrevPage 15 of 25Next →

No leaderboard results yet.