SOTAVerified

Image to text

Papers

Showing 191-200 of 246 papers

Title | Status | Hype
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning | - | 0
Adaptively Clustering Neighbor Elements for Image-Text Generation | Code | 0
SLAN: Self-Locator Aided Network for Vision-Language Understanding | - | 0
Do DALL-E and Flamingo Understand Each Other? | - | 0
When are Lemons Purple? The Concept Association Bias of Vision-Language Models | - | 0
MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering | - | 0
SLAN: Self-Locator Aided Network for Cross-Modal Understanding | - | 0
Retrieval-Augmented Multimodal Language Modeling | - | 0
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision | - | 0
Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards | - | 0

No leaderboard results yet.