Image to text

Papers

Showing 61–70 of 246 papers

Title	Date	Tasks	Status	Hype
Write and Paint: Generative Vision-Language Models are Unified Modal Learners	Jun 15, 2022	Image Generation, Image to text	Code Available	1
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation	Dec 31, 2021	Image Captioning, Image Generation	Code Available	1
Distilled Dual-Encoder Model for Vision-Language Understanding	Dec 16, 2021	Image to text, model	Code Available	1
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic	Nov 29, 2021	Contrastive Learning, Descriptive	Code Available	1
L-Verse: Bidirectional Generation Between Image and Text	Nov 22, 2021	Image Captioning, Image Generation	Code Available	1
Unifying Multimodal Transformer for Bi-directional Image and Text Generation	Oct 19, 2021	Image Generation, Image to text	Code Available	1
Concadia: Towards Image-Based Text Generation with a Purpose	Apr 16, 2021	Image Captioning, Image to text	Code Available	1
Progressive Transformer-Based Generation of Radiology Reports	Feb 19, 2021	Image to text, Text Generation	Code Available	1
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation	Oct 20, 2020	Image to text, Natural Language Inference	Code Available	1
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration	Jun 12, 2025	cross-modal alignment, Image to text	Unverified	0

No leaderboard results yet.