SOTAVerified

Image to text

Papers

Showing 61-70 of 246 papers

| Title | Status | Hype |
|---|---|---|
| Write and Paint: Generative Vision-Language Models are Unified Modal Learners | Code | 1 |
| ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation | Code | 1 |
| Distilled Dual-Encoder Model for Vision-Language Understanding | Code | 1 |
| ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | Code | 1 |
| L-Verse: Bidirectional Generation Between Image and Text | Code | 1 |
| Unifying Multimodal Transformer for Bi-directional Image and Text Generation | Code | 1 |
| Concadia: Towards Image-Based Text Generation with a Purpose | Code | 1 |
| Progressive Transformer-Based Generation of Radiology Reports | Code | 1 |
| Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation | Code | 1 |
| Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | | 0 |

No leaderboard results yet.