SOTAVerified

Image to text

Papers

Showing 61-70 of 246 papers

Title | Status | Hype
Brain Captioning: Decoding human brain activity into images and text | Code | 1
Distilled Dual-Encoder Model for Vision-Language Understanding | Code | 1
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles | Code | 1
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? | Code | 1
Can MLLMs Perform Text-to-Image In-Context Learning? | Code | 1
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training | Code | 1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation | Code | 1
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning | - | 0

No leaderboard results yet.