SOTAVerified

Image to text

Papers

Showing 21-30 of 246 papers

Title | Status | Hype
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval | Code | 2
Improving Image Restoration through Removing Degradations in Textual Representations | Code | 1
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation | Code | 1
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? | Code | 1
Beyond One-to-One: Rethinking the Referring Image Segmentation | Code | 1
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training | Code | 1
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Code | 1
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation | Code | 1
Bootstrapping Vision-Language Learning with Decoupled Language Pre-training | Code | 1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Code | 1
Page 3 of 25

No leaderboard results yet.