SOTAVerified

Image to text

Papers

Showing 5160 of 246 papers

TitleStatusHype
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-trainingCode1
Language Quantized AutoEncoders: Towards Unsupervised Text-Image AlignmentCode1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal CyclesCode1
Can MLLMs Perform Text-to-Image In-Context Learning?Code1
Distilled Dual-Encoder Model for Vision-Language UnderstandingCode1
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language GenerationCode1
MAGVLT: Masked Generative Vision-and-Language TransformerCode1
Bootstrapping Vision-Language Learning with Decoupled Language Pre-trainingCode1
Improving Image Restoration through Removing Degradations in Textual RepresentationsCode1
Show:102550
← PrevPage 6 of 25Next →

No leaderboard results yet.