SOTAVerified

Image to text

Papers

Showing 3140 of 246 papers

TitleStatusHype
Language-Oriented Semantic Latent Representation for Image TransmissionCode1
Benchmarking Large Multimodal Models against Common CorruptionsCode1
Language Quantized AutoEncoders: Towards Unsupervised Text-Image AlignmentCode1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
Improving Image Restoration through Removing Degradations in Textual RepresentationsCode1
Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and DesignCode1
Distilled Dual-Encoder Model for Vision-Language UnderstandingCode1
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language GenerationCode1
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal CyclesCode1
CMC-Bench: Towards a New Paradigm of Visual Signal CompressionCode1
Show:102550
← PrevPage 4 of 25Next →

No leaderboard results yet.