SOTAVerified

Image to text

Papers

Showing 3140 of 246 papers

TitleStatusHype
CMC-Bench: Towards a New Paradigm of Visual Signal CompressionCode1
Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and DesignCode1
Language-Oriented Semantic Latent Representation for Image TransmissionCode1
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?Code1
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional ChangesCode1
Can MLLMs Perform Text-to-Image In-Context Learning?Code1
Benchmarking Large Multimodal Models against Common CorruptionsCode1
Improving Image Restoration through Removing Degradations in Textual RepresentationsCode1
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language ModelsCode1
UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the WebCode1
Show:102550
← PrevPage 4 of 25Next →

No leaderboard results yet.