SOTAVerified

Image to text

Papers

Showing 26–50 of 246 papers

Title | Status | Hype
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment | Code | 1
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs | Code | 1
Beyond One-to-One: Rethinking the Referring Image Segmentation | Code | 1
Concadia: Towards Image-Based Text Generation with a Purpose | Code | 1
Brain Captioning: Decoding human brain activity into images and text | Code | 1
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Code | 1
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Code | 1
CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Code | 1
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Code | 1
Multimodal Foundation Models For Echocardiogram Interpretation | Code | 1
Benchmarking Large Multimodal Models against Common Corruptions | Code | 1
MAGVLT: Masked Generative Vision-and-Language Transformer | Code | 1
Multimodal Procedural Planning via Dual Text-Image Prompting | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation | Code | 1
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Code | 1
Vision-Language Dataset Distillation | Code | 1
Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design | Code | 1
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? | Code | 1
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training | Code | 1
L-Verse: Bidirectional Generation Between Image and Text | Code | 1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Code | 1
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles | Code | 1
Can MLLMs Perform Text-to-Image In-Context Learning? | Code | 1
Distilled Dual-Encoder Model for Vision-Language Understanding | Code | 1

No leaderboard results yet.