SOTAVerified

Image to text

Papers

Showing 91100 of 246 papers

TitleStatusHype
Evaluating Text-to-Visual Generation with Image-to-Text GenerationCode3
BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval0
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation0
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional ChangesCode1
MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant0
CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?0
Enhancing Vision-Language Pre-training with Rich Supervisions0
Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition0
Probing Multimodal Large Language Models for Global and Local Semantic RepresentationsCode0
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models0
Show:102550
← PrevPage 10 of 25Next →

No leaderboard results yet.