SOTAVerified

Image to text

Papers

Showing 131–140 of 246 papers

| Title | Status | Hype |
| --- | --- | --- |
| Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation | Code | 0 |
| DOCCI: Descriptions of Connected and Contrasting Images | | 0 |
| Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | | 0 |
| Leveraging AI to Generate Audio for User-generated Content in Video Games | | 0 |
| VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations | Code | 0 |
| Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection | | 0 |
| OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation | | 0 |
| BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval | | 0 |
| Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation | | 0 |
| CLIP the Bias: How Useful is Balancing Data in Multimodal Learning? | | 0 |

No leaderboard results yet.