SOTAVerified

Image to text

Papers

Showing 6170 of 246 papers

TitleStatusHype
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
GPC: Generative and General Pathology Image Classifier0
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image RetrievalCode2
15M Multimodal Facial Image-Text Dataset0
Towards a text-based quantitative and explainable histopathology image analysisCode0
HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels0
Vision-Braille: An End-to-End Tool for Chinese Braille Image-to-Text Translation0
Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything0
A Data-Driven Guided Decoding Mechanism for Diagnostic CaptioningCode0
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags0
Show:102550
← PrevPage 7 of 25Next →

No leaderboard results yet.