SOTAVerified

Image Description

Papers

Showing 1120 of 154 papers

TitleStatusHype
A Preliminary Survey of Semantic Descriptive Model for Images0
Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis0
RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human FeedbackCode0
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis0
TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models0
MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning StepsCode0
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMsCode0
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image DescriptionsCode2
Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images0
Data-augmented phrase-level alignment for mitigating object hallucination0
Show:102550
← PrevPage 2 of 16Next →

No leaderboard results yet.