SOTAVerified

Image Description

Papers

Showing 141150 of 154 papers

TitleStatusHype
Efficient Decentralized Visual Place Recognition From Full-Image DescriptorsCode0
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMsCode0
VisBias: Measuring Explicit and Implicit Social Biases in Vision Language ModelsCode0
Multi30K: Multilingual English-German Image DescriptionsCode0
Cross-linguistic differences and similarities in image descriptionsCode0
Multilingual Image Description with Neural Sequence ModelsCode0
Room for improvement in automatic image description: an error analysisCode0
RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human FeedbackCode0
Bounding and Filling: A Fast and Flexible Framework for Image CaptioningCode0
Contextualize, Show and Tell: A Neural Visual StorytellerCode0
Show:102550
← PrevPage 15 of 16Next →

No leaderboard results yet.