SOTAVerified

Image Description

Papers

Showing 126150 of 154 papers

TitleStatusHype
Compositional Obverter Communication Learning From Raw Visual InputCode0
Efficient Decentralized Visual Place Recognition From Full-Image DescriptorsCode0
Talking about other people: an endless range of possibilitiesCode0
Human Attention in Image Captioning: Dataset and AnalysisCode0
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal EmbeddingsCode0
CIDEr-R: Robust Consensus-based Image Description EvaluationCode0
Pragmatic factors in image description: the case of negationsCode0
Large Language Models can Share Images, Too!Code0
Cross-linguistic differences and similarities in image descriptionsCode0
Varying image description tasks: spoken versus written descriptionsCode0
Localized Symbolic Knowledge Distillation for Visual Commonsense ModelsCode0
Long-term Recurrent Convolutional Networks for Visual Recognition and DescriptionCode0
Improving Visual-Semantic Embeddings by Learning Semantically-Enhanced Hard Negatives for Cross-modal Information RetrievalCode0
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal DatasetsCode0
Measuring the Diversity of Automatic Image DescriptionsCode0
MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning StepsCode0
What a neural language model tells us about spatial relationsCode0
Does Multimodality Help Human and Machine for Translation and Image Captioning?Code0
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMsCode0
Describing Videos by Exploiting Temporal StructureCode0
VisBias: Measuring Explicit and Implicit Social Biases in Vision Language ModelsCode0
Multi30K: Multilingual English-German Image DescriptionsCode0
Contextualize, Show and Tell: A Neural Visual StorytellerCode0
Multilingual Image Description with Neural Sequence ModelsCode0
Room for improvement in automatic image description: an error analysisCode0
Show:102550
← PrevPage 6 of 7Next →

No leaderboard results yet.