SOTAVerified

Image Description

Papers

Showing 5175 of 154 papers

TitleStatusHype
Multimodal Word Sense Disambiguation in Creative PracticeCode0
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal EmbeddingsCode0
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal DatasetsCode0
Long-term Recurrent Convolutional Networks for Visual Recognition and DescriptionCode0
Measuring the Diversity of Automatic Image DescriptionsCode0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
Cross-linguistic differences and similarities in image descriptionsCode0
Large Language Models can Share Images, Too!Code0
Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image RetrievalCode0
ContextRef: Evaluating Referenceless Metrics For Image Description GenerationCode0
Localized Symbolic Knowledge Distillation for Visual Commonsense ModelsCode0
Contextualize, Show and Tell: A Neural Visual StorytellerCode0
MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning StepsCode0
Pragmatic factors in image description: the case of negationsCode0
Talking about other people: an endless range of possibilitiesCode0
Focused Evaluation for Image Description with Binary Forced-Choice Tasks0
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description0
A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching0
Facial Expression Recognition and Image Description Generation in Vietnamese0
Face2Text revisited: Improved data set and baseline results0
Exploring Visual Relationship for Image Captioning0
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
A Fine-Grained Image Description Generation Method Based on Joint Objectives0
Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis0
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.