| Multimodal Word Sense Disambiguation in Creative Practice | Jul 15, 2020 | ClassificationDescriptive | CodeCode Available | 0 | 5 |
| Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings | Mar 30, 2016 | Image DescriptionImage Retrieval | CodeCode Available | 0 | 5 |
| MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Mar 5, 2024 | DiversityImage Description | CodeCode Available | 0 | 5 |
| Long-term Recurrent Convolutional Networks for Visual Recognition and Description | Nov 17, 2014 | Image DescriptionRetrieval | CodeCode Available | 0 | 5 |
| Measuring the Diversity of Automatic Image Descriptions | Aug 1, 2018 | DiversityImage Description | CodeCode Available | 0 | 5 |
| Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze | Nov 9, 2020 | cross-modal alignmentImage Captioning | CodeCode Available | 0 | 5 |
| Cross-linguistic differences and similarities in image descriptions | Jul 6, 2017 | Image DescriptionSpecificity | CodeCode Available | 0 | 5 |
| Large Language Models can Share Images, Too! | Oct 23, 2023 | Image DescriptionSentence | CodeCode Available | 0 | 5 |
| Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval | Jan 1, 2023 | BinarizationImage Description | CodeCode Available | 0 | 5 |
| ContextRef: Evaluating Referenceless Metrics For Image Description Generation | Sep 21, 2023 | Image Description | CodeCode Available | 0 | 5 |
| Localized Symbolic Knowledge Distillation for Visual Commonsense Models | Dec 8, 2023 | Image DescriptionInstruction Following | CodeCode Available | 0 | 5 |
| Contextualize, Show and Tell: A Neural Visual Storyteller | Jun 3, 2018 | DecoderImage Description | CodeCode Available | 0 | 5 |
| MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps | Oct 18, 2024 | Image DescriptionInformativeness | CodeCode Available | 0 | 5 |
| Pragmatic factors in image description: the case of negations | Jun 20, 2016 | Image DescriptionNegation | CodeCode Available | 0 | 5 |
| Talking about other people: an endless range of possibilities | Nov 1, 2018 | Image DescriptionText Generation | CodeCode Available | 0 | 5 |
| Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image CaptioningImage Description | —Unverified | 0 | 0 |
| Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description | Oct 19, 2017 | Image DescriptionMachine Translation | —Unverified | 0 | 0 |
| A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching | Jun 1, 2013 | Image DescriptionVideo Description | —Unverified | 0 | 0 |
| Facial Expression Recognition and Image Description Generation in Vietnamese | Aug 12, 2022 | DescriptiveEmotion Recognition | —Unverified | 0 | 0 |
| Face2Text revisited: Improved data set and baseline results | May 24, 2022 | Image DescriptionTransfer Learning | —Unverified | 0 | 0 |
| Exploring Visual Relationship for Image Captioning | Sep 19, 2018 | DecoderImage Captioning | —Unverified | 0 | 0 |
| Computer Vision and Conflicting Values: Describing People with Automated Alt Text | May 26, 2021 | Image Description | —Unverified | 0 | 0 |
| A Fine-Grained Image Description Generation Method Based on Joint Objectives | Sep 2, 2023 | Image DescriptionObject | —Unverified | 0 | 0 |
| Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis | Jan 13, 2025 | Image DescriptionTransfer Learning | —Unverified | 0 | 0 |
| Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images | Sep 1, 2017 | Image DescriptionReferring Expression | —Unverified | 0 | 0 |