| Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns | Oct 2, 2015 | Image DescriptionQuantization | —Unverified | 0 |
| Exemplar SVMs as Visual Feature Encoders | Jun 1, 2015 | image-classificationImage Classification | —Unverified | 0 |
| Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images | Sep 1, 2017 | Image DescriptionReferring Expression | —Unverified | 0 |
| Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis | Jan 13, 2025 | Image DescriptionTransfer Learning | —Unverified | 0 |
| Exploring Visual Relationship for Image Captioning | Sep 19, 2018 | DecoderImage Captioning | —Unverified | 0 |
| A Fine-Grained Image Description Generation Method Based on Joint Objectives | Sep 2, 2023 | Image DescriptionObject | —Unverified | 0 |
| Face2Text revisited: Improved data set and baseline results | May 24, 2022 | Image DescriptionTransfer Learning | —Unverified | 0 |
| Facial Expression Recognition and Image Description Generation in Vietnamese | Aug 12, 2022 | DescriptiveEmotion Recognition | —Unverified | 0 |
| A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching | Jun 1, 2013 | Image DescriptionVideo Description | —Unverified | 0 |
| Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description | Oct 19, 2017 | Image DescriptionMachine Translation | —Unverified | 0 |
| Annotation Methodologies for Vision and Language Dataset Creation | Jul 10, 2016 | Action RecognitionImage Description | —Unverified | 0 |
| Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| Customized Image Narrative Generation via Interactive Visual Question Generation and Answering | Apr 27, 2018 | DiversityImage Description | —Unverified | 0 |
| Adaptive Color Attributes for Real-Time Visual Tracking | Jun 1, 2014 | AttributeImage Description | —Unverified | 0 |
| Curriculum Learning of Visual Attribute Clusters for Multi-Task Classification | Sep 19, 2017 | AttributeClassification | —Unverified | 0 |
| Curriculum Learning for Multi-Task Classification of Visual Attributes | Aug 29, 2017 | AttributeClassification | —Unverified | 0 |
| Boli: A dataset for understanding stuttering experience and analyzing stuttered speech | Jan 27, 2025 | Image Description | —Unverified | 0 |
| Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images | May 31, 2024 | AnatomyImage Description | —Unverified | 0 |
| Im2Text: Describing Images Using 1 Million Captioned Photographs | Dec 1, 2011 | Image CaptioningImage Description | —Unverified | 0 |
| Cross-validating Image Description Datasets and Evaluation Metrics | May 1, 2016 | Image DescriptionSentence | —Unverified | 0 |
| Image Description Dataset for Language Learners | Jun 1, 2022 | Image DescriptionSentence | —Unverified | 0 |
| Image Description using Visual Dependency Representations | Oct 1, 2013 | Image DescriptionImage Retrieval | —Unverified | 0 |
| Image Pivoting for Learning Multilingual Multimodal Representations | Jul 24, 2017 | Image DescriptionImage Retrieval | —Unverified | 0 |
| Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments | Jun 23, 2019 | Image DescriptionPerson Re-Identification | —Unverified | 0 |
| Cross Modification Attention Based Deliberation Model for Image Captioning | Sep 17, 2021 | DecoderDescriptive | —Unverified | 0 |