| ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs | Apr 21, 2020 | Image CaptioningImage Description | —Unverified | 0 |
| Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis | Dec 4, 2024 | Image CaptioningImage Description | —Unverified | 0 |
| phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning | Aug 20, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| Phrase-based Image Captioning with Hierarchical LSTM Model | Nov 11, 2017 | DecoderImage Captioning | —Unverified | 0 |
| Place recognition: An Overview of Vision Perspective | Jun 17, 2017 | image-classificationImage Classification | —Unverified | 0 |
| Place recognition in gardens by learning visual representations: data set and benchmark analysis | Jun 28, 2019 | Camera LocalizationImage Description | —Unverified | 0 |
| Pragmatic descriptions of perceptual stimuli | Apr 1, 2017 | Image DescriptionObject Recognition | —Unverified | 0 |
| Recurrent Attention Unit | Oct 30, 2018 | General ClassificationHandwriting Recognition | —Unverified | 0 |
| Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering | Dec 15, 2016 | DecoderImage Captioning | —Unverified | 0 |
| Recurrent Topic-Transition GAN for Visual Paragraph Generation | Mar 21, 2017 | Generative Adversarial NetworkImage Description | —Unverified | 0 |
| SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant | Sep 14, 2021 | Image Description | —Unverified | 0 |
| Seeing the Unseen: Visual Common Sense for Semantic Placement | Jan 15, 2024 | Common Sense ReasoningImage Description | —Unverified | 0 |
| Sequential Attention GAN for Interactive Image Editing | Dec 20, 2018 | Image DescriptionImage Generation | —Unverified | 0 |
| Simple Image Description Generator via a Linear Phrase-Based Approach | Dec 29, 2014 | DescriptiveImage Description | —Unverified | 0 |
| Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition | Apr 23, 2017 | AttributeImage Captioning | —Unverified | 0 |
| Tell Me More: A Dataset of Visual Scene Description Sequences | Oct 1, 2019 | Image DescriptionSentence | —Unverified | 0 |
| Textual Visual Semantic Dataset for Text Spotting | Apr 21, 2020 | Image Descriptiontext similarity | —Unverified | 0 |
| The Image Torque Operator for Contour Processing | Jan 18, 2016 | Edge DetectionImage Description | —Unverified | 0 |
| The Lexical Gap: An Improved Measure of Automated Image Description Quality | May 1, 2019 | 2kDiversity | —Unverified | 0 |
| The Long-Short Story of Movie Description | Jun 4, 2015 | Image CaptioningImage Description | —Unverified | 0 |
| The Task Matters: Comparing Image Captioning and Task-Based Dialogical Image Description | Nov 1, 2018 | Image CaptioningImage Description | —Unverified | 0 |
| TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models | Nov 2, 2024 | Image DescriptionImage Generation | —Unverified | 0 |
| Unsupervised Stylish Image Description Generation via Domain Layer Norm | Sep 11, 2018 | Image Description | —Unverified | 0 |
| VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions | Jul 22, 2019 | Image DescriptionSemantic Similarity | —Unverified | 0 |
| Weakly Supervised Learning of Objects, Attributes and their Associations | Mar 31, 2015 | AttributeImage Description | —Unverified | 0 |