| Multimodal Machine Translation with Reinforcement Learning | May 7, 2018 | Image DescriptionMachine Translation | —Unverified | 0 |
| Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data | Jul 1, 2018 | Image DescriptionMachine Translation | —Unverified | 0 |
| Neural Dependency Coding inspired Multimodal Fusion | Sep 28, 2021 | Emotion RecognitionImage Description | —Unverified | 0 |
| On the Use of Deep Learning for Blind Image Quality Assessment | Feb 17, 2016 | Blind Image Quality AssessmentImage Description | —Unverified | 0 |
| On the use of human reference data for evaluating automatic image descriptions | Jun 15, 2020 | Image Description | —Unverified | 0 |
| ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs | Apr 21, 2020 | Image CaptioningImage Description | —Unverified | 0 |
| Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis | Dec 4, 2024 | Image CaptioningImage Description | —Unverified | 0 |
| phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning | Aug 20, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| Phrase-based Image Captioning with Hierarchical LSTM Model | Nov 11, 2017 | DecoderImage Captioning | —Unverified | 0 |
| Place recognition: An Overview of Vision Perspective | Jun 17, 2017 | image-classificationImage Classification | —Unverified | 0 |