| Recurrent Topic-Transition GAN for Visual Paragraph Generation | Mar 21, 2017 | Generative Adversarial Network, Image Description | —Unverified | 0 |
| Doubly-Attentive Decoder for Multi-modal Neural Machine Translation | Feb 4, 2017 | Decoder, Image Description | —Unverified | 0 |
| Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering | Dec 15, 2016 | Decoder, Image Captioning | —Unverified | 0 |
| From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning | Oct 11, 2016 | Form, Grounded language learning | —Unverified | 0 |
| Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data | Sep 8, 2016 | Action Classification, Classification | —Unverified | 0 |
| Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation | Sep 1, 2016 | Image Description, Image Retrieval | —Unverified | 0 |
| phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning | Aug 20, 2016 | Image Captioning, Image Description | —Unverified | 0 |
| A Shared Task on Multimodal Machine Translation and Crosslingual Image Description | Aug 1, 2016 | Image Description, Image Retrieval | —Unverified | 0 |
| Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image Captioning, Image Description | —Unverified | 0 |
| Annotation Methodologies for Vision and Language Dataset Creation | Jul 10, 2016 | Action Recognition, Image Description | —Unverified | 0 |