| From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning | Oct 11, 2016 | FormGrounded language learning | —Unverified | 0 |
| Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data | Sep 8, 2016 | Action ClassificationClassification | —Unverified | 0 |
| Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation | Sep 1, 2016 | Image DescriptionImage Retrieval | —Unverified | 0 |
| phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning | Aug 20, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| A Shared Task on Multimodal Machine Translation and Crosslingual Image Description | Aug 1, 2016 | Image DescriptionImage Retrieval | —Unverified | 0 |
| Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| Annotation Methodologies for Vision and Language Dataset Creation | Jul 10, 2016 | Action RecognitionImage Description | —Unverified | 0 |
| Pragmatic factors in image description: the case of negations | Jun 20, 2016 | Image DescriptionNegation | CodeCode Available | 0 |
| Does Multimodality Help Human and Machine for Translation and Image Captioning? | May 30, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| Multi30K: Multilingual English-German Image Descriptions | May 2, 2016 | Image DescriptionMachine Translation | CodeCode Available | 0 |