| Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image Captioning, Image Description | Unverified | 0 |
| Annotation Methodologies for Vision and Language Dataset Creation | Jul 10, 2016 | Action Recognition, Image Description | Unverified | 0 |
| Pragmatic factors in image description: the case of negations | Jun 20, 2016 | Image Description, Negation | Code Available | 0 |
| Does Multimodality Help Human and Machine for Translation and Image Captioning? | May 30, 2016 | Image Captioning, Image Description | Unverified | 0 |
| Multi30K: Multilingual English-German Image Descriptions | May 2, 2016 | Image Description, Machine Translation | Code Available | 0 |
| Cross-validating Image Description Datasets and Evaluation Metrics | May 1, 2016 | Image Description, Sentence | Unverified | 0 |
| Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings | Mar 30, 2016 | Image Description, Image Retrieval | Code Available | 0 |
| Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations | Feb 23, 2016 | Image Classification | Code Available | 1 |
| On the Use of Deep Learning for Blind Image Quality Assessment | Feb 17, 2016 | Blind Image Quality Assessment, Image Description | Unverified | 0 |
| The Image Torque Operator for Contour Processing | Jan 18, 2016 | Edge Detection, Image Description | Unverified | 0 |
| Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures | Jan 15, 2016 | Image Description, Retrieval | Unverified | 0 |
| Multilingual Image Description with Neural Sequence Models | Oct 15, 2015 | Image Captioning, Image Description | Code Available | 0 |
| Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns | Oct 2, 2015 | Image Description, Quantization | Unverified | 0 |
| The Long-Short Story of Movie Description | Jun 4, 2015 | Image Captioning, Image Description | Unverified | 0 |
| Exemplar SVMs as Visual Feature Encoders | Jun 1, 2015 | Image Classification | Unverified | 0 |
| Mind's Eye: A Recurrent Visual Representation for Image Caption Generation | Jun 1, 2015 | Caption Generation, Image Description | Unverified | 0 |
| Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models | May 19, 2015 | Image Description, Phrase Grounding | Code Available | 1 |
| Weakly Supervised Learning of Objects, Attributes and their Associations | Mar 31, 2015 | Attribute, Image Description | Unverified | 0 |
| Describing Videos by Exploiting Temporal Structure | Feb 27, 2015 | Action Recognition, Image Description | Code Available | 0 |
| Simple Image Description Generator via a Linear Phrase-Based Approach | Dec 29, 2014 | Descriptive, Image Description | Unverified | 0 |
| The Treasure beneath Convolutional Layers: Cross-convolutional-layer Pooling for Image Classification | Nov 27, 2014 | General Classification, Image Classification | Code Available | 0 |
| CIDEr: Consensus-based Image Description Evaluation | Nov 20, 2014 | Action Recognition, Attribute | Code Available | 1 |
| Long-term Recurrent Convolutional Networks for Visual Recognition and Description | Nov 17, 2014 | Image Description, Retrieval | Code Available | 0 |
| Collecting Image Description Datasets using Crowdsourcing | Nov 12, 2014 | Image Description, Sentence | Unverified | 0 |
| Comparing Automatic Evaluation Measures for Image Description | Jun 1, 2014 | Image Description, Slot Filling | Unverified | 0 |