| Compositional Obverter Communication Learning From Raw Visual Input | Apr 6, 2018 | Image Description | CodeCode Available | 0 |
| Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions | Mar 10, 2018 | Image DescriptionImage to text | CodeCode Available | 0 |
| Zero-Resource Neural Machine Translation with Multi-Agent Communication Game | Feb 9, 2018 | DecoderImage Captioning | —Unverified | 0 |
| Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline) | Nov 26, 2017 | Image DescriptionPerson Re-Identification | CodeCode Available | 0 |
| E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks | Nov 20, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space | Nov 19, 2017 | Caption GenerationImage Description | —Unverified | 0 |
| A Genetic Algorithm Approach for ImageRepresentation Learning through Color Quantization | Nov 18, 2017 | Content-Based Image RetrievalImage Description | —Unverified | 0 |
| Phrase-based Image Captioning with Hierarchical LSTM Model | Nov 11, 2017 | DecoderImage Captioning | —Unverified | 0 |
| Draw and Tell: Multimodal Descriptions Outperform Verbal- or Sketch-Only Descriptions in an Image Retrieval Task | Nov 1, 2017 | Image DescriptionImage Retrieval | —Unverified | 0 |
| Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description | Oct 19, 2017 | Image DescriptionMachine Translation | —Unverified | 0 |
| Curriculum Learning of Visual Attribute Clusters for Multi-Task Classification | Sep 19, 2017 | AttributeClassification | —Unverified | 0 |
| Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images | Sep 1, 2017 | Image DescriptionReferring Expression | —Unverified | 0 |
| Curriculum Learning for Multi-Task Classification of Visual Attributes | Aug 29, 2017 | AttributeClassification | —Unverified | 0 |
| Image Pivoting for Learning Multilingual Multimodal Representations | Jul 24, 2017 | Image DescriptionImage Retrieval | —Unverified | 0 |
| Cross-linguistic differences and similarities in image descriptions | Jul 6, 2017 | Image DescriptionSpecificity | CodeCode Available | 0 |
| Place recognition: An Overview of Vision Perspective | Jun 17, 2017 | image-classificationImage Classification | —Unverified | 0 |
| Efficient Decentralized Visual Place Recognition From Full-Image Descriptors | May 30, 2017 | ClusteringImage Description | CodeCode Available | 0 |
| Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition | Apr 23, 2017 | AttributeImage Captioning | —Unverified | 0 |
| Room for improvement in automatic image description: an error analysis | Apr 13, 2017 | Image Description | CodeCode Available | 0 |
| Pragmatic descriptions of perceptual stimuli | Apr 1, 2017 | Image DescriptionObject Recognition | —Unverified | 0 |
| Recurrent Topic-Transition GAN for Visual Paragraph Generation | Mar 21, 2017 | Generative Adversarial NetworkImage Description | —Unverified | 0 |
| Doubly-Attentive Decoder for Multi-modal Neural Machine Translation | Feb 4, 2017 | DecoderImage Description | —Unverified | 0 |
| Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering | Dec 15, 2016 | DecoderImage Captioning | —Unverified | 0 |
| From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning | Oct 11, 2016 | FormGrounded language learning | —Unverified | 0 |
| Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data | Sep 8, 2016 | Action ClassificationClassification | —Unverified | 0 |
| Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation | Sep 1, 2016 | Image DescriptionImage Retrieval | —Unverified | 0 |
| phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning | Aug 20, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| A Shared Task on Multimodal Machine Translation and Crosslingual Image Description | Aug 1, 2016 | Image DescriptionImage Retrieval | —Unverified | 0 |
| Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image CaptioningImage Description | —Unverified | 0 |
| Annotation Methodologies for Vision and Language Dataset Creation | Jul 10, 2016 | Action RecognitionImage Description | —Unverified | 0 |
| Pragmatic factors in image description: the case of negations | Jun 20, 2016 | Image DescriptionNegation | CodeCode Available | 0 |
| Does Multimodality Help Human and Machine for Translation and Image Captioning? | May 30, 2016 | Image CaptioningImage Description | CodeCode Available | 0 |
| Multi30K: Multilingual English-German Image Descriptions | May 2, 2016 | Image DescriptionMachine Translation | CodeCode Available | 0 |
| Cross-validating Image Description Datasets and Evaluation Metrics | May 1, 2016 | Image DescriptionSentence | —Unverified | 0 |
| Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings | Mar 30, 2016 | Image DescriptionImage Retrieval | CodeCode Available | 0 |
| On the Use of Deep Learning for Blind Image Quality Assessment | Feb 17, 2016 | Blind Image Quality AssessmentImage Description | —Unverified | 0 |
| The Image Torque Operator for Contour Processing | Jan 18, 2016 | Edge DetectionImage Description | —Unverified | 0 |
| Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures | Jan 15, 2016 | Image DescriptionRetrieval | —Unverified | 0 |
| Multilingual Image Description with Neural Sequence Models | Oct 15, 2015 | Image CaptioningImage Description | CodeCode Available | 0 |
| Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns | Oct 2, 2015 | Image DescriptionQuantization | —Unverified | 0 |
| The Long-Short Story of Movie Description | Jun 4, 2015 | Image CaptioningImage Description | —Unverified | 0 |
| Exemplar SVMs as Visual Feature Encoders | Jun 1, 2015 | image-classificationImage Classification | —Unverified | 0 |
| Mind's Eye: A Recurrent Visual Representation for Image Caption Generation | Jun 1, 2015 | Caption GenerationImage Description | —Unverified | 0 |
| Weakly Supervised Learning of Objects, Attributes and their Associations | Mar 31, 2015 | AttributeImage Description | —Unverified | 0 |
| Describing Videos by Exploiting Temporal Structure | Feb 27, 2015 | Action RecognitionImage Description | CodeCode Available | 0 |
| Simple Image Description Generator via a Linear Phrase-Based Approach | Dec 29, 2014 | DescriptiveImage Description | —Unverified | 0 |
| The Treasure beneath Convolutional Layers: Cross-convolutional-layer Pooling for Image Classification | Nov 27, 2014 | General Classificationimage-classification | CodeCode Available | 0 |
| Long-term Recurrent Convolutional Networks for Visual Recognition and Description | Nov 17, 2014 | Image DescriptionRetrieval | CodeCode Available | 0 |
| Collecting Image Description Datasets using Crowdsourcing | Nov 12, 2014 | Image DescriptionSentence | —Unverified | 0 |
| Adaptive Color Attributes for Real-Time Visual Tracking | Jun 1, 2014 | AttributeImage Description | —Unverified | 0 |