SOTAVerified

Image Description

Papers

Showing 101150 of 154 papers

TitleStatusHype
Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline)Code0
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks0
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space0
A Genetic Algorithm Approach for ImageRepresentation Learning through Color Quantization0
Phrase-based Image Captioning with Hierarchical LSTM Model0
Draw and Tell: Multimodal Descriptions Outperform Verbal- or Sketch-Only Descriptions in an Image Retrieval Task0
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description0
Curriculum Learning of Visual Attribute Clusters for Multi-Task Classification0
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images0
Curriculum Learning for Multi-Task Classification of Visual Attributes0
Image Pivoting for Learning Multilingual Multimodal Representations0
Cross-linguistic differences and similarities in image descriptionsCode0
Place recognition: An Overview of Vision Perspective0
Efficient Decentralized Visual Place Recognition From Full-Image DescriptorsCode0
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition0
Room for improvement in automatic image description: an error analysisCode0
Pragmatic descriptions of perceptual stimuli0
Recurrent Topic-Transition GAN for Visual Paragraph Generation0
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation0
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering0
From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning0
Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data0
Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation0
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning0
A Shared Task on Multimodal Machine Translation and Crosslingual Image Description0
Focused Evaluation for Image Description with Binary Forced-Choice Tasks0
Annotation Methodologies for Vision and Language Dataset Creation0
Pragmatic factors in image description: the case of negationsCode0
Does Multimodality Help Human and Machine for Translation and Image Captioning?Code0
Multi30K: Multilingual English-German Image DescriptionsCode0
Cross-validating Image Description Datasets and Evaluation Metrics0
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal EmbeddingsCode0
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image AnnotationsCode1
On the Use of Deep Learning for Blind Image Quality Assessment0
The Image Torque Operator for Contour Processing0
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures0
Multilingual Image Description with Neural Sequence ModelsCode0
Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns0
The Long-Short Story of Movie Description0
Exemplar SVMs as Visual Feature Encoders0
Mind's Eye: A Recurrent Visual Representation for Image Caption Generation0
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence ModelsCode1
Weakly Supervised Learning of Objects, Attributes and their Associations0
Describing Videos by Exploiting Temporal StructureCode0
Simple Image Description Generator via a Linear Phrase-Based Approach0
The Treasure beneath Convolutional Layers: Cross-convolutional-layer Pooling for Image ClassificationCode0
CIDEr: Consensus-based Image Description EvaluationCode1
Long-term Recurrent Convolutional Networks for Visual Recognition and DescriptionCode0
Collecting Image Description Datasets using Crowdsourcing0
Comparing Automatic Evaluation Measures for Image Description0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.