SOTAVerified

Image Description

Papers

Showing 51100 of 154 papers

TitleStatusHype
DiffCap: Exploring Continuous Diffusion on Image Captioning0
Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image RetrievalCode0
Improving Visual-Semantic Embeddings by Learning Semantically-Enhanced Hard Negatives for Cross-modal Information RetrievalCode0
Facial Expression Recognition and Image Description Generation in Vietnamese0
Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional NetworkCode0
Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset0
Image Description Dataset for Language Learners0
Face2Text revisited: Improved data set and baseline results0
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation0
Multimodal fusion via cortical network inspired losses0
CIDEr-R: Robust Consensus-based Image Description EvaluationCode0
Neural Dependency Coding inspired Multimodal Fusion0
Cross Modification Attention Based Deliberation Model for Image Captioning0
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant0
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domainCode0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
EPYNET: Efficient Pyramidal Network for Clothing Segmentation0
Multimodal Word Sense Disambiguation in Creative PracticeCode0
On the use of human reference data for evaluating automatic image descriptions0
Textual Visual Semantic Dataset for Text Spotting0
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs0
On Architectures for Including Visual Information in Neural Language Models for Image DescriptionCode0
Tell Me More: A Dataset of Visual Scene Description Sequences0
A Hierarchical Approach for Visual Storytelling Using Image Description0
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions0
Place recognition in gardens by learning visual representations: data set and benchmark analysis0
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments0
What a neural language model tells us about spatial relationsCode0
The Lexical Gap: An Improved Measure of Automated Image Description Quality0
Multi-modal gated recurrent units for image description0
Human Attention in Image Captioning: Dataset and AnalysisCode0
Sequential Attention GAN for Interactive Image Editing0
Unsupervised Image CaptioningCode0
Talking about other people: an endless range of possibilitiesCode0
The Task Matters: Comparing Image Captioning and Task-Based Dialogical Image Description0
Adding the Third Dimension to Spatial Relation Detection in 2D Images0
Recurrent Attention Unit0
Exploring Visual Relationship for Image Captioning0
Unsupervised Stylish Image Description Generation via Domain Layer Norm0
DIDEC: The Dutch Image Description and Eye-tracking Corpus0
Varying image description tasks: spoken versus written descriptionsCode0
Measuring the Diversity of Automatic Image DescriptionsCode0
Deep Imbalanced Attribute Classification using Visual Attention AggregationCode0
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data0
Bridging Languages through Images with Deep Partial Canonical Correlation AnalysisCode0
Contextualize, Show and Tell: A Neural Visual StorytellerCode0
Generating Image Captions in Arabic using Root-Word Based Recurrent Neural Networks and Deep Neural Networks0
Multimodal Machine Translation with Reinforcement Learning0
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.