SOTAVerified

Image Description

Papers

Showing 51100 of 154 papers

TitleStatusHype
Face2Text revisited: Improved data set and baseline results0
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation0
Multimodal fusion via cortical network inspired losses0
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingCode1
Neural Dependency Coding inspired Multimodal Fusion0
CIDEr-R: Robust Consensus-based Image Description EvaluationCode0
Cross Modification Attention Based Deliberation Model for Image Captioning0
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant0
Zero-Shot Out-of-Distribution Detection Based on the Pre-trained Model CLIPCode1
Revisiting Binary Local Image Description for Resource Limited DevicesCode1
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domainCode0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
EPYNET: Efficient Pyramidal Network for Clothing Segmentation0
Multimodal Word Sense Disambiguation in Creative PracticeCode0
On the use of human reference data for evaluating automatic image descriptions0
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs0
Textual Visual Semantic Dataset for Text Spotting0
On Architectures for Including Visual Information in Neural Language Models for Image DescriptionCode0
Tell Me More: A Dataset of Visual Scene Description Sequences0
A Hierarchical Approach for Visual Storytelling Using Image Description0
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions0
Place recognition in gardens by learning visual representations: data set and benchmark analysis0
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments0
What a neural language model tells us about spatial relationsCode0
The Lexical Gap: An Improved Measure of Automated Image Description Quality0
Multi-modal gated recurrent units for image description0
Human Attention in Image Captioning: Dataset and AnalysisCode0
Sequential Attention GAN for Interactive Image Editing0
Grounded Video DescriptionCode1
Unsupervised Image CaptioningCode0
Adding the Third Dimension to Spatial Relation Detection in 2D Images0
The Task Matters: Comparing Image Captioning and Task-Based Dialogical Image Description0
Talking about other people: an endless range of possibilitiesCode0
Recurrent Attention Unit0
Exploring Visual Relationship for Image Captioning0
Unsupervised Stylish Image Description Generation via Domain Layer Norm0
DIDEC: The Dutch Image Description and Eye-tracking Corpus0
Measuring the Diversity of Automatic Image DescriptionsCode0
Varying image description tasks: spoken versus written descriptionsCode0
Deep Imbalanced Attribute Classification using Visual Attention AggregationCode0
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data0
Bridging Languages through Images with Deep Partial Canonical Correlation AnalysisCode0
Contextualize, Show and Tell: A Neural Visual StorytellerCode0
Generating Image Captions in Arabic using Root-Word Based Recurrent Neural Networks and Deep Neural Networks0
Multimodal Machine Translation with Reinforcement Learning0
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering0
Compositional Obverter Communication Learning From Raw Visual InputCode0
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face DescriptionsCode0
Zero-Resource Neural Machine Translation with Multi-Agent Communication Game0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.