SOTAVerified

Image Description

Papers

Showing 51100 of 154 papers

TitleStatusHype
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures0
Boli: A dataset for understanding stuttering experience and analyzing stuttered speech0
Collecting Image Description Datasets using Crowdsourcing0
Comparing Automatic Evaluation Measures for Image Description0
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
Cross Modification Attention Based Deliberation Model for Image Captioning0
Cross-validating Image Description Datasets and Evaluation Metrics0
Curriculum Learning for Multi-Task Classification of Visual Attributes0
Curriculum Learning of Visual Attribute Clusters for Multi-Task Classification0
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering0
DIDEC: The Dutch Image Description and Eye-tracking Corpus0
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning0
Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images0
Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data0
Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns0
Mind's Eye: A Recurrent Visual Representation for Image Caption Generation0
Data-augmented phrase-level alignment for mitigating object hallucination0
Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset0
Multimodal fusion via cortical network inspired losses0
Multi-modal gated recurrent units for image description0
Multimodal Machine Translation with Reinforcement Learning0
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data0
Neural Dependency Coding inspired Multimodal Fusion0
On the Use of Deep Learning for Blind Image Quality Assessment0
On the use of human reference data for evaluating automatic image descriptions0
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs0
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis0
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning0
Phrase-based Image Captioning with Hierarchical LSTM Model0
Place recognition: An Overview of Vision Perspective0
Place recognition in gardens by learning visual representations: data set and benchmark analysis0
Pragmatic descriptions of perceptual stimuli0
Recurrent Attention Unit0
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering0
Recurrent Topic-Transition GAN for Visual Paragraph Generation0
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant0
Seeing the Unseen: Visual Common Sense for Semantic Placement0
Sequential Attention GAN for Interactive Image Editing0
Simple Image Description Generator via a Linear Phrase-Based Approach0
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition0
Tell Me More: A Dataset of Visual Scene Description Sequences0
Textual Visual Semantic Dataset for Text Spotting0
The Image Torque Operator for Contour Processing0
The Lexical Gap: An Improved Measure of Automated Image Description Quality0
The Long-Short Story of Movie Description0
The Task Matters: Comparing Image Captioning and Task-Based Dialogical Image Description0
TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models0
Unsupervised Stylish Image Description Generation via Domain Layer Norm0
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions0
Weakly Supervised Learning of Objects, Attributes and their Associations0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.