SOTAVerified

Image Description

Papers

Showing 101150 of 154 papers

TitleStatusHype
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data0
Weakly Supervised Learning of Objects, Attributes and their Associations0
Neural Dependency Coding inspired Multimodal Fusion0
Unsupervised Stylish Image Description Generation via Domain Layer Norm0
On the Use of Deep Learning for Blind Image Quality Assessment0
On the use of human reference data for evaluating automatic image descriptions0
Artwork Explanation in Large-scale Vision Language Models0
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs0
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis0
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning0
Phrase-based Image Captioning with Hierarchical LSTM Model0
Place recognition: An Overview of Vision Perspective0
Place recognition in gardens by learning visual representations: data set and benchmark analysis0
Pragmatic descriptions of perceptual stimuli0
A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision-Language Models0
A Preliminary Survey of Semantic Descriptive Model for Images0
Recurrent Attention Unit0
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering0
Recurrent Topic-Transition GAN for Visual Paragraph Generation0
Annotation Methodologies for Vision and Language Dataset Creation0
WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization0
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions0
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant0
Cross-validating Image Description Datasets and Evaluation Metrics0
Curriculum Learning for Multi-Task Classification of Visual Attributes0
Curriculum Learning of Visual Attribute Clusters for Multi-Task Classification0
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering0
Cross Modification Attention Based Deliberation Model for Image Captioning0
A Hierarchical Approach for Visual Storytelling Using Image Description0
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
DIDEC: The Dutch Image Description and Eye-tracking Corpus0
DiffCap: Exploring Continuous Diffusion on Image Captioning0
Seeing the Unseen: Visual Common Sense for Semantic Placement0
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space0
Sequential Attention GAN for Interactive Image Editing0
Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation0
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation0
Draw and Tell: Multimodal Descriptions Outperform Verbal- or Sketch-Only Descriptions in an Image Retrieval Task0
Simple Image Description Generator via a Linear Phrase-Based Approach0
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks0
EPYNET: Efficient Pyramidal Network for Clothing Segmentation0
Exemplar SVMs as Visual Feature Encoders0
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images0
Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis0
Exploring Visual Relationship for Image Captioning0
Zero-Resource Neural Machine Translation with Multi-Agent Communication Game0
Face2Text revisited: Improved data set and baseline results0
Facial Expression Recognition and Image Description Generation in Vietnamese0
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition0
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.