SOTAVerified

Image Description

Papers

Showing 2650 of 154 papers

TitleStatusHype
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
A Fine-Grained Image Description Generation Method Based on Joint Objectives0
A Genetic Algorithm Approach for ImageRepresentation Learning through Color Quantization0
A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching0
From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning0
Comparing Automatic Evaluation Measures for Image Description0
Collecting Image Description Datasets using Crowdsourcing0
Advanced Chest X-Ray Analysis via Transformer-Based Image Descriptors and Cross-Model Attention Mechanism0
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation0
A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision-Language Models0
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description0
Generating Image Captions in Arabic using Root-Word Based Recurrent Neural Networks and Deep Neural Networks0
Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis0
Artwork Explanation in Large-scale Vision Language Models0
Exploring Visual Relationship for Image Captioning0
DiffCap: Exploring Continuous Diffusion on Image Captioning0
DIDEC: The Dutch Image Description and Eye-tracking Corpus0
A Preliminary Survey of Semantic Descriptive Model for Images0
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space0
Adding the Third Dimension to Spatial Relation Detection in 2D Images0
Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation0
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images0
Draw and Tell: Multimodal Descriptions Outperform Verbal- or Sketch-Only Descriptions in an Image Retrieval Task0
A Shared Task on Multimodal Machine Translation and Crosslingual Image Description0
Face2Text revisited: Improved data set and baseline results0
Show:102550
← PrevPage 2 of 7Next →

No leaderboard results yet.