SOTAVerified

Image Description

Papers

Showing 101-125 of 154 papers

Title | Status | Hype
WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization |  | 0
Zero-Resource Neural Machine Translation with Multi-Agent Communication Game |  | 0
Focused Evaluation for Image Description with Binary Forced-Choice Tasks |  | 0
From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning |  | 0
Generating Image Captions in Arabic using Root-Word Based Recurrent Neural Networks and Deep Neural Networks |  | 0
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation |  | 0
Im2Text: Describing Images Using 1 Million Captioned Photographs |  | 0
Image Description Dataset for Language Learners |  | 0
Image Description using Visual Dependency Representations |  | 0
Image Pivoting for Learning Multilingual Multimodal Representations |  | 0
Impressions: Understanding Visual Semiotics and Aesthetic Impact |  | 0
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments |  | 0
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models |  | 0
Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval | Code | 0
The Treasure beneath Convolutional Layers: Cross-convolutional-layer Pooling for Image Classification | Code | 0
On Architectures for Including Visual Information in Neural Language Models for Image Description | Code | 0
Bridging Languages through Images with Deep Partial Canonical Correlation Analysis | Code | 0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze | Code | 0
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions | Code | 0
Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline) | Code | 0
How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domain | Code | 0
IDEA: Image Description Enhanced CLIP-Adapter | Code | 0
Deep Imbalanced Attribute Classification using Visual Attention Aggregation | Code | 0
Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network | Code | 0
Unsupervised Image Captioning | Code | 0
Page 5 of 7

No leaderboard results yet.