SOTAVerified

Image Description

Papers

Showing 5175 of 154 papers

TitleStatusHype
Face2Text revisited: Improved data set and baseline results0
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation0
Multimodal fusion via cortical network inspired losses0
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingCode1
Neural Dependency Coding inspired Multimodal Fusion0
CIDEr-R: Robust Consensus-based Image Description EvaluationCode0
Cross Modification Attention Based Deliberation Model for Image Captioning0
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant0
Zero-Shot Out-of-Distribution Detection Based on the Pre-trained Model CLIPCode1
Revisiting Binary Local Image Description for Resource Limited DevicesCode1
Computer Vision and Conflicting Values: Describing People with Automated Alt Text0
How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domainCode0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
EPYNET: Efficient Pyramidal Network for Clothing Segmentation0
Multimodal Word Sense Disambiguation in Creative PracticeCode0
On the use of human reference data for evaluating automatic image descriptions0
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs0
Textual Visual Semantic Dataset for Text Spotting0
On Architectures for Including Visual Information in Neural Language Models for Image DescriptionCode0
Tell Me More: A Dataset of Visual Scene Description Sequences0
A Hierarchical Approach for Visual Storytelling Using Image Description0
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions0
Place recognition in gardens by learning visual representations: data set and benchmark analysis0
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments0
What a neural language model tells us about spatial relationsCode0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.