SOTAVerified

Image Description

Papers

Showing 51–60 of 154 papers

| Title | Status | Hype |
| --- | --- | --- |
| Face2Text revisited: Improved data set and baseline results | | 0 |
| Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation | | 0 |
| Multimodal fusion via cortical network inspired losses | | 0 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Code | 1 |
| Neural Dependency Coding inspired Multimodal Fusion | | 0 |
| CIDEr-R: Robust Consensus-based Image Description Evaluation | | 0 |
| Cross Modification Attention Based Deliberation Model for Image Captioning | | 0 |
| SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant | | 0 |
| Zero-Shot Out-of-Distribution Detection Based on the Pre-trained Model CLIP | Code | 1 |
| Revisiting Binary Local Image Description for Resource Limited Devices | Code | 1 |

No leaderboard results yet.