SOTAVerified

Image Description

Papers

Showing 41–50 of 154 papers

| Title | Status | Hype |
| --- | --- | --- |
| Caption Anything: Interactive Image Description with Diverse Multimodal Controls | Code | 3 |
| MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models | Code | 7 |
| Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval | Code | 0 |
| DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset | Code | 1 |
| Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation | Code | 1 |
| Improving Visual-Semantic Embeddings by Learning Semantically-Enhanced Hard Negatives for Cross-modal Information Retrieval | Code | 0 |
| Facial Expression Recognition and Image Description Generation in Vietnamese | | 0 |
| Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network | Code | 0 |
| Image Description Dataset for Language Learners | | 0 |
| Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset | | 0 |
Page 5 of 16

No leaderboard results yet.