SOTAVerified

Caption Generation

Papers

Showing 91100 of 310 papers

TitleStatusHype
Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning0
LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation0
VidCoM: Fast Video Comprehension through Large Language Models with Multimodal Tools0
ViPE: Visualise Pretty-much EverythingCode0
VLIS: Unimodal Language Models Guide Multimodal Language GenerationCode1
A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation0
Self-supervised Cross-view Representation Reconstruction for Change CaptioningCode1
FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images0
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning0
RECAP: Retrieval-Augmented Audio CaptioningCode1
Show:102550
← PrevPage 10 of 31Next →

No leaderboard results yet.