SOTAVerified

Caption Generation

Papers

Showing 2130 of 310 papers

TitleStatusHype
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic DataCode1
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change CaptioningCode1
HCQA @ Ego4D EgoSchema Challenge 2024Code1
SoccerNet-Echoes: A Soccer Game Audio Commentary DatasetCode1
BCAmirs at SemEval-2024 Task 4: Beyond Words: A Multimodal and Multilingual Exploration of Persuasion in MemesCode1
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph EnrichmentCode1
VLIS: Unimodal Language Models Guide Multimodal Language GenerationCode1
Self-supervised Cross-view Representation Reconstruction for Change CaptioningCode1
RECAP: Retrieval-Augmented Audio CaptioningCode1
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query ResponseCode1
Show:102550
← PrevPage 3 of 31Next →

No leaderboard results yet.