SOTAVerified

AudioCaps

Papers

Showing 6164 of 64 papers

TitleStatusHype
Accommodating Audio Modality in CLIP for Multimodal ProcessingCode0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution ErrorsCode0
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and GenerationCode0
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGSCode0
Show:102550
← PrevPage 7 of 7Next →

No leaderboard results yet.