SOTAVerified

AudioCaps

Papers

Showing 3140 of 64 papers

TitleStatusHype
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and GenerationCode0
Accommodating Audio Modality in CLIP for Multimodal ProcessingCode0
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGSCode0
Audiobox: Unified Audio Generation with Natural Language Prompts0
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling0
Joint Speech Recognition and Audio Captioning0
Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval?0
Language-based Audio Retrieval with Co-Attention Networks0
TAIL: Text-Audio Incremental Learning0
Leveraging Pre-trained BERT for Audio Captioning0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.