SOTAVerified

AudioCaps

Papers

Showing 51-64 of 64 papers

| Title | Status | Hype |
| --- | --- | --- |
| Text-to-Audio Generation Synchronized with Videos | | 0 |
| Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model | | 0 |
| AC/DC: LLM-based Audio Comprehension via Dialogue Continuation | | 0 |
| Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer | | 0 |
| Retrieval-Augmented Text-to-Audio Generation | | 0 |
| Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning | | 0 |
| Audio-text Retrieval in Context | | 0 |
| Estimated Audio-Caption Correspondences Improve Language-Based Audio Retrieval | Code | 0 |
| Accommodating Audio Modality in CLIP for Multimodal Processing | Code | 0 |
| MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation | Code | 0 |
| AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGS | Code | 0 |
| SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Code | 0 |
| ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors | Code | 0 |
| Weakly-supervised Automated Audio Captioning via text only training | Code | 0 |

No leaderboard results yet.