SOTAVerified

AudioCaps

Papers

Showing 3140 of 64 papers

TitleStatusHype
DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap0
Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model0
TAIL: Text-Audio Incremental Learning0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution ErrorsCode0
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning0
Language-based Audio Retrieval with Co-Attention Networks0
Enhancing Retrieval-Augmented Audio Captioning with Generation-Assisted Multimodal Querying and Progressive Learning0
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMsCode0
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval0
Dissecting Temporal Understanding in Text-to-Audio Retrieval0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.