SOTAVerified

AudioCaps

Papers

Showing 110 of 64 papers

TitleStatusHype
GLAP: General contrastive audio-text pretraining across domains and languagesCode2
AC/DC: LLM-based Audio Comprehension via Dialogue Continuation0
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling0
Mitigating Audiovisual Mismatch in Visual-Guide Audio Captioning0
AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion0
DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap0
Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model0
TAIL: Text-Audio Incremental Learning0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution ErrorsCode0
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning0
Show:102550
← PrevPage 1 of 7Next →

No leaderboard results yet.