SOTAVerified

AudioCaps

Papers

Showing 2130 of 64 papers

TitleStatusHype
Can Audio Captions Be Evaluated with Image Caption Metrics?Code1
Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidatesCode1
Target Sound Extraction with Variable Cross-modality CluesCode1
Visually-Aware Audio Captioning With Adaptive Audio-Visual AttentionCode1
Audio Retrieval with Natural Language QueriesCode1
Revisiting Deep Audio-Text Retrieval Through the Lens of TransportationCode1
Estimated Audio-Caption Correspondences Improve Language-Based Audio RetrievalCode0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution ErrorsCode0
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMsCode0
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and GenerationCode0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.