| Audio Retrieval with Natural Language Queries | May 5, 2021 | AudioCapsAudio to Text Retrieval | CodeCode Available | 1 |
| ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation | Sep 19, 2023 | AudioCapsAudio Generation | CodeCode Available | 1 |
| Audio Retrieval with Natural Language Queries: A Benchmark Study | Dec 17, 2021 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Audio Retrieval with WavText5K and CLAP Training | Sep 28, 2022 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Prefix tuning for automated audio captioning | Mar 30, 2023 | AudioCapsAudio captioning | CodeCode Available | 1 |
| LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Jan 16, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates | Nov 14, 2022 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Audio Captioning Transformer | Jul 21, 2021 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Bridging Language Gaps in Audio-Text Retrieval | Jun 11, 2024 | AudioCapsRetrieval | CodeCode Available | 1 |