| MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models | Jun 7, 2024 | FADText-to-Music Generation | CodeCode Available | 2 |
| Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI | May 29, 2024 | FAD | CodeCode Available | 0 |
| FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method | Apr 28, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| FaceCat: Enhancing Face Recognition Security with a Unified Diffusion Model | Apr 14, 2024 | Face Anti-SpoofingFace Recognition | —Unverified | 0 |
| Latent CLAP Loss for Better Foley Sound Synthesis | Mar 18, 2024 | FAD | CodeCode Available | 0 |
| MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction | Jan 24, 2024 | FAD | —Unverified | 0 |
| Audiobox: Unified Audio Generation with Natural Language Prompts | Dec 25, 2023 | AudioCapsAudio Generation | —Unverified | 0 |
| Adapting Frechet Audio Distance for Generative Music Evaluation | Nov 2, 2023 | FAD | CodeCode Available | 2 |
| Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis | Sep 21, 2023 | FADInformation Retrieval | —Unverified | 0 |
| Retrieval-Augmented Text-to-Audio Generation | Sep 14, 2023 | AudioCapsAudio Generation | —Unverified | 0 |