| Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Sep 23, 2024 | Emotion RecognitionFAD | CodeCode Available | 3 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FADImage Captioning | —Unverified | 0 |
| Multi-Source Music Generation with Latent Diffusion | Sep 10, 2024 | FADMusic Generation | CodeCode Available | 1 |
| Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer | Sep 9, 2024 | FAD | —Unverified | 0 |
| AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning with Only Normal Samples | Aug 24, 2024 | Anomaly DetectionDecoder | CodeCode Available | 0 |
| Efficient Autoregressive Audio Modeling via Next-Scale Prediction | Aug 16, 2024 | Audio GenerationFAD | CodeCode Available | 2 |
| L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Aug 7, 2024 | 3D Object DetectionAutonomous Navigation | CodeCode Available | 2 |
| Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2 | Jul 19, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| Taming Data and Transformers for Audio Generation | Jun 27, 2024 | Audio captioningAudio Generation | CodeCode Available | 2 |
| Exploring compressibility of transformer based text-to-music (TTM) models | Jun 24, 2024 | DecoderFAD | —Unverified | 0 |