| Detecting immune cells with label-free two-photon autofluorescence and deep learning | Jun 17, 2025 | Binary ClassificationClassification | —Unverified | 0 |
| FAD-Net: Frequency-Domain Attention-Guided Diffusion Network for Coronary Artery Segmentation using Invasive Coronary Angiography | Jun 13, 2025 | Coronary Artery SegmentationFAD | —Unverified | 0 |
| BemaGANv2: A Tutorial and Comparative Survey of GAN-based Vocoders for Long-Term Audio Generation | Jun 11, 2025 | Audio GenerationFAD | CodeCode Available | 1 |
| IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling | May 31, 2025 | AudioCapsAudio Generation | —Unverified | 0 |
| FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning | May 13, 2025 | Cross-Domain Few-Shotcross-domain few-shot learning | —Unverified | 0 |
| Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Apr 29, 2025 | Action GenerationFAD | —Unverified | 0 |
| DOSE : Drum One-Shot Extraction from Music Mixture | Apr 25, 2025 | FAD | CodeCode Available | 1 |
| DRAGON: Distributional Rewards Optimize Diffusion Generative Models | Apr 21, 2025 | FAD | —Unverified | 0 |
| Enhancing U.S. swine farm preparedness for infectious foreign animal diseases with rapid access to biosecurity information | Apr 12, 2025 | FAD | —Unverified | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio SynthesisFAD | —Unverified | 0 |
| Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization | Mar 28, 2025 | Audio GenerationFAD | CodeCode Available | 1 |
| Aligning Text-to-Music Evaluation with Human Preferences | Mar 20, 2025 | FAD | CodeCode Available | 1 |
| FlowDec: A flow-based full-band general audio codec with high perceptual quality | Mar 3, 2025 | FAD | CodeCode Available | 2 |
| KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation | Feb 21, 2025 | Audio GenerationFAD | CodeCode Available | 2 |
| RenderBox: Expressive Performance Rendering with Text Control | Feb 11, 2025 | DiversityFAD | —Unverified | 0 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 |
| Sound Scene Synthesis at the DCASE 2024 Challenge | Jan 15, 2025 | FAD | —Unverified | 0 |
| Market Making with Fads, Informed, and Uninformed Traders | Jan 7, 2025 | FAD | —Unverified | 0 |
| Frechet Music Distance: A Metric For Generative Symbolic Music Evaluation | Dec 10, 2024 | FADMusic Generation | CodeCode Available | 1 |
| MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System | Sep 27, 2024 | Anomaly DetectionFAD | —Unverified | 0 |
| Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Sep 23, 2024 | Emotion RecognitionFAD | CodeCode Available | 3 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FADImage Captioning | —Unverified | 0 |
| Multi-Source Music Generation with Latent Diffusion | Sep 10, 2024 | FADMusic Generation | CodeCode Available | 1 |
| Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer | Sep 9, 2024 | FAD | —Unverified | 0 |
| AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning with Only Normal Samples | Aug 24, 2024 | Anomaly DetectionDecoder | CodeCode Available | 0 |