| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 |
| A Fast Automatic Method for Deconvoluting Macro X-ray Fluorescence Data Collected from Easel Paintings | Oct 31, 2022 | FAD | —Unverified | 0 |
| A General Framework for Learning Procedural Audio Models of Environmental Sounds | Mar 4, 2023 | FAD | —Unverified | 0 |
| A Study on Robustness to Perturbations for Representations of Environmental Sound | Mar 20, 2022 | FADTransfer Learning | —Unverified | 0 |
| Audiobox: Unified Audio Generation with Natural Language Prompts | Dec 25, 2023 | AudioCapsAudio Generation | —Unverified | 0 |
| Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2 | Jul 19, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FADImage Captioning | —Unverified | 0 |
| Detecting immune cells with label-free two-photon autofluorescence and deep learning | Jun 17, 2025 | Binary ClassificationClassification | —Unverified | 0 |
| DRAGON: Distributional Rewards Optimize Diffusion Generative Models | Apr 21, 2025 | FAD | —Unverified | 0 |
| Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Apr 29, 2025 | Action GenerationFAD | —Unverified | 0 |
| Enhancing U.S. swine farm preparedness for infectious foreign animal diseases with rapid access to biosecurity information | Apr 12, 2025 | FAD | —Unverified | 0 |
| Exploring compressibility of transformer based text-to-music (TTM) models | Jun 24, 2024 | DecoderFAD | —Unverified | 0 |
| FaceCat: Enhancing Face Recognition Security with a Unified Diffusion Model | Apr 14, 2024 | Face Anti-SpoofingFace Recognition | —Unverified | 0 |
| FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning | May 13, 2025 | Cross-Domain Few-Shotcross-domain few-shot learning | —Unverified | 0 |
| FAD-Net: Frequency-Domain Attention-Guided Diffusion Network for Coronary Artery Segmentation using Invasive Coronary Angiography | Jun 13, 2025 | Coronary Artery SegmentationFAD | —Unverified | 0 |
| FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method | Apr 28, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning | Oct 26, 2022 | Cross-Modal RetrievalDecoder | —Unverified | 0 |
| Feature Adversarial Distillation for Point Cloud Classification | Jun 25, 2023 | ClassificationFAD | —Unverified | 0 |
| Quantum Machine Learning: Fad or Future? | Jun 20, 2021 | BIG-bench Machine LearningFAD | —Unverified | 0 |
| RenderBox: Expressive Performance Rendering with Text Control | Feb 11, 2025 | DiversityFAD | —Unverified | 0 |
| Responding to Illegal Activities Along the Canadian Coastlines Using Reinforcement Learning | Aug 5, 2021 | FADreinforcement-learning | —Unverified | 0 |
| Retrieval-Augmented Text-to-Audio Generation | Sep 14, 2023 | AudioCapsAudio Generation | —Unverified | 0 |
| Sensing Performance of Multi-Channel RFID-based Finger Augmentation Devices for Tactile Internet | May 24, 2022 | FAD | —Unverified | 0 |
| Sound Scene Synthesis at the DCASE 2024 Challenge | Jan 15, 2025 | FAD | —Unverified | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio SynthesisFAD | —Unverified | 0 |