| Audiobox: Unified Audio Generation with Natural Language Prompts | Dec 25, 2023 | AudioCapsAudio Generation | —Unverified | 0 | 0 |
| Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2 | Jul 19, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FADImage Captioning | —Unverified | 0 | 0 |
| Detecting immune cells with label-free two-photon autofluorescence and deep learning | Jun 17, 2025 | Binary ClassificationClassification | —Unverified | 0 | 0 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 | 0 |
| DRAGON: Distributional Rewards Optimize Diffusion Generative Models | Apr 21, 2025 | FAD | —Unverified | 0 | 0 |
| Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Apr 29, 2025 | Action GenerationFAD | —Unverified | 0 | 0 |
| Enhancing U.S. swine farm preparedness for infectious foreign animal diseases with rapid access to biosecurity information | Apr 12, 2025 | FAD | —Unverified | 0 | 0 |
| Exploring compressibility of transformer based text-to-music (TTM) models | Jun 24, 2024 | DecoderFAD | —Unverified | 0 | 0 |
| FaceCat: Enhancing Face Recognition Security with a Unified Diffusion Model | Apr 14, 2024 | Face Anti-SpoofingFace Recognition | —Unverified | 0 | 0 |