| CLOTH4D: A Dataset for Clothed Human Reconstruction | Jan 1, 2023 | FAD | CodeCode Available | 0 | 5 |
| Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference | Mar 14, 2023 | FADTranslation | CodeCode Available | 0 | 5 |
| Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning | Nov 28, 2022 | FADVideo Captioning | CodeCode Available | 0 | 5 |
| Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI | May 29, 2024 | FAD | CodeCode Available | 0 | 5 |
| Latent CLAP Loss for Better Foley Sound Synthesis | Mar 18, 2024 | FAD | CodeCode Available | 0 | 5 |
| Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms | Jun 25, 2022 | FAD | CodeCode Available | 0 | 5 |
| Flatness-Aware Minimization for Domain Generalization | Jul 20, 2023 | Domain GeneralizationFAD | —Unverified | 0 | 0 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 | 0 |
| Federated Automatic Differentiation | Jan 18, 2023 | FADFederated Learning | —Unverified | 0 | 0 |
| Feature Adversarial Distillation for Point Cloud Classification | Jun 25, 2023 | ClassificationFAD | —Unverified | 0 | 0 |