| RAVE: A variational autoencoder for fast and high-quality neural audio synthesis | Nov 9, 2021 | Audio SynthesisCPU | CodeCode Available | 2 | 5 |
| FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models | Dec 13, 2023 | 3D Face AnimationAudio Synthesis | CodeCode Available | 2 | 5 |
| OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Dec 2, 2024 | Audio SynthesisImage Generation | CodeCode Available | 2 | 5 |
| Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Jun 29, 2023 | Audio Synthesis | CodeCode Available | 2 | 5 |
| Taming Data and Transformers for Audio Generation | Jun 27, 2024 | Audio captioningAudio Generation | CodeCode Available | 2 | 5 |
| Audeo: Audio Generation for a Silent Performance Video | Jun 23, 2020 | Audio GenerationAudio Synthesis | CodeCode Available | 1 | 5 |
| Differentiable Wavetable Synthesis | Nov 19, 2021 | Audio SynthesisOne-Shot Learning | CodeCode Available | 1 | 5 |
| Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates | May 9, 2025 | Audio SynthesisCPU | CodeCode Available | 1 | 5 |
| DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks | Aug 27, 2020 | Audio SynthesisGenerative Adversarial Network | CodeCode Available | 1 | 5 |
| An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling | Mar 4, 2018 | Audio SynthesisLanguage Modelling | CodeCode Available | 1 | 5 |