| MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis | Dec 19, 2024 | Audio Generation, Audio Synthesis | Code Available | 7 | 5 |
| AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Jun 1, 2024 | Audio Generation, Audio Synthesis | Code Available | 5 | 5 |
| Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Jun 1, 2023 | Audio Synthesis, Computational Efficiency | Code Available | 4 | 5 |
| Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Mar 4, 2024 | Audio Synthesis, Decoder | Code Available | 3 | 5 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio Generation, Audio Synthesis | Code Available | 3 | 5 |
| DiffMoog: a Differentiable Modular Synthesizer for Sound Matching | Jan 23, 2024 | Audio Synthesis | Code Available | 2 | 5 |
| Efficient Neural Audio Synthesis | Feb 23, 2018 | Audio Synthesis, CPU | Code Available | 2 | 5 |
| FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models | Dec 13, 2023 | 3D Face Animation, Audio Synthesis | Code Available | 2 | 5 |
| DDSP: Differentiable Digital Signal Processing | Jan 14, 2020 | Audio Generation, Audio Synthesis | Code Available | 2 | 5 |
| Taming Data and Transformers for Audio Generation | Jun 27, 2024 | Audio captioning, Audio Generation | Code Available | 2 | 5 |
| Differentiable All-pole Filters for Time-varying Audio Systems | Apr 11, 2024 | All, Audio Effects Modeling | Code Available | 2 | 5 |
| RAVE: A variational autoencoder for fast and high-quality neural audio synthesis | Nov 9, 2021 | Audio Synthesis, CPU | Code Available | 2 | 5 |
| OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Dec 2, 2024 | Audio Synthesis, Image Generation | Code Available | 2 | 5 |
| Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis | Jun 7, 2024 | Audio Synthesis | Code Available | 2 | 5 |
| Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Jun 29, 2023 | Audio Synthesis | Code Available | 2 | 5 |
| SpeedySpeech: Efficient Neural Speech Synthesis | Aug 9, 2020 | Audio Synthesis, CPU | Code Available | 1 | 5 |
| CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation | Jul 7, 2021 | Audio Synthesis, Image Generation | Code Available | 1 | 5 |
| Where are we in audio deepfake detection? A systematic analysis over generative and detection models | Oct 6, 2024 | Audio Deepfake Detection, Audio Synthesis | Code Available | 1 | 5 |
| Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance | Dec 13, 2022 | Audio Synthesis, Image Generation | Code Available | 1 | 5 |
| T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Jan 17, 2024 | Audio Generation, Audio Synthesis | Code Available | 1 | 5 |
| Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders | Apr 5, 2017 | Audio Synthesis, Decoder | Code Available | 1 | 5 |
| MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling | Dec 17, 2021 | Audio Synthesis | Code Available | 1 | 5 |
| Realistic Gramophone Noise Synthesis using a Diffusion Model | Jun 13, 2022 | Audio Synthesis | Code Available | 1 | 5 |
| Continuous descriptor-based control for deep audio synthesis | Feb 27, 2023 | Audio Synthesis, continuous-control | Code Available | 1 | 5 |
| Audeo: Audio Generation for a Silent Performance Video | Jun 23, 2020 | Audio Generation, Audio Synthesis | Code Available | 1 | 5 |
| Real-time Timbre Transfer and Sound Synthesis using DDSP | Mar 12, 2021 | Audio Synthesis | Code Available | 1 | 5 |
| VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding | May 4, 2021 | Audio Synthesis | Code Available | 1 | 5 |
| Neural Waveshaping Synthesis | Jul 11, 2021 | Audio Generation, Audio Synthesis | Code Available | 1 | 5 |
| DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks | Aug 27, 2020 | Audio Synthesis, Generative Adversarial Network | Code Available | 1 | 5 |
| Tacotron: Towards End-to-End Speech Synthesis | Mar 29, 2017 | Audio Synthesis, Speech Synthesis | Code Available | 1 | 5 |
| LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis | Jul 15, 2024 | Audio Generation, Audio Synthesis | Code Available | 1 | 5 |
| Differentiable Wavetable Synthesis | Nov 19, 2021 | Audio Synthesis, One-Shot Learning | Code Available | 1 | 5 |
| An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling | Mar 4, 2018 | Audio Synthesis, Language Modelling | Code Available | 1 | 5 |
| Points2Sound: From mono to binaural audio using 3D point cloud scenes | Apr 26, 2021 | Audio Synthesis | Code Available | 1 | 5 |
| Upsampling artifacts in neural audio synthesis | Oct 27, 2020 | Audio Signal Processing, Audio Synthesis | Code Available | 1 | 5 |
| Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance | Jun 16, 2020 | Audio Synthesis | Code Available | 1 | 5 |
| Generative diffusion model with inverse renormalization group flows | Jan 15, 2025 | Audio Synthesis, Denoising | Code Available | 1 | 5 |
| Perceptual-Neural-Physical Sound Matching | Jan 7, 2023 | Attribute, Audio Synthesis | Code Available | 1 | 5 |
| VaPar Synth -- A Variational Parametric Model for Audio Synthesis | Mar 30, 2020 | Audio Synthesis | Code Available | 1 | 5 |
| Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates | May 9, 2025 | Audio Synthesis, CPU | Code Available | 1 | 5 |
| Adversarial Audio Synthesis | Feb 12, 2018 | Audio Generation, Audio Synthesis | Code Available | 1 | 5 |
| Fre-GAN: Adversarial Frequency-consistent Audio Synthesis | Jun 4, 2021 | Audio Synthesis | Code Available | 1 | 5 |
| ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning | Sep 19, 2024 | Audio Synthesis | Code Available | 1 | 5 |
| DiffWave: A Versatile Diffusion Model for Audio Synthesis | Sep 21, 2020 | Audio Synthesis, Diversity | Code Available | 1 | 5 |
| Adversarial Generation of Time-Frequency Features with application in audio synthesis | Feb 11, 2019 | Audio Generation, Audio Synthesis | Code Available | 0 | 5 |
| SING: Symbol-to-Instrument Neural Generator | Oct 23, 2018 | Audio Synthesis, Decoder | Code Available | 0 | 5 |
| Music Source Separation in the Waveform Domain | Nov 27, 2019 | Audio Generation, Audio Synthesis | Code Available | 0 | 5 |
| A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models | Oct 29, 2019 | Audio Synthesis | Code Available | 0 | 5 |
| CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders | Nov 9, 2021 | Audio Synthesis, Mixed Reality | Code Available | 0 | 5 |
| Introducing Latent Timbre Synthesis | May 31, 2020 | Audio Synthesis | Code Available | 0 | 5 |