| Unified speech and gesture synthesis using flow matching | Oct 8, 2023 | Audio SynthesisMotion Synthesis | —Unverified | 0 |
| Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer | Sep 26, 2023 | Audio Synthesis | —Unverified | 0 |
| DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing | Sep 14, 2023 | AttributeAudio Synthesis | —Unverified | 0 |
| SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias | Sep 14, 2023 | Audio SynthesisGenerative Adversarial Network | —Unverified | 0 |
| Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Jun 29, 2023 | Audio Synthesis | CodeCode Available | 2 |
| CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models | Jun 16, 2023 | Audio Synthesis | —Unverified | 0 |
| Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Jun 1, 2023 | Audio SynthesisComputational Efficiency | CodeCode Available | 4 |
| Continuous descriptor-based control for deep audio synthesis | Feb 27, 2023 | Audio Synthesiscontinuous-control | CodeCode Available | 1 |
| ECGAN: Self-supervised generative adversarial network for electrocardiography | Jan 23, 2023 | Audio SynthesisDiversity | —Unverified | 0 |
| Perceptual-Neural-Physical Sound Matching | Jan 7, 2023 | AttributeAudio Synthesis | CodeCode Available | 1 |
| DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect | Dec 14, 2022 | Audio Synthesis | —Unverified | 0 |
| Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance | Dec 13, 2022 | Audio SynthesisImage Generation | CodeCode Available | 1 |
| Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound | Nov 16, 2022 | Audio Synthesis | —Unverified | 0 |
| Full-band General Audio Synthesis with Score-based Diffusion | Oct 26, 2022 | Audio SynthesisDiversity | —Unverified | 0 |
| Anisotropic multiresolution analyses for deepfake detection | Oct 26, 2022 | Audio SynthesisDeepFake Detection | —Unverified | 0 |
| From Words to Sound: Neural Audio Synthesis of Guitar Sounds with Timbral Descriptors | Sep 17, 2022 | Audio Synthesis | CodeCode Available | 0 |
| Evaluating generative audio systems and their metrics | Aug 31, 2022 | Audio Synthesis | —Unverified | 0 |
| Convergence of denoising diffusion models under the manifold hypothesis | Aug 10, 2022 | Audio SynthesisDenoising | —Unverified | 0 |
| Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Jun 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| Realistic Gramophone Noise Synthesis using a Diffusion Model | Jun 13, 2022 | Audio Synthesis | CodeCode Available | 1 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio GenerationAudio Synthesis | CodeCode Available | 3 |
| Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator | Jun 5, 2022 | Audio SynthesisDisentanglement | —Unverified | 0 |
| BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis | May 30, 2022 | Audio Synthesis | CodeCode Available | 0 |
| Streamable Neural Audio Synthesis With Non-Causal Convolutions | Apr 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge | Mar 3, 2022 | Audio Deepfake DetectionAudio Synthesis | —Unverified | 0 |
| MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling | Dec 17, 2021 | Audio Synthesis | CodeCode Available | 1 |
| Differentiable Wavetable Synthesis | Nov 19, 2021 | Audio SynthesisOne-Shot Learning | CodeCode Available | 1 |
| Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations | Nov 14, 2021 | Audio Synthesis | —Unverified | 0 |
| CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders | Nov 9, 2021 | Audio SynthesisMixed Reality | CodeCode Available | 0 |
| RAVE: A variational autoencoder for fast and high-quality neural audio synthesis | Nov 9, 2021 | Audio SynthesisCPU | CodeCode Available | 2 |
| Estimating High Order Gradients of the Data Distribution by Denoising | Nov 8, 2021 | Audio SynthesisDenoising | —Unverified | 0 |
| Synthesising Audio Adversarial Examples for Automatic Speech Recognition | Sep 29, 2021 | Audio SynthesisAutomatic Speech Recognition | —Unverified | 0 |
| A Survey on Audio Synthesis and Audio-Visual Multimodal Processing | Aug 1, 2021 | Audio SynthesisMusic Generation | —Unverified | 0 |
| Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition | Jul 23, 2021 | Audio Synthesisspeech-recognition | —Unverified | 0 |
| Neural Waveshaping Synthesis | Jul 11, 2021 | Audio GenerationAudio Synthesis | CodeCode Available | 1 |
| CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation | Jul 7, 2021 | Audio SynthesisImage Generation | CodeCode Available | 1 |
| A Generative Model for Raw Audio Using Transformer Architectures | Jun 30, 2021 | Audio Synthesis | —Unverified | 0 |
| CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis | Jun 14, 2021 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| Fre-GAN: Adversarial Frequency-consistent Audio Synthesis | Jun 4, 2021 | Audio Synthesis | CodeCode Available | 1 |
| DPLM: A Deep Perceptual Spatial-Audio Localization Metric | May 29, 2021 | Audio Synthesis | —Unverified | 0 |
| VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding | May 4, 2021 | Audio Synthesis | CodeCode Available | 1 |
| Points2Sound: From mono to binaural audio using 3D point cloud scenes | Apr 26, 2021 | Audio Synthesis | CodeCode Available | 1 |
| On tuning consistent annealed sampling for denoising score matching | Apr 8, 2021 | Audio SynthesisDenoising | —Unverified | 0 |
| Signal Representations for Synthesizing Audio Textures with Generative Adversarial Networks | Mar 12, 2021 | Audio Synthesis | —Unverified | 0 |
| Real-time Timbre Transfer and Sound Synthesis using DDSP | Mar 12, 2021 | Audio Synthesis | CodeCode Available | 1 |
| Anyone GAN Sing | Feb 22, 2021 | Audio Synthesis | —Unverified | 0 |
| Upsampling artifacts in neural audio synthesis | Oct 27, 2020 | Audio Signal ProcessingAudio Synthesis | CodeCode Available | 1 |
| A Novel Face-tracking Mouth Controller and its Application to Interacting with Bioacoustic Models | Oct 7, 2020 | Audio Synthesis | —Unverified | 0 |
| DiffWave: A Versatile Diffusion Model for Audio Synthesis | Sep 21, 2020 | Audio SynthesisDiversity | CodeCode Available | 1 |
| DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks | Aug 27, 2020 | Audio SynthesisGenerative Adversarial Network | CodeCode Available | 1 |