| Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models | Jan 30, 2023 | Audio GenerationText-to-Video Generation | CodeCode Available | 2 |
| ArchiSound: Audio Generation with Diffusion | Jan 30, 2023 | Audio GenerationGPU | CodeCode Available | 4 |
| SingSong: Generating musical accompaniments from singing | Jan 30, 2023 | Audio GenerationRetrieval | —Unverified | 0 |
| AudioLDM: Text-to-Audio Generation with Latent Diffusion Models | Jan 29, 2023 | AudioCapsAudio Generation | CodeCode Available | 4 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio GenerationRepresentation Learning | —Unverified | 0 |
| AudioGen: Textually Guided Audio Generation | Sep 30, 2022 | Audio GenerationDescriptive | CodeCode Available | 6 |
| AudioLM: a Language Modeling Approach to Audio Generation | Sep 7, 2022 | Audio Generation | CodeCode Available | 7 |
| Audio Deepfake Attribution: An Initial Dataset and Investigation | Aug 21, 2022 | Audio GenerationBinary Classification | —Unverified | 0 |
| Diffsound: Discrete Diffusion Model for Text-to-sound Generation | Jul 20, 2022 | Audio GenerationDecoder | CodeCode Available | 2 |
| Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Jun 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio GenerationAudio Synthesis | CodeCode Available | 3 |
| FlexLip: A Controllable Text-to-Lip System | Jun 7, 2022 | Audio Generationtext-to-speech | —Unverified | 0 |
| Symphony Generation with Permutation Invariant Language Model | May 10, 2022 | Audio GenerationDecoder | CodeCode Available | 2 |
| On Target Representation in Continuous-output Neural Machine Translation | May 1, 2022 | Audio GenerationMachine Translation | —Unverified | 0 |
| Differentiable Time-Frequency Scattering on GPU | Apr 18, 2022 | Audio GenerationCPU | CodeCode Available | 1 |
| Streamable Neural Audio Synthesis With Non-Causal Convolutions | Apr 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement | Mar 24, 2022 | Audio GenerationBandwidth Extension | CodeCode Available | 1 |
| It's Raw! Audio Generation with State-Space Models | Feb 20, 2022 | Audio GenerationDensity Estimation | CodeCode Available | 1 |
| ADD 2022: the First Audio Deep Synthesis Detection Challenge | Feb 17, 2022 | Audio Deepfake DetectionAudio Generation | —Unverified | 0 |
| Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus | Dec 20, 2021 | Audio GenerationSinging Voice Synthesis | CodeCode Available | 1 |
| Soundify: Matching Sound Effects to Video | Dec 17, 2021 | Audio GenerationImage Classification | —Unverified | 0 |
| Geometry-Aware Multi-Task Learning for Binaural Audio Generation from Video | Nov 21, 2021 | Audio GenerationMulti-Task Learning | —Unverified | 0 |
| RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses | Nov 1, 2021 | Audio GenerationGenerative Adversarial Network | CodeCode Available | 1 |
| Unsupervised Source Separation By Steering Pretrained Music Models | Oct 25, 2021 | Audio GenerationAudio Source Separation | CodeCode Available | 1 |
| Taming Visually Guided Sound Generation | Oct 17, 2021 | Audio GenerationGPU | CodeCode Available | 1 |
| An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution | Sep 30, 2021 | Audio GenerationAudio Super-Resolution | —Unverified | 0 |
| Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention | Aug 10, 2021 | Audio GenerationDecoder | —Unverified | 0 |
| Neural Waveshaping Synthesis | Jul 11, 2021 | Audio GenerationAudio Synthesis | CodeCode Available | 1 |
| CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis | Jun 14, 2021 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior | Jun 11, 2021 | Audio GenerationDenoising | CodeCode Available | 0 |
| Catch-A-Waveform: Learning to Generate Audio from a Single Short Example | Jun 11, 2021 | Audio GenerationSemantic Similarity | CodeCode Available | 1 |
| Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation | May 3, 2021 | Audio GenerationSelf-Supervised Learning | —Unverified | 0 |
| Visually Informed Binaural Audio Generation without Binaural Audios | Apr 13, 2021 | Audio Generation | —Unverified | 0 |
| Anytime Sampling for Autoregressive Models via Ordered Autoencoding | Feb 23, 2021 | Audio GenerationComputational Efficiency | CodeCode Available | 1 |
| Localize to Binauralize: Audio Spatialization From Visual Sound Source Localization | Jan 1, 2021 | Audio GenerationSound Source Localization | CodeCode Available | 1 |
| Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training | Dec 3, 2020 | Audio GenerationDisentanglement | CodeCode Available | 1 |
| A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions | Nov 13, 2020 | Audio GenerationMusic Generation | —Unverified | 0 |
| NU-GAN: High resolution neural upsampling with GAN | Oct 22, 2020 | Audio GenerationSpeech Synthesis | —Unverified | 0 |
| Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder | Aug 16, 2020 | Audio DequantizationAudio Generation | —Unverified | 0 |
| Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning | Aug 7, 2020 | Audio Generationreinforcement-learning | —Unverified | 0 |
| Neural Granular Sound Synthesis | Aug 4, 2020 | Audio Generation | —Unverified | 0 |
| Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation | Jul 20, 2020 | Audio Generation | —Unverified | 0 |
| Audeo: Audio Generation for a Silent Performance Video | Jun 23, 2020 | Audio GenerationAudio Synthesis | CodeCode Available | 1 |
| Perceiving Music Quality with GANs | Jun 11, 2020 | Audio GenerationAudio Quality Assessment | CodeCode Available | 1 |
| High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder | Jun 1, 2020 | Audio GenerationRepresentation Learning | —Unverified | 0 |
| Unconditional Audio Generation with Generative Adversarial Networks and Cycle Regularization | May 18, 2020 | Audio GenerationGenerative Adversarial Network | CodeCode Available | 1 |
| GACELA -- A generative adversarial context encoder for long audio inpainting | May 11, 2020 | Audio GenerationAudio inpainting | CodeCode Available | 1 |
| Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data | Mar 5, 2020 | Audio GenerationEmotion Recognition | —Unverified | 0 |
| Cross-modal variational inference for bijective signal-symbol translation | Feb 10, 2020 | Audio GenerationDensity Estimation | —Unverified | 0 |
| FastWave: Accelerating Autoregressive Convolutional Neural Networks on FPGA | Feb 9, 2020 | Audio GenerationAudio Synthesis | —Unverified | 0 |