| Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization | Feb 3, 2024 | Audio Generation, Denoising | Unverified | 0 |
| Bass Accompaniment Generation via Latent Diffusion | Feb 2, 2024 | Audio Generation | Unverified | 0 |
| ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering | Jan 14, 2024 | Audio Generation, Language Modeling | Unverified | 0 |
| Masked Audio Generation using a Single Non-Autoregressive Transformer | Jan 9, 2024 | Audio Generation | Unverified | 0 |
| Efficient Parallel Audio Generation using Group Masked Language Modeling | Jan 2, 2024 | Audio Generation, Computational Efficiency | Unverified | 0 |
| Cyclic Learning for Binaural Audio Generation and Localization | Jan 1, 2024 | Audio Generation, Object | Unverified | 0 |
| Audiobox: Unified Audio Generation with Natural Language Prompts | Dec 25, 2023 | AudioCaps, Audio Generation | Unverified | 0 |
| Diffusion-EXR: Controllable Review Generation for Explainable Recommendation via Diffusion Models | Dec 24, 2023 | Audio Generation, Denoising | Unverified | 0 |
| CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling | Dec 8, 2023 | Audio Generation | Unverified | 0 |
| SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement | Dec 4, 2023 | Audio Generation, Speech Enhancement | Unverified | 0 |
| tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models | Nov 24, 2023 | Audio Generation, Event Detection | Unverified | 0 |
| Cross-modal Generative Model for Visual-Guided Binaural Stereo Generation | Nov 13, 2023 | Attribute, Audio Generation | Unverified | 0 |
| On The Open Prompt Challenge In Conditional Audio Generation | Nov 1, 2023 | Audio Generation | Unverified | 0 |
| In-Context Prompt Editing For Conditional Audio Generation | Nov 1, 2023 | Audio Generation, Retrieval | Unverified | 0 |
| Audio Editing with Non-Rigid Text Prompts | Oct 19, 2023 | Audio Generation, Style Transfer | Unverified | 0 |
| FoleyGen: Visually-Guided Audio Generation | Sep 19, 2023 | Audio Generation, Language Modeling | Unverified | 0 |
| Enhance audio generation controllability through representation similarity regularization | Sep 15, 2023 | Audio Generation, Language Modeling | Unverified | 0 |
| Retrieval-Augmented Text-to-Audio Generation | Sep 14, 2023 | AudioCaps, Audio Generation | Unverified | 0 |
| Advances in machine-learning-based sampling motivated by lattice quantum chromodynamics | Sep 3, 2023 | Audio Generation | Unverified | 0 |
| An Initial Exploration: Learning to Generate Realistic Audio for Silent Video | Aug 23, 2023 | Audio Generation | Code Available | 0 |
| Audio Generation with Multiple Conditional Diffusion Model | Aug 23, 2023 | Audio Generation, Diversity | Unverified | 0 |
| IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models | Jul 24, 2023 | Audio Generation, Music Generation | Unverified | 0 |
| A Demand-Driven Perspective on Generative Audio AI | Jul 10, 2023 | Audio Generation, Survey | Unverified | 0 |
| LM-VC: Zero-shot Voice Conversion via Speech Generation based on Language Models | Jun 18, 2023 | Audio Generation, Disentanglement | Unverified | 0 |
| MuseCoco: Generating Symbolic Music from Text | May 31, 2023 | Attribute, Audio Generation | Code Available | 0 |
| DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment | May 22, 2023 | AudioCaps, Audio Generation | Unverified | 0 |
| Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation | Mar 29, 2023 | Audio Generation, Contrastive Learning | Code Available | 0 |
| Leveraging Pre-trained AudioLDM for Sound Generation: A Benchmark Study | Mar 7, 2023 | Audio Generation, Benchmarking | Unverified | 0 |
| SingSong: Generating musical accompaniments from singing | Jan 30, 2023 | Audio Generation, Retrieval | Unverified | 0 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio Generation, Representation Learning | Unverified | 0 |
| Audio Deepfake Attribution: An Initial Dataset and Investigation | Aug 21, 2022 | Audio Generation, Binary Classification | Unverified | 0 |
| Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Jun 14, 2022 | Audio Generation, Audio Synthesis | Unverified | 0 |
| FlexLip: A Controllable Text-to-Lip System | Jun 7, 2022 | Audio Generation, text-to-speech | Unverified | 0 |
| On Target Representation in Continuous-output Neural Machine Translation | May 1, 2022 | Audio Generation, Machine Translation | Unverified | 0 |
| Streamable Neural Audio Synthesis With Non-Causal Convolutions | Apr 14, 2022 | Audio Generation, Audio Synthesis | Unverified | 0 |
| ADD 2022: the First Audio Deep Synthesis Detection Challenge | Feb 17, 2022 | Audio Deepfake Detection, Audio Generation | Unverified | 0 |
| Soundify: Matching Sound Effects to Video | Dec 17, 2021 | Audio Generation, Image Classification | Unverified | 0 |
| Geometry-Aware Multi-Task Learning for Binaural Audio Generation from Video | Nov 21, 2021 | Audio Generation, Multi-Task Learning | Unverified | 0 |
| An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution | Sep 30, 2021 | Audio Generation, Audio Super-Resolution | Unverified | 0 |
| Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention | Aug 10, 2021 | Audio Generation, Decoder | Unverified | 0 |
| CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis | Jun 14, 2021 | Audio Generation, Audio Synthesis | Unverified | 0 |
| PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior | Jun 11, 2021 | Audio Generation, Denoising | Unverified | 0 |
| Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation | May 3, 2021 | Audio Generation, Self-Supervised Learning | Unverified | 0 |
| Visually Informed Binaural Audio Generation without Binaural Audios | Apr 13, 2021 | Audio Generation | Unverified | 0 |
| A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions | Nov 13, 2020 | Audio Generation, Music Generation | Unverified | 0 |
| NU-GAN: High resolution neural upsampling with GAN | Oct 22, 2020 | Audio Generation, Speech Synthesis | Unverified | 0 |
| Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder | Aug 16, 2020 | Audio Dequantization, Audio Generation | Unverified | 0 |
| Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning | Aug 7, 2020 | Audio Generation, reinforcement-learning | Unverified | 0 |
| Neural Granular Sound Synthesis | Aug 4, 2020 | Audio Generation | Unverified | 0 |
| Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation | Jul 20, 2020 | Audio Generation | Unverified | 0 |