| Generative Audio Synthesis with a Parametric Model | Nov 15, 2019 | Audio Synthesis, model | —Unverified | 0 |
| GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis | Jul 15, 2024 | Audio Synthesis, Decoder | —Unverified | 0 |
| Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music | Aug 22, 2024 | Audio Synthesis | —Unverified | 0 |
| HpRNet : Incorporating Residual Noise Modeling for Violin in a Variational Parametric Synthesizer | Aug 19, 2020 | Audio Synthesis | —Unverified | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio Synthesis, FAD | —Unverified | 0 |
| Text2Data: Low-Resource Data Generation with Textual Control | Feb 8, 2024 | Audio Synthesis, Time Series | —Unverified | 0 |
| The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge | Mar 3, 2022 | Audio Deepfake Detection, Audio Synthesis | —Unverified | 0 |
| Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations | Nov 14, 2021 | Audio Synthesis | —Unverified | 0 |
| Transferring neural speech waveform synthesizers to musical instrument sounds generation | Oct 27, 2019 | Audio Generation, Audio Synthesis | —Unverified | 0 |
| Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control | Dec 29, 2024 | Audio Generation, Audio Synthesis | —Unverified | 0 |
| Unified speech and gesture synthesis using flow matching | Oct 8, 2023 | Audio Synthesis, Motion Synthesis | —Unverified | 0 |
| Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition | Jul 23, 2021 | Audio Synthesis, speech-recognition | —Unverified | 0 |
| Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound | Aug 21, 2024 | Audio Generation, Audio Synthesis | —Unverified | 0 |
| VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Nov 22, 2024 | Audio Synthesis, Decoder | —Unverified | 0 |
| XAttnMark: Learning Robust Audio Watermarking with Cross-Attention | Feb 6, 2025 | Audio Synthesis, Face Swapping | —Unverified | 0 |
| Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator | Jun 5, 2022 | Audio Synthesis, Disentanglement | —Unverified | 0 |
| SING: Symbol-to-Instrument Neural Generator | Oct 23, 2018 | Audio Synthesis, Decoder | Code Available | 0 |
| Adversarial Generation of Time-Frequency Features with application in audio synthesis | Feb 11, 2019 | Audio Generation, Audio Synthesis | Code Available | 0 |
| Music Source Separation in the Waveform Domain | Nov 27, 2019 | Audio Generation, Audio Synthesis | Code Available | 0 |
| Introducing Latent Timbre Synthesis | May 31, 2020 | Audio Synthesis | Code Available | 0 |
| From Words to Sound: Neural Audio Synthesis of Guitar Sounds with Timbral Descriptors | Sep 17, 2022 | Audio Synthesis | Code Available | 0 |
| A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models | Oct 29, 2019 | Audio Synthesis | Code Available | 0 |
| GANSynth: Adversarial Neural Audio Synthesis | Feb 23, 2019 | Audio Generation, Audio Synthesis | Code Available | 0 |
| Deep Voice: Real-time Neural Text-to-Speech | Feb 25, 2017 | Audio Synthesis, Boundary Detection | Code Available | 0 |
| WaveGlow: A Flow-based Generative Network for Speech Synthesis | Oct 31, 2018 | Audio Synthesis, GPU | Code Available | 0 |