| Introducing Latent Timbre Synthesis | May 31, 2020 | Audio Synthesis | Code Available | 0 | 5 |
| CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders | Nov 9, 2021 | Audio Synthesis, Mixed Reality | Code Available | 0 | 5 |
| WaveGlow: A Flow-based Generative Network for Speech Synthesis | Oct 31, 2018 | Audio Synthesis, GPU | Code Available | 0 | 5 |
| Real-valued parametric conditioning of an RNN for interactive sound synthesis | May 28, 2018 | Audio Synthesis | —Unverified | 0 | 0 |
| Signal Representations for Synthesizing Audio Textures with Generative Adversarial Networks | Mar 12, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias | Sep 14, 2023 | Audio Synthesis, Generative Adversarial Network | —Unverified | 0 | 0 |
| SpecMaskFoley: Steering Pretrained Spectral Masked Generative Transformer Toward Synchronized Video-to-audio Synthesis via ControlNet | May 22, 2025 | Audio Synthesis | —Unverified | 0 | 0 |
| Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer | Sep 26, 2023 | Audio Synthesis | —Unverified | 0 | 0 |
| Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance | Jun 26, 2025 | Audio Generation, Audio Synthesis | —Unverified | 0 | 0 |
| Streamable Neural Audio Synthesis With Non-Causal Convolutions | Apr 14, 2022 | Audio Generation, Audio Synthesis | —Unverified | 0 | 0 |
| Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis | Apr 18, 2025 | Audio Synthesis | —Unverified | 0 | 0 |
| Synthesising Audio Adversarial Examples for Automatic Speech Recognition | Sep 29, 2021 | Audio Synthesis, Automatic Speech Recognition | —Unverified | 0 | 0 |
| Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator | Jun 5, 2022 | Audio Synthesis, Disentanglement | —Unverified | 0 | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio Synthesis, FAD | —Unverified | 0 | 0 |
| Text2Data: Low-Resource Data Generation with Textual Control | Feb 8, 2024 | Audio Synthesis, Time Series | —Unverified | 0 | 0 |
| The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge | Mar 3, 2022 | Audio Deepfake Detection, Audio Synthesis | —Unverified | 0 | 0 |
| Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations | Nov 14, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Transferring neural speech waveform synthesizers to musical instrument sounds generation | Oct 27, 2019 | Audio Generation, Audio Synthesis | —Unverified | 0 | 0 |
| Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control | Dec 29, 2024 | Audio Generation, Audio Synthesis | —Unverified | 0 | 0 |
| Unified speech and gesture synthesis using flow matching | Oct 8, 2023 | Audio Synthesis, Motion Synthesis | —Unverified | 0 | 0 |
| Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition | Jul 23, 2021 | Audio Synthesis, speech-recognition | —Unverified | 0 | 0 |
| Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound | Aug 21, 2024 | Audio Generation, Audio Synthesis | —Unverified | 0 | 0 |
| VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Nov 22, 2024 | Audio Synthesis, Decoder | —Unverified | 0 | 0 |
| XAttnMark: Learning Robust Audio Watermarking with Cross-Attention | Feb 6, 2025 | Audio Synthesis, Face Swapping | —Unverified | 0 | 0 |
| Zero-Shot Mono-to-Binaural Speech Synthesis | Dec 11, 2024 | Audio Synthesis, Denoising | —Unverified | 0 | 0 |