| Adversarial Generation of Time-Frequency Features with application in audio synthesis | Feb 11, 2019 | Audio GenerationAudio Synthesis | CodeCode Available | 0 | 5 |
| GANSynth: Adversarial Neural Audio Synthesis | Feb 23, 2019 | Audio GenerationAudio Synthesis | CodeCode Available | 0 | 5 |
| BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis | May 30, 2022 | Audio Synthesis | CodeCode Available | 0 | 5 |
| A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models | Oct 29, 2019 | Audio Synthesis | CodeCode Available | 0 | 5 |
| CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders | Nov 9, 2021 | Audio SynthesisMixed Reality | CodeCode Available | 0 | 5 |
| Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer | Sep 26, 2023 | Audio Synthesis | —Unverified | 0 | 0 |
| Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance | Jun 26, 2025 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Streamable Neural Audio Synthesis With Non-Causal Convolutions | Apr 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis | Apr 18, 2025 | Audio Synthesis | —Unverified | 0 | 0 |
| Synthesising Audio Adversarial Examples for Automatic Speech Recognition | Sep 29, 2021 | Audio SynthesisAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator | Jun 5, 2022 | Audio SynthesisDisentanglement | —Unverified | 0 | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio SynthesisFAD | —Unverified | 0 | 0 |
| Text2Data: Low-Resource Data Generation with Textual Control | Feb 8, 2024 | Audio SynthesisTime Series | —Unverified | 0 | 0 |
| The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge | Mar 3, 2022 | Audio Deepfake DetectionAudio Synthesis | —Unverified | 0 | 0 |
| Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations | Nov 14, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Transferring neural speech waveform synthesizers to musical instrument sounds generation | Oct 27, 2019 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control | Dec 29, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Unified speech and gesture synthesis using flow matching | Oct 8, 2023 | Audio SynthesisMotion Synthesis | —Unverified | 0 | 0 |
| Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition | Jul 23, 2021 | Audio Synthesisspeech-recognition | —Unverified | 0 | 0 |
| Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound | Aug 21, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Nov 22, 2024 | Audio SynthesisDecoder | —Unverified | 0 | 0 |
| XAttnMark: Learning Robust Audio Watermarking with Cross-Attention | Feb 6, 2025 | Audio SynthesisFace Swapping | —Unverified | 0 | 0 |
| Zero-Shot Mono-to-Binaural Speech Synthesis | Dec 11, 2024 | Audio SynthesisDenoising | —Unverified | 0 | 0 |
| Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Jun 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| A Generative Model for Raw Audio Using Transformer Architectures | Jun 30, 2021 | Audio Synthesis | —Unverified | 0 | 0 |