| Adversarial Generation of Time-Frequency Features with application in audio synthesis | Feb 11, 2019 | Audio GenerationAudio Synthesis | CodeCode Available | 0 | 5 |
| GANSynth: Adversarial Neural Audio Synthesis | Feb 23, 2019 | Audio GenerationAudio Synthesis | CodeCode Available | 0 | 5 |
| BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis | May 30, 2022 | Audio Synthesis | CodeCode Available | 0 | 5 |
| A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models | Oct 29, 2019 | Audio Synthesis | CodeCode Available | 0 | 5 |
| CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders | Nov 9, 2021 | Audio SynthesisMixed Reality | CodeCode Available | 0 | 5 |
| Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer | Sep 26, 2023 | Audio Synthesis | —Unverified | 0 | 0 |
| Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance | Jun 26, 2025 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Streamable Neural Audio Synthesis With Non-Causal Convolutions | Apr 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis | Apr 18, 2025 | Audio Synthesis | —Unverified | 0 | 0 |
| Synthesising Audio Adversarial Examples for Automatic Speech Recognition | Sep 29, 2021 | Audio SynthesisAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator | Jun 5, 2022 | Audio SynthesisDisentanglement | —Unverified | 0 | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio SynthesisFAD | —Unverified | 0 | 0 |
| Text2Data: Low-Resource Data Generation with Textual Control | Feb 8, 2024 | Audio SynthesisTime Series | —Unverified | 0 | 0 |
| The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge | Mar 3, 2022 | Audio Deepfake DetectionAudio Synthesis | —Unverified | 0 | 0 |
| Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations | Nov 14, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Transferring neural speech waveform synthesizers to musical instrument sounds generation | Oct 27, 2019 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control | Dec 29, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Unified speech and gesture synthesis using flow matching | Oct 8, 2023 | Audio SynthesisMotion Synthesis | —Unverified | 0 | 0 |
| Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition | Jul 23, 2021 | Audio Synthesisspeech-recognition | —Unverified | 0 | 0 |
| Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound | Aug 21, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Nov 22, 2024 | Audio SynthesisDecoder | —Unverified | 0 | 0 |
| XAttnMark: Learning Robust Audio Watermarking with Cross-Attention | Feb 6, 2025 | Audio SynthesisFace Swapping | —Unverified | 0 | 0 |
| Zero-Shot Mono-to-Binaural Speech Synthesis | Dec 11, 2024 | Audio SynthesisDenoising | —Unverified | 0 | 0 |
| Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Jun 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| A Generative Model for Raw Audio Using Transformer Architectures | Jun 30, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Anisotropic multiresolution analyses for deepfake detection | Oct 26, 2022 | Audio SynthesisDeepFake Detection | —Unverified | 0 | 0 |
| Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement | Oct 22, 2024 | Audio SynthesisDiversity | —Unverified | 0 | 0 |
| A Novel Face-tracking Mouth Controller and its Application to Interacting with Bioacoustic Models | Oct 7, 2020 | Audio Synthesis | —Unverified | 0 | 0 |
| Anyone GAN Sing | Feb 22, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals | Oct 8, 2024 | Audio Synthesis | —Unverified | 0 | 0 |
| A Survey on Audio Synthesis and Audio-Visual Multimodal Processing | Aug 1, 2021 | Audio SynthesisMusic Generation | —Unverified | 0 | 0 |
| Autoencoding Neural Networks as Musical Audio Synthesizers | Apr 27, 2020 | Audio SynthesisBIG-bench Machine Learning | —Unverified | 0 | 0 |
| AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Jun 13, 2024 | Audio SynthesisNeRF | —Unverified | 0 | 0 |
| Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2 | Jul 19, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models | Jun 16, 2023 | Audio Synthesis | —Unverified | 0 | 0 |
| CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems | Jun 11, 2024 | Audio SynthesisFace Swapping | —Unverified | 0 | 0 |
| Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism | May 20, 2025 | Audio SynthesisDenoising | —Unverified | 0 | 0 |
| Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound | Nov 16, 2022 | Audio Synthesis | —Unverified | 0 | 0 |
| Convergence of denoising diffusion models under the manifold hypothesis | Aug 10, 2022 | Audio SynthesisDenoising | —Unverified | 0 | 0 |
| CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis | Jun 14, 2021 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Creative Text-to-Audio Generation via Synthesizer Programming | Jun 1, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder | Dec 12, 2024 | Audio SynthesisSinging Voice Synthesis | —Unverified | 0 | 0 |
| Customized Condition Controllable Generation for Video Soundtrack | Jan 1, 2025 | Audio Synthesis | —Unverified | 0 | 0 |
| D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack | Sep 11, 2024 | Adversarial AttackAudio Synthesis | —Unverified | 0 | 0 |
| DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing | Sep 14, 2023 | AttributeAudio Synthesis | —Unverified | 0 | 0 |
| Deep generative models for musical audio synthesis | Jun 10, 2020 | Audio SynthesisDeep Learning | —Unverified | 0 | 0 |
| Designing Neural Synthesizers for Low-Latency Interaction | Mar 14, 2025 | Audio Synthesis | —Unverified | 0 | 0 |
| Diffusion-Based Symbolic Regression | May 30, 2025 | Audio SynthesisDenoising | —Unverified | 0 | 0 |
| DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect | Dec 14, 2022 | Audio Synthesis | —Unverified | 0 | 0 |
| DPLM: A Deep Perceptual Spatial-Audio Localization Metric | May 29, 2021 | Audio Synthesis | —Unverified | 0 | 0 |