| VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Nov 22, 2024 | Audio SynthesisDecoder | —Unverified | 0 | 0 |
| XAttnMark: Learning Robust Audio Watermarking with Cross-Attention | Feb 6, 2025 | Audio SynthesisFace Swapping | —Unverified | 0 | 0 |
| Zero-Shot Mono-to-Binaural Speech Synthesis | Dec 11, 2024 | Audio SynthesisDenoising | —Unverified | 0 | 0 |
| Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Jun 14, 2022 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| A Generative Model for Raw Audio Using Transformer Architectures | Jun 30, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Anisotropic multiresolution analyses for deepfake detection | Oct 26, 2022 | Audio SynthesisDeepFake Detection | —Unverified | 0 | 0 |
| Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement | Oct 22, 2024 | Audio SynthesisDiversity | —Unverified | 0 | 0 |
| A Novel Face-tracking Mouth Controller and its Application to Interacting with Bioacoustic Models | Oct 7, 2020 | Audio Synthesis | —Unverified | 0 | 0 |
| Anyone GAN Sing | Feb 22, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals | Oct 8, 2024 | Audio Synthesis | —Unverified | 0 | 0 |