| Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator | Jun 5, 2022 | Audio SynthesisDisentanglement | —Unverified | 0 | 0 |
| TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis | Apr 8, 2025 | Audio SynthesisFAD | —Unverified | 0 | 0 |
| Text2Data: Low-Resource Data Generation with Textual Control | Feb 8, 2024 | Audio SynthesisTime Series | —Unverified | 0 | 0 |
| The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge | Mar 3, 2022 | Audio Deepfake DetectionAudio Synthesis | —Unverified | 0 | 0 |
| Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations | Nov 14, 2021 | Audio Synthesis | —Unverified | 0 | 0 |
| Transferring neural speech waveform synthesizers to musical instrument sounds generation | Oct 27, 2019 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control | Dec 29, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |
| Unified speech and gesture synthesis using flow matching | Oct 8, 2023 | Audio SynthesisMotion Synthesis | —Unverified | 0 | 0 |
| Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition | Jul 23, 2021 | Audio Synthesisspeech-recognition | —Unverified | 0 | 0 |
| Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound | Aug 21, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 | 0 |