| Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music | Aug 22, 2024 | Audio Synthesis | —Unverified | 0 |
| Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound | Aug 21, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos | Jul 30, 2024 | Audio SynthesisVideo Summarization | —Unverified | 0 |
| Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2 | Jul 19, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis | Jul 15, 2024 | Audio SynthesisDecoder | —Unverified | 0 |
| LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis | Jul 15, 2024 | Audio GenerationAudio Synthesis | CodeCode Available | 1 |
| Taming Data and Transformers for Audio Generation | Jun 27, 2024 | Audio captioningAudio Generation | CodeCode Available | 2 |
| AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Jun 13, 2024 | Audio SynthesisNeRF | —Unverified | 0 |
| CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems | Jun 11, 2024 | Audio SynthesisFace Swapping | —Unverified | 0 |
| Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis | Jun 7, 2024 | Audio Synthesis | CodeCode Available | 2 |