| AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Jun 1, 2024 | Audio GenerationAudio Synthesis | CodeCode Available | 5 |
| Creative Text-to-Audio Generation via Synthesizer Programming | Jun 1, 2024 | Audio GenerationAudio Synthesis | —Unverified | 0 |
| Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting | May 30, 2024 | Audio SynthesisRepresentation Learning | —Unverified | 0 |
| Differentiable All-pole Filters for Time-varying Audio Systems | Apr 11, 2024 | AllAudio Effects Modeling | CodeCode Available | 2 |
| Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Mar 4, 2024 | Audio SynthesisDecoder | CodeCode Available | 3 |
| Text2Data: Low-Resource Data Generation with Textual Control | Feb 8, 2024 | Audio SynthesisTime Series | —Unverified | 0 |
| DiffMoog: a Differentiable Modular Synthesizer for Sound Matching | Jan 23, 2024 | Audio Synthesis | CodeCode Available | 2 |
| T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Jan 17, 2024 | Audio GenerationAudio Synthesis | CodeCode Available | 1 |
| FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models | Dec 13, 2023 | 3D Face AnimationAudio Synthesis | CodeCode Available | 2 |
| Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions | Oct 21, 2023 | Audio SynthesisGenerative Adversarial Network | —Unverified | 0 |