SOTAVerified

Audio Synthesis

Papers

Showing 2650 of 127 papers

TitleStatusHype
PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models0
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend ConditioningCode1
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack0
Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis0
Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP0
Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music0
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound0
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis0
LiteFocus: Accelerated Diffusion Inference for Long Audio SynthesisCode1
Taming Data and Transformers for Audio GenerationCode2
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis0
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems0
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-SynthesisCode2
AudioLCM: Text-to-Audio Generation with Latent Consistency ModelsCode5
Creative Text-to-Audio Generation via Synthesizer Programming0
Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting0
Differentiable All-pole Filters for Time-varying Audio SystemsCode2
Diffusion-TS: Interpretable Diffusion for General Time Series GenerationCode3
Text2Data: Low-Resource Data Generation with Textual Control0
DiffMoog: a Differentiable Modular Synthesizer for Sound MatchingCode2
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound SynthesisCode1
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head ModelsCode2
Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions0
Show:102550
← PrevPage 2 of 6Next →

No leaderboard results yet.