Audio Synthesis

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 127 papers

Title	Date	Tasks	Status	Hype	Score
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis	Dec 19, 2024	Audio GenerationAudio Synthesis	CodeCode Available	7	5
AudioLCM: Text-to-Audio Generation with Latent Consistency Models	Jun 1, 2024	Audio GenerationAudio Synthesis	CodeCode Available	5	5
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis	Jun 1, 2023	Audio SynthesisComputational Efficiency	CodeCode Available	4	5
Diffusion-TS: Interpretable Diffusion for General Time Series Generation	Mar 4, 2024	Audio SynthesisDecoder	CodeCode Available	3	5
BigVGAN: A Universal Neural Vocoder with Large-Scale Training	Jun 9, 2022	Audio GenerationAudio Synthesis	CodeCode Available	3	5
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis	Jun 7, 2024	Audio Synthesis	CodeCode Available	2	5
DiffMoog: a Differentiable Modular Synthesizer for Sound Matching	Jan 23, 2024	Audio Synthesis	CodeCode Available	2	5
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models	Dec 13, 2023	3D Face AnimationAudio Synthesis	CodeCode Available	2	5
RAVE: A variational autoencoder for fast and high-quality neural audio synthesis	Nov 9, 2021	Audio SynthesisCPU	CodeCode Available	2	5
Taming Data and Transformers for Audio Generation	Jun 27, 2024	Audio captioningAudio Generation	CodeCode Available	2	5
Efficient Neural Audio Synthesis	Feb 23, 2018	Audio SynthesisCPU	CodeCode Available	2	5
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models	Jun 29, 2023	Audio Synthesis	CodeCode Available	2	5
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows	Dec 2, 2024	Audio SynthesisImage Generation	CodeCode Available	2	5
Differentiable All-pole Filters for Time-varying Audio Systems	Apr 11, 2024	AllAudio Effects Modeling	CodeCode Available	2	5
DDSP: Differentiable Digital Signal Processing	Jan 14, 2020	Audio GenerationAudio Synthesis	CodeCode Available	2	5
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis	Jan 17, 2024	Audio GenerationAudio Synthesis	CodeCode Available	1	5
Tacotron: Towards End-to-End Speech Synthesis	Mar 29, 2017	Audio SynthesisSpeech Synthesis	CodeCode Available	1	5
Upsampling artifacts in neural audio synthesis	Oct 27, 2020	Audio Signal ProcessingAudio Synthesis	CodeCode Available	1	5
VaPar Synth -- A Variational Parametric Model for Audio Synthesis	Mar 30, 2020	Audio Synthesis	CodeCode Available	1	5
Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance	Dec 13, 2022	Audio SynthesisImage Generation	CodeCode Available	1	5
SpeedySpeech: Efficient Neural Speech Synthesis	Aug 9, 2020	Audio SynthesisCPU	CodeCode Available	1	5
Neural Waveshaping Synthesis	Jul 11, 2021	Audio GenerationAudio Synthesis	CodeCode Available	1	5
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders	Apr 5, 2017	Audio SynthesisDecoder	CodeCode Available	1	5
Continuous descriptor-based control for deep audio synthesis	Feb 27, 2023	Audio Synthesiscontinuous-control	CodeCode Available	1	5
Audeo: Audio Generation for a Silent Performance Video	Jun 23, 2020	Audio GenerationAudio Synthesis	CodeCode Available	1	5
Where are we in audio deepfake detection? A systematic analysis over generative and detection models	Oct 6, 2024	Audio Deepfake DetectionAudio Synthesis	CodeCode Available	1	5
Perceptual-Neural-Physical Sound Matching	Jan 7, 2023	AttributeAudio Synthesis	CodeCode Available	1	5
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation	Jul 7, 2021	Audio SynthesisImage Generation	CodeCode Available	1	5
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks	Aug 27, 2020	Audio SynthesisGenerative Adversarial Network	CodeCode Available	1	5
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling	Dec 17, 2021	Audio Synthesis	CodeCode Available	1	5
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling	Mar 4, 2018	Audio SynthesisLanguage Modelling	CodeCode Available	1	5
Points2Sound: From mono to binaural audio using 3D point cloud scenes	Apr 26, 2021	Audio Synthesis	CodeCode Available	1	5
Realistic Gramophone Noise Synthesis using a Diffusion Model	Jun 13, 2022	Audio Synthesis	CodeCode Available	1	5
Generative diffusion model with inverse renormalization group flows	Jan 15, 2025	Audio SynthesisDenoising	CodeCode Available	1	5
Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance	Jun 16, 2020	Audio Synthesis	CodeCode Available	1	5
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning	Sep 19, 2024	Audio Synthesis	CodeCode Available	1	5
Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates	May 9, 2025	Audio SynthesisCPU	CodeCode Available	1	5
Adversarial Audio Synthesis	Feb 12, 2018	Audio GenerationAudio Synthesis	CodeCode Available	1	5
Differentiable Wavetable Synthesis	Nov 19, 2021	Audio SynthesisOne-Shot Learning	CodeCode Available	1	5
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis	Jun 4, 2021	Audio Synthesis	CodeCode Available	1	5
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding	May 4, 2021	Audio Synthesis	CodeCode Available	1	5
LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis	Jul 15, 2024	Audio GenerationAudio Synthesis	CodeCode Available	1	5
Real-time Timbre Transfer and Sound Synthesis using DDSP	Mar 12, 2021	Audio Synthesis	CodeCode Available	1	5
DiffWave: A Versatile Diffusion Model for Audio Synthesis	Sep 21, 2020	Audio SynthesisDiversity	CodeCode Available	1	5
SING: Symbol-to-Instrument Neural Generator	Oct 23, 2018	Audio SynthesisDecoder	CodeCode Available	0	5
Adversarial Generation of Time-Frequency Features with application in audio synthesis	Feb 11, 2019	Audio GenerationAudio Synthesis	CodeCode Available	0	5
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models	Oct 29, 2019	Audio Synthesis	CodeCode Available	0	5
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders	Nov 9, 2021	Audio SynthesisMixed Reality	CodeCode Available	0	5
From Words to Sound: Neural Audio Synthesis of Guitar Sounds with Timbral Descriptors	Sep 17, 2022	Audio Synthesis	CodeCode Available	0	5
Deep Voice: Real-time Neural Text-to-Speech	Feb 25, 2017	Audio SynthesisBoundary Detection	CodeCode Available	0	5

Show:10 25 50

← PrevPage 1 of 3Next →

No leaderboard results yet.