SOTAVerified

Audio Synthesis

Papers

Showing 1-50 of 127 papers

Title | Status | Hype
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis | Code | 7
AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Code | 5
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Code | 4
BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Code | 3
Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Code | 3
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis | Code | 2
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models | Code | 2
DDSP: Differentiable Digital Signal Processing | Code | 2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Code | 2
Efficient Neural Audio Synthesis | Code | 2
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Code | 2
DiffMoog: a Differentiable Modular Synthesizer for Sound Matching | Code | 2
Taming Data and Transformers for Audio Generation | Code | 2
Differentiable All-pole Filters for Time-varying Audio Systems | Code | 2
RAVE: A variational autoencoder for fast and high-quality neural audio synthesis | Code | 2
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks | Code | 1
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding | Code | 1
Real-time Timbre Transfer and Sound Synthesis using DDSP | Code | 1
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning | Code | 1
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Code | 1
Upsampling artifacts in neural audio synthesis | Code | 1
Realistic Gramophone Noise Synthesis using a Diffusion Model | Code | 1
Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance | Code | 1
Continuous descriptor-based control for deep audio synthesis | Code | 1
Audeo: Audio Generation for a Silent Performance Video | Code | 1
VaPar Synth -- A Variational Parametric Model for Audio Synthesis | Code | 1
DiffWave: A Versatile Diffusion Model for Audio Synthesis | Code | 1
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation | Code | 1
Where are we in audio deepfake detection? A systematic analysis over generative and detection models | Code | 1
Perceptual-Neural-Physical Sound Matching | Code | 1
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling | Code | 1
Points2Sound: From mono to binaural audio using 3D point cloud scenes | Code | 1
SpeedySpeech: Efficient Neural Speech Synthesis | Code | 1
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling | Code | 1
Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance | Code | 1
Generative diffusion model with inverse renormalization group flows | Code | 1
LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis | Code | 1
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders | Code | 1
Differentiable Wavetable Synthesis | Code | 1
Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates | Code | 1
Adversarial Audio Synthesis | Code | 1
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis | Code | 1
Neural Waveshaping Synthesis | Code | 1
Tacotron: Towards End-to-End Speech Synthesis | Code | 1
Customized Condition Controllable Generation for Video Soundtrack | | 0
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder | | 0
Autoencoding Neural Networks as Musical Audio Synthesizers | | 0
Creative Text-to-Audio Generation via Synthesizer Programming | | 0
Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement | | 0
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis | | 0

No leaderboard results yet.