SOTAVerified

Audio Synthesis

Papers

Showing 150 of 127 papers

TitleStatusHype
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio SynthesisCode7
AudioLCM: Text-to-Audio Generation with Latent Consistency ModelsCode5
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesisCode4
Diffusion-TS: Interpretable Diffusion for General Time Series GenerationCode3
BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingCode3
DiffMoog: a Differentiable Modular Synthesizer for Sound MatchingCode2
Efficient Neural Audio SynthesisCode2
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head ModelsCode2
DDSP: Differentiable Digital Signal ProcessingCode2
Taming Data and Transformers for Audio GenerationCode2
Differentiable All-pole Filters for Time-varying Audio SystemsCode2
RAVE: A variational autoencoder for fast and high-quality neural audio synthesisCode2
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified FlowsCode2
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-SynthesisCode2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion ModelsCode2
SpeedySpeech: Efficient Neural Speech SynthesisCode1
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series ImputationCode1
Where are we in audio deepfake detection? A systematic analysis over generative and detection modelsCode1
Score-based Generative Modeling Secretly Minimizes the Wasserstein DistanceCode1
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound SynthesisCode1
Neural Audio Synthesis of Musical Notes with WaveNet AutoencodersCode1
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical ModelingCode1
Realistic Gramophone Noise Synthesis using a Diffusion ModelCode1
Continuous descriptor-based control for deep audio synthesisCode1
Audeo: Audio Generation for a Silent Performance VideoCode1
Real-time Timbre Transfer and Sound Synthesis using DDSPCode1
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive CodingCode1
Neural Waveshaping SynthesisCode1
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial NetworksCode1
Tacotron: Towards End-to-End Speech SynthesisCode1
LiteFocus: Accelerated Diffusion Inference for Long Audio SynthesisCode1
Differentiable Wavetable SynthesisCode1
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence ModelingCode1
Points2Sound: From mono to binaural audio using 3D point cloud scenesCode1
Upsampling artifacts in neural audio synthesisCode1
Generative Modelling for Controllable Audio Synthesis of Expressive Piano PerformanceCode1
Generative diffusion model with inverse renormalization group flowsCode1
Perceptual-Neural-Physical Sound MatchingCode1
VaPar Synth -- A Variational Parametric Model for Audio SynthesisCode1
Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and PlatesCode1
Adversarial Audio SynthesisCode1
Fre-GAN: Adversarial Frequency-consistent Audio SynthesisCode1
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend ConditioningCode1
DiffWave: A Versatile Diffusion Model for Audio SynthesisCode1
Adversarial Generation of Time-Frequency Features with application in audio synthesisCode0
SING: Symbol-to-Instrument Neural GeneratorCode0
Music Source Separation in the Waveform DomainCode0
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative ModelsCode0
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional AutoencodersCode0
Introducing Latent Timbre SynthesisCode0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.