SOTAVerified

Audio Synthesis

Papers

Showing 1-50 of 127 papers

Title | Status | Hype
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis | Code | 7
AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Code | 5
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Code | 4
Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Code | 3
BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Code | 3
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis | Code | 2
DiffMoog: a Differentiable Modular Synthesizer for Sound Matching | Code | 2
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models | Code | 2
RAVE: A variational autoencoder for fast and high-quality neural audio synthesis | Code | 2
Taming Data and Transformers for Audio Generation | Code | 2
Efficient Neural Audio Synthesis | Code | 2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Code | 2
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows | Code | 2
Differentiable All-pole Filters for Time-varying Audio Systems | Code | 2
DDSP: Differentiable Digital Signal Processing | Code | 2
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Code | 1
Tacotron: Towards End-to-End Speech Synthesis | Code | 1
Upsampling artifacts in neural audio synthesis | Code | 1
VaPar Synth -- A Variational Parametric Model for Audio Synthesis | Code | 1
Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance | Code | 1
SpeedySpeech: Efficient Neural Speech Synthesis | Code | 1
Neural Waveshaping Synthesis | Code | 1
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders | Code | 1
Continuous descriptor-based control for deep audio synthesis | Code | 1
Audeo: Audio Generation for a Silent Performance Video | Code | 1
Where are we in audio deepfake detection? A systematic analysis over generative and detection models | Code | 1
Perceptual-Neural-Physical Sound Matching | Code | 1
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation | Code | 1
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks | Code | 1
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling | Code | 1
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling | Code | 1
Points2Sound: From mono to binaural audio using 3D point cloud scenes | Code | 1
Realistic Gramophone Noise Synthesis using a Diffusion Model | Code | 1
Generative diffusion model with inverse renormalization group flows | Code | 1
Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance | Code | 1
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning | Code | 1
Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates | Code | 1
Adversarial Audio Synthesis | Code | 1
Differentiable Wavetable Synthesis | Code | 1
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis | Code | 1
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding | Code | 1
LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis | Code | 1
Real-time Timbre Transfer and Sound Synthesis using DDSP | Code | 1
DiffWave: A Versatile Diffusion Model for Audio Synthesis | Code | 1
SING: Symbol-to-Instrument Neural Generator | Code | 0
Adversarial Generation of Time-Frequency Features with application in audio synthesis | Code | 0
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models | Code | 0
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional Autoencoders | Code | 0
From Words to Sound: Neural Audio Synthesis of Guitar Sounds with Timbral Descriptors | Code | 0
Deep Voice: Real-time Neural Text-to-Speech | Code | 0

No leaderboard results yet.