SOTAVerified

Audio Synthesis

Papers

Showing 150 of 127 papers

TitleStatusHype
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio SynthesisCode7
AudioLCM: Text-to-Audio Generation with Latent Consistency ModelsCode5
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesisCode4
Diffusion-TS: Interpretable Diffusion for General Time Series GenerationCode3
BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingCode3
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified FlowsCode2
Taming Data and Transformers for Audio GenerationCode2
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-SynthesisCode2
Differentiable All-pole Filters for Time-varying Audio SystemsCode2
DiffMoog: a Differentiable Modular Synthesizer for Sound MatchingCode2
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head ModelsCode2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion ModelsCode2
RAVE: A variational autoencoder for fast and high-quality neural audio synthesisCode2
DDSP: Differentiable Digital Signal ProcessingCode2
Efficient Neural Audio SynthesisCode2
Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and PlatesCode1
Generative diffusion model with inverse renormalization group flowsCode1
Where are we in audio deepfake detection? A systematic analysis over generative and detection modelsCode1
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend ConditioningCode1
LiteFocus: Accelerated Diffusion Inference for Long Audio SynthesisCode1
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound SynthesisCode1
Continuous descriptor-based control for deep audio synthesisCode1
Perceptual-Neural-Physical Sound MatchingCode1
Score-based Generative Modeling Secretly Minimizes the Wasserstein DistanceCode1
Realistic Gramophone Noise Synthesis using a Diffusion ModelCode1
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical ModelingCode1
Differentiable Wavetable SynthesisCode1
Neural Waveshaping SynthesisCode1
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series ImputationCode1
Fre-GAN: Adversarial Frequency-consistent Audio SynthesisCode1
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive CodingCode1
Points2Sound: From mono to binaural audio using 3D point cloud scenesCode1
Real-time Timbre Transfer and Sound Synthesis using DDSPCode1
Upsampling artifacts in neural audio synthesisCode1
DiffWave: A Versatile Diffusion Model for Audio SynthesisCode1
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial NetworksCode1
SpeedySpeech: Efficient Neural Speech SynthesisCode1
Audeo: Audio Generation for a Silent Performance VideoCode1
Generative Modelling for Controllable Audio Synthesis of Expressive Piano PerformanceCode1
VaPar Synth -- A Variational Parametric Model for Audio SynthesisCode1
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence ModelingCode1
Adversarial Audio SynthesisCode1
Neural Audio Synthesis of Musical Notes with WaveNet AutoencodersCode1
Tacotron: Towards End-to-End Speech SynthesisCode1
MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling0
Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance0
Diffusion-Based Symbolic Regression0
SpecMaskFoley: Steering Pretrained Spectral Masked Generative Transformer Toward Synchronized Video-to-audio Synthesis via ControlNet0
Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism0
DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.