SOTAVerified

Audio Synthesis

Papers

Showing 51100 of 127 papers

TitleStatusHype
Unified speech and gesture synthesis using flow matching0
Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer0
DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing0
SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias0
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion ModelsCode2
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models0
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesisCode4
Continuous descriptor-based control for deep audio synthesisCode1
ECGAN: Self-supervised generative adversarial network for electrocardiography0
Perceptual-Neural-Physical Sound MatchingCode1
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect0
Score-based Generative Modeling Secretly Minimizes the Wasserstein DistanceCode1
Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound0
Full-band General Audio Synthesis with Score-based Diffusion0
Anisotropic multiresolution analyses for deepfake detection0
From Words to Sound: Neural Audio Synthesis of Guitar Sounds with Timbral DescriptorsCode0
Evaluating generative audio systems and their metrics0
Convergence of denoising diffusion models under the manifold hypothesis0
Adversarial Audio Synthesis with Complex-valued Polynomial Networks0
Realistic Gramophone Noise Synthesis using a Diffusion ModelCode1
BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingCode3
Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator0
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio SynthesisCode0
Streamable Neural Audio Synthesis With Non-Causal Convolutions0
The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge0
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical ModelingCode1
Differentiable Wavetable SynthesisCode1
Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations0
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional AutoencodersCode0
RAVE: A variational autoencoder for fast and high-quality neural audio synthesisCode2
Estimating High Order Gradients of the Data Distribution by Denoising0
Synthesising Audio Adversarial Examples for Automatic Speech Recognition0
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing0
Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition0
Neural Waveshaping SynthesisCode1
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series ImputationCode1
A Generative Model for Raw Audio Using Transformer Architectures0
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis0
Fre-GAN: Adversarial Frequency-consistent Audio SynthesisCode1
DPLM: A Deep Perceptual Spatial-Audio Localization Metric0
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive CodingCode1
Points2Sound: From mono to binaural audio using 3D point cloud scenesCode1
On tuning consistent annealed sampling for denoising score matching0
Signal Representations for Synthesizing Audio Textures with Generative Adversarial Networks0
Real-time Timbre Transfer and Sound Synthesis using DDSPCode1
Anyone GAN Sing0
Upsampling artifacts in neural audio synthesisCode1
A Novel Face-tracking Mouth Controller and its Application to Interacting with Bioacoustic Models0
DiffWave: A Versatile Diffusion Model for Audio SynthesisCode1
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial NetworksCode1
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.