SOTAVerified

Audio Synthesis

Papers

Showing 5175 of 127 papers

TitleStatusHype
Adversarial Generation of Time-Frequency Features with application in audio synthesisCode0
GANSynth: Adversarial Neural Audio SynthesisCode0
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio SynthesisCode0
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative ModelsCode0
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional AutoencodersCode0
Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer0
Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance0
Streamable Neural Audio Synthesis With Non-Causal Convolutions0
Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis0
Synthesising Audio Adversarial Examples for Automatic Speech Recognition0
Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator0
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis0
Text2Data: Low-Resource Data Generation with Textual Control0
The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge0
Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations0
Transferring neural speech waveform synthesizers to musical instrument sounds generation0
Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control0
Unified speech and gesture synthesis using flow matching0
Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition0
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound0
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space0
XAttnMark: Learning Robust Audio Watermarking with Cross-Attention0
Zero-Shot Mono-to-Binaural Speech Synthesis0
Adversarial Audio Synthesis with Complex-valued Polynomial Networks0
A Generative Model for Raw Audio Using Transformer Architectures0
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.