SOTAVerified

Audio Synthesis

Papers

Showing 51100 of 127 papers

TitleStatusHype
Adversarial Generation of Time-Frequency Features with application in audio synthesisCode0
GANSynth: Adversarial Neural Audio SynthesisCode0
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio SynthesisCode0
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative ModelsCode0
CAESynth: Real-Time Timbre Interpolation and Pitch Control with Conditional AutoencodersCode0
Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer0
Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance0
Streamable Neural Audio Synthesis With Non-Causal Convolutions0
Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis0
Synthesising Audio Adversarial Examples for Automatic Speech Recognition0
Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator0
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis0
Text2Data: Low-Resource Data Generation with Textual Control0
The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge0
Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations0
Transferring neural speech waveform synthesizers to musical instrument sounds generation0
Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control0
Unified speech and gesture synthesis using flow matching0
Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition0
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound0
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space0
XAttnMark: Learning Robust Audio Watermarking with Cross-Attention0
Zero-Shot Mono-to-Binaural Speech Synthesis0
Adversarial Audio Synthesis with Complex-valued Polynomial Networks0
A Generative Model for Raw Audio Using Transformer Architectures0
Anisotropic multiresolution analyses for deepfake detection0
Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement0
A Novel Face-tracking Mouth Controller and its Application to Interacting with Bioacoustic Models0
Anyone GAN Sing0
Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals0
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing0
Autoencoding Neural Networks as Musical Audio Synthesizers0
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models0
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems0
Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism0
Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound0
Convergence of denoising diffusion models under the manifold hypothesis0
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis0
Creative Text-to-Audio Generation via Synthesizer Programming0
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder0
Customized Condition Controllable Generation for Video Soundtrack0
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack0
DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing0
Deep generative models for musical audio synthesis0
Designing Neural Synthesizers for Low-Latency Interaction0
Diffusion-Based Symbolic Regression0
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect0
DPLM: A Deep Perceptual Spatial-Audio Localization Metric0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.