SOTAVerified

Audio Synthesis

Papers

Showing 5175 of 127 papers

TitleStatusHype
Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis0
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis0
Designing Neural Synthesizers for Low-Latency Interaction0
Long-Video Audio Synthesis with Multi-Agent Collaboration0
Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision0
XAttnMark: Learning Robust Audio Watermarking with Cross-Attention0
Customized Condition Controllable Generation for Video Soundtrack0
Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control0
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder0
Zero-Shot Mono-to-Binaural Speech Synthesis0
Generalized Diffusion Model with Adjusted Offset Noise0
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space0
Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement0
Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals0
PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models0
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack0
Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis0
Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP0
Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music0
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound0
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis0
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis0
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems0
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.