SOTAVerified

Audio Synthesis

Papers

Showing 111120 of 127 papers

TitleStatusHype
Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control0
Unified speech and gesture synthesis using flow matching0
Using Deep Learning Techniques and Inferential Speech Statistics for AI Synthesised Speech Recognition0
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound0
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space0
XAttnMark: Learning Robust Audio Watermarking with Cross-Attention0
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis0
Text2Data: Low-Resource Data Generation with Textual Control0
SING: Symbol-to-Instrument Neural GeneratorCode0
GANSynth: Adversarial Neural Audio SynthesisCode0
Show:102550
← PrevPage 12 of 13Next →

No leaderboard results yet.