SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 51-60 of 112 papers

Title | Status | Hype
Revisiting IPA-based Cross-lingual Text-to-speech | | 0
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages | | 0
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks | | 0
Self-supervised learning for robust voice cloning | | 0
SoK: How Robust is Audio Watermarking in Generative AI models? | | 0
Speech Watermarking with Discrete Intermediate Representations | | 0
Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech | | 0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech | | 0
Steganography Beyond Space-Time with Chain of Multimodal AI | | 0
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System | | 0

No leaderboard results yet.