SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 7180 of 112 papers

TitleStatusHype
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained HubertCode4
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-SpeechCode6
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System0
Low-Resource Multilingual and Zero-Shot Multispeaker TTS0
Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech SynthesisCode0
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS)0
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE0
Dictionary Attacks on Speaker VerificationCode0
Self-supervised learning for robust voice cloning0
Improve few-shot voice cloning using multi-modal learning0
Show:102550
← PrevPage 8 of 12Next →

No leaderboard results yet.