SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 11–20 of 112 papers

Title | Status | Hype
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning | Code | 3
Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Code | 2
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing | Code | 2
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis | Code | 2
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Code | 2
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss | Code | 1
Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features | Code | 1
Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques | Code | 1
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation | Code | 1
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech | Code | 1

No leaderboard results yet.