SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 1120 of 112 papers

TitleStatusHype
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song GenerationCode3
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesisCode2
Small-E: Small Language Model with Linear Attention for Efficient Speech SynthesisCode2
StyleDubber: Towards Multi-Scale Style Learning for Movie DubbingCode2
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion ControlCode2
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation GenerationCode1
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency LossCode1
One Model, Many Languages: Meta-learning for Multilingual Text-to-SpeechCode1
Anonymizing Speech: Evaluating and Designing Speaker Anonymization TechniquesCode1
Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned FeaturesCode1
Show:102550
← PrevPage 2 of 12Next →

No leaderboard results yet.