SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 11–20 of 112 papers

Title | Status | Hype
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning | Code | 3
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis | Code | 2
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Code | 2
Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Code | 2
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing | Code | 2
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation | Code | 1
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model | Code | 1
Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques | Code | 1
Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features | Code | 1
Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text | Code | 1

No leaderboard results yet.