SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 101112 of 112 papers

TitleStatusHype
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech0
Expressive Neural Voice Cloning0
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models0
Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset0
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data0
Improve few-shot voice cloning using multi-modal learning0
"It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services0
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices0
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion0
MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model0
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis0
MemoryCompanion: A Smart Healthcare Solution to Empower Efficient Alzheimer's Care Via Unleashing Generative AI0
Show:102550
← PrevPage 3 of 3Next →

No leaderboard results yet.