SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 3140 of 112 papers

TitleStatusHype
Few-Shot Speech Deepfake Detection Adaptation with Gaussian ProcessesCode0
Low-Resource Multilingual and Zero-Shot Multispeaker TTSCode0
Is Audio Spoof Detection Robust to Laundering Attacks?Code0
ClonEval: An Open Voice Cloning BenchmarkCode0
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-SpeechCode0
SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM DevelopmentCode0
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis0
Can DeepFake Speech be Reliably Detected?0
Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language0
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing0
Show:102550
← PrevPage 4 of 12Next →

No leaderboard results yet.