
Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 26-50 of 112 papers

| Title | Status | Hype |
| --- | --- | --- |
| Is Audio Spoof Detection Robust to Laundering Attacks? | Code | 0 |
| Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech | Code | 0 |
| SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development | Code | 0 |
| Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis | Code | 0 |
| SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines | Code | 0 |
| Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes | Code | 0 |
| PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset | Code | 0 |
| Neural Voice Cloning with a Few Samples | Code | 0 |
| ClonEval: An Open Voice Cloning Benchmark | Code | 0 |
| Low-Resource Multilingual and Zero-Shot Multispeaker TTS | Code | 0 |
| Discovery of Single Independent Latent Variable | Code | 0 |
| Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis | | 0 |
| Can DeepFake Speech be Reliably Detected? | | 0 |
| Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language | | 0 |
| DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing | | 0 |
| DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis | | 0 |
| Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection | | 0 |
| Latent linguistic embedding for cross-lingual text-to-speech and voice conversion | | 0 |
| Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust | | 0 |
| Augmentation through Laundering Attacks for Audio Spoof Detection | | 0 |
| Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset | | 0 |
| Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices | | 0 |
| "It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services | | 0 |
| De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks | | 0 |
| Data Efficient Voice Cloning for Neural Singing Synthesis | | 0 |

No leaderboard results yet.