SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 76100 of 112 papers

TitleStatusHype
Voice Adaptation for Swiss German0
VoiceCloak: A Multi-Dimensional Defense Framework against Unauthorized Diffusion-based Voice Cloning0
Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning0
Voice Cloning: Comprehensive Survey0
VoiceMark: Zero-Shot Voice Cloning-Resistant Watermarking Approach Leveraging Speaker-Specific Latents0
Xiaomingbot: A Multilingual Robot News Reporter0
a novel cross-lingual voice cloning approach with a few text-free samples0
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance0
Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems0
Pronunciation Deviation Analysis Through Voice Cloning and Acoustic Comparison0
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion0
Revisiting IPA-based Cross-lingual Text-to-speech0
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages0
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks0
Self-supervised learning for robust voice cloning0
SoK: How Robust is Audio Watermarking in Generative AI models?0
Speech Watermarking with Discrete Intermediate Representations0
Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech0
Steganography Beyond Space-Time with Chain of Multimodal AI0
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System0
The AS-NU System for the M2VoC Challenge0
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings0
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake DatasetCode0
Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech SynthesisCode0
Show:102550
← PrevPage 4 of 5Next →

No leaderboard results yet.