SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 51100 of 112 papers

TitleStatusHype
Revisiting IPA-based Cross-lingual Text-to-speech0
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages0
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks0
Self-supervised learning for robust voice cloning0
SoK: How Robust is Audio Watermarking in Generative AI models?0
Speech Watermarking with Discrete Intermediate Representations0
Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech0
Steganography Beyond Space-Time with Chain of Multimodal AI0
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System0
The AS-NU System for the M2VoC Challenge0
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings0
The Multi-speaker Multi-style Voice Cloning Challenge 20210
Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement0
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation0
TRAVID: An End-to-End Video Translation Framework0
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE0
V2C: Visual Voice Cloning0
Voice Adaptation for Swiss German0
VoiceCloak: A Multi-Dimensional Defense Framework against Unauthorized Diffusion-based Voice Cloning0
Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning0
Voice Cloning: Comprehensive Survey0
VoiceMark: Zero-Shot Voice Cloning-Resistant Watermarking Approach Leveraging Speaker-Specific Latents0
Xiaomingbot: A Multilingual Robot News Reporter0
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention0
Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems0
Voice Cloning for Dysarthric Speech Synthesis: Addressing Data Scarcity in Speech-Language Pathology0
Adapting TTS models For New Speakers using Transfer Learning0
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis0
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset0
Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language0
AI based Presentation Creator With Customized Audio Content Delivery0
Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems0
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge0
Augmentation through Laundering Attacks for Audio Spoof Detection0
Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection0
Can DeepFake Speech be Reliably Detected?0
CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning0
Collaborative Watermarking for Adversarial Speech Synthesis0
Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers0
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge0
Data Efficient Voice Cloning for Neural Singing Synthesis0
De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks0
Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust0
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis0
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing0
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis0
Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space0
Evaluating Voice Conversion-based Privacy Protection against Informed Attackers0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.