Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 76–100 of 112 papers

Title	Date	Tasks	Status
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis	Jul 14, 2023	In-Context Learning, Language Modelling	Unverified
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System	Nov 1, 2022	Face Generation, Speech Synthesis	Unverified
Low-Resource Multilingual and Zero-Shot Multispeaker TTS	Oct 21, 2022	Meta-Learning, text-to-speech	Unverified
Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis	Oct 14, 2022	Speech Synthesis, Voice Cloning	Code Available
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS)	Jul 4, 2022	Speech Synthesis, text-to-speech	Unverified
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE	Jun 6, 2022	Representation Learning, Speech Representation Learning	Unverified
Dictionary Attacks on Speaker Verification	Apr 24, 2022	Speaker Verification, Voice Cloning	Code Available
Self-supervised learning for robust voice cloning	Apr 7, 2022	Self-Supervised Learning, Speech Synthesis	Unverified
Improve few-shot voice cloning using multi-modal learning	Mar 18, 2022	text-to-speech, Text to Speech	Unverified
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention	Jan 25, 2022	Form, Speech Synthesis	Unverified
V2C: Visual Voice Cloning	Nov 25, 2021	Voice Cloning	Unverified
Meta-Voice: Fast few-shot style transfer for expressive voice cloning using meta learning	Nov 14, 2021	Disentanglement, Meta-Learning	Unverified
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines	Nov 6, 2021	Disentanglement, Speaker Verification	Code Available
Revisiting IPA-based Cross-lingual Text-to-speech	Oct 14, 2021	text-to-speech, Text to Speech	Unverified
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data	Oct 14, 2021	text-to-speech, Text to Speech	Unverified
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech	Oct 14, 2021	Disentanglement, text-to-speech	Unverified
Adapting TTS models For New Speakers using Transfer Learning	Oct 12, 2021	text-to-speech, Text to Speech	Unverified
Discovery of Single Independent Latent Variable	Oct 12, 2021	Image Generation, Voice Cloning	Code Available
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation	Jul 19, 2021	Data Augmentation, Decoder	Unverified
AI based Presentation Creator With Customized Audio Content Delivery	Jun 27, 2021	Generative Adversarial Network, Voice Cloning	Unverified
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance	Jun 25, 2021	Quantization, Speaker anonymization	Unverified
The AS-NU System for the M2VoC Challenge	Apr 7, 2021	Voice Cloning	Unverified
The Multi-speaker Multi-style Voice Cloning Challenge 2021	Apr 5, 2021	Benchmarking, Voice Cloning	Unverified
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge	Mar 8, 2021	Voice Cloning	Unverified
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech	Mar 6, 2021	text-to-speech, Text to Speech	Unverified
