SOTAVerified|Agents Browse Leaderboard About

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 112 papers

Title	Date	Tasks	Status	Hype
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices	Jun 11, 2024	EthicsFairness	—Unverified	0
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model	Jun 7, 2024	text-to-speechText to Speech	CodeCode Available	1
Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis	Jun 6, 2024	DecoderInductive Bias	CodeCode Available	2
Non-autoregressive real-time Accent Conversion model with voice cloning	May 21, 2024	Speech Enhancementspeech-recognition	—Unverified	0
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset	May 14, 2024	DeepFake DetectionFace Swapping	CodeCode Available	0
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing	Feb 20, 2024	Voice Cloning	CodeCode Available	2
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech	Feb 14, 2024	DecoderGPU	—Unverified	0
Proactive Detection of Voice Cloning with Localized Watermarking	Jan 30, 2024	Voice Cloning	CodeCode Available	4
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages	Jan 24, 2024	Voice Cloning	—Unverified	0
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis	Jan 22, 2024	Speaker VerificationSpeech Synthesis	—Unverified	0
OpenVoice: Versatile Instant Voice Cloning	Dec 3, 2023	RhythmVoice Cloning	CodeCode Available	7
MemoryCompanion: A Smart Healthcare Solution to Empower Efficient Alzheimer's Care Via Unleashing Generative AI	Nov 20, 2023	ChatbotPrompt Engineering	—Unverified	0
Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance	Oct 23, 2023	Face SwappingVoice Cloning	—Unverified	0
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models	Sep 27, 2023	AllSpeech Synthesis	—Unverified	0
Collaborative Watermarking for Adversarial Speech Synthesis	Sep 26, 2023	Speaker VerificationSpeech Synthesis	—Unverified	0
TRAVID: An End-to-End Video Translation Framework	Sep 20, 2023	TranslationVoice Cloning	—Unverified	0
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion	Aug 24, 2023	Audio ClassificationBinary Classification	—Unverified	0
Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques	Aug 5, 2023	QuantizationSpeaker anonymization	CodeCode Available	1
Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features	Jul 15, 2023	Voice Cloning	CodeCode Available	1
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis	Jul 14, 2023	In-Context LearningLanguage Modelling	—Unverified	0
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert	Apr 18, 2023	Audio GenerationExpressive Speech Synthesis	CodeCode Available	4
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech	Nov 7, 2022	Representation LearningSpeech Representation Learning	CodeCode Available	6
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System	Nov 1, 2022	Face GenerationSpeech Synthesis	—Unverified	0
Low-Resource Multilingual and Zero-Shot Multispeaker TTS	Oct 21, 2022	Meta-Learningtext-to-speech	CodeCode Available	0
Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis	Oct 14, 2022	Speech SynthesisVoice Cloning	CodeCode Available	0

Show:10 25 50

← PrevPage 3 of 5Next →

No leaderboard results yet.