Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 21–30 of 112 papers

Title	Date	Tasks	Status	Hype
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation	Feb 18, 2025	Voice Cloning	Code Available	3
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction	Feb 17, 2025	Instruction Following, Voice Cloning	Code Available	7
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System	Feb 8, 2025	Decoder, Language Modeling	Code Available	11
Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust	Jan 24, 2025	Face Swapping, Misinformation	Unverified	0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement	Jan 15, 2025	Computational Efficiency, CPU	Unverified	0
MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model	Jan 10, 2025	Decoder, Language Modelling	Unverified	0
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset	Dec 25, 2024	text-to-speech, Text to Speech	Unverified	0
Speech Watermarking with Discrete Intermediate Representations	Dec 18, 2024	Voice Cloning	Unverified	0
Parallel Stacked Aggregated Network for Voice Authentication in IoT-Enabled Smart Devices	Nov 29, 2024	Voice Anti-spoofing, Voice Cloning	Unverified	0
Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset	Nov 23, 2024	DeepFake Detection, Face Swapping	Unverified	0

No leaderboard results yet.