SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. A neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 21–30 of 112 papers

Title | Status | Hype
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation | Code | 3
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction | Code | 7
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System | Code | 11
Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust | | 0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement | | 0
MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model | | 0
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset | | 0
Speech Watermarking with Discrete Intermediate Representations | | 0
Parallel Stacked Aggregated Network for Voice Authentication in IoT-Enabled Smart Devices | | 0
Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset | | 0

No leaderboard results yet.