SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 5160 of 112 papers

TitleStatusHype
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices0
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech ModelCode1
Small-E: Small Language Model with Linear Attention for Efficient Speech SynthesisCode2
Non-autoregressive real-time Accent Conversion model with voice cloning0
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake DatasetCode0
StyleDubber: Towards Multi-Scale Style Learning for Movie DubbingCode2
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech0
Proactive Detection of Voice Cloning with Localized WatermarkingCode4
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages0
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis0
Show:102550
← PrevPage 6 of 12Next →

No leaderboard results yet.