SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 4150 of 112 papers

TitleStatusHype
Is Audio Spoof Detection Robust to Laundering Attacks?Code0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech0
Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language0
WavLM model ensemble for audio deepfake detectionCode0
Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems0
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic TokensCode11
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMsCode11
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge0
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing0
Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech0
Show:102550
← PrevPage 5 of 12Next →

No leaderboard results yet.