SOTAVerified

Voice Cloning

Voice cloning is a highly desired feature for personalized speech interfaces. Neural voice cloning system learns to synthesize a person’s voice from only a few audio samples.

Papers

Showing 5175 of 112 papers

TitleStatusHype
Can DeepFake Speech be Reliably Detected?0
Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems0
Augmentation through Laundering Attacks for Audio Spoof Detection0
Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space0
Multi-modal Adversarial Training for Zero-Shot Voice Cloning0
Is Audio Spoof Detection Robust to Laundering Attacks?Code0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech0
Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language0
WavLM model ensemble for audio deepfake detectionCode0
Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems0
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge0
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing0
Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech0
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices0
Non-autoregressive real-time Accent Conversion model with voice cloning0
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake DatasetCode0
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech0
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages0
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis0
MemoryCompanion: A Smart Healthcare Solution to Empower Efficient Alzheimer's Care Via Unleashing Generative AI0
Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance0
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models0
Collaborative Watermarking for Adversarial Speech Synthesis0
TRAVID: An End-to-End Video Translation Framework0
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion0
Show:102550
← PrevPage 3 of 5Next →

No leaderboard results yet.