Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset Dec 25, 2024 text-to-speech Text to Speech
— Unverified 0Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language Aug 19, 2024 Transfer Learning Voice Cloning
— Unverified 0AI based Presentation Creator With Customized Audio Content Delivery Jun 27, 2021 Generative Adversarial Network Voice Cloning
— Unverified 0Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems Oct 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge Jun 22, 2024 Speech Synthesis text-to-speech
— Unverified 0Augmentation through Laundering Attacks for Audio Spoof Detection Oct 1, 2024 Data Augmentation Face Swapping
— Unverified 0Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection May 22, 2025 DeepFake Detection Face Swapping
— Unverified 0Can DeepFake Speech be Reliably Detected? Oct 9, 2024 Face Swapping Misinformation
— Unverified 0CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning May 25, 2025 text-to-speech Text to Speech
— Unverified 0Collaborative Watermarking for Adversarial Speech Synthesis Sep 26, 2023 Speaker Verification Speech Synthesis
— Unverified 0Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers Nov 26, 2019 Speech Synthesis text-to-speech
— Unverified 0CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge Mar 8, 2021 Voice Cloning
— Unverified 0Data Efficient Voice Cloning for Neural Singing Synthesis Feb 19, 2019 text-to-speech Text to Speech
— Unverified 0De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks Jul 3, 2025 Voice Cloning
— Unverified 0Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust Jan 24, 2025 Face Swapping Misinformation
— Unverified 0DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis Oct 14, 2024 Denoising Speaker Verification
— Unverified 0DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing Jun 13, 2024 Language Modeling Language Modelling
— Unverified 0Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis Apr 10, 2025 Speech Synthesis text-to-speech
— Unverified 0The Multi-speaker Multi-style Voice Cloning Challenge 2021 Apr 5, 2021 Benchmarking Voice Cloning
— Unverified 0Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance Oct 23, 2023 Face Swapping Voice Cloning
— Unverified 0Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement Jan 15, 2025 Computational Efficiency CPU
— Unverified 0Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Jul 19, 2021 Data Augmentation Decoder
— Unverified 0TRAVID: An End-to-End Video Translation Framework Sep 20, 2023 Translation Voice Cloning
— Unverified 0Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE Jun 6, 2022 Representation Learning Speech Representation Learning
— Unverified 0V2C: Visual Voice Cloning Nov 25, 2021 Voice Cloning
— Unverified 0Voice Adaptation for Swiss German May 28, 2025 Voice Cloning
— Unverified 0VoiceCloak: A Multi-Dimensional Defense Framework against Unauthorized Diffusion-based Voice Cloning May 18, 2025 Representation Learning Voice Cloning
— Unverified 0Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Voice Cloning: Comprehensive Survey May 1, 2025 Survey Voice Cloning
— Unverified 0VoiceMark: Zero-Shot Voice Cloning-Resistant Watermarking Approach Leveraging Speaker-Specific Latents May 27, 2025 Voice Cloning
— Unverified 0Xiaomingbot: A Multilingual Robot News Reporter Jul 12, 2020 Articles News Generation
— Unverified 0a novel cross-lingual voice cloning approach with a few text-free samples Oct 29, 2019 Translation Voice Cloning
— Unverified 0Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Jun 25, 2021 Quantization Speaker anonymization
— Unverified 0Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems Jul 18, 2024 Speech-to-Speech Translation Voice Cloning
— Unverified 0Pronunciation Deviation Analysis Through Voice Cloning and Acoustic Comparison Jul 15, 2025 Voice Cloning
— Unverified 0Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion Aug 24, 2023 Audio Classification Binary Classification
— Unverified 0Revisiting IPA-based Cross-lingual Text-to-speech Oct 14, 2021 text-to-speech Text to Speech
— Unverified 0Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages Jan 24, 2024 Voice Cloning
— Unverified 0Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Feb 18, 2019 Speech Synthesis Voice Cloning
— Unverified 0Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis
— Unverified 0SoK: How Robust is Audio Watermarking in Generative AI models? Mar 24, 2025 Voice Cloning
— Unverified 0Speech Watermarking with Discrete Intermediate Representations Dec 18, 2024 Voice Cloning
— Unverified 0Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech Jun 11, 2024 speech-recognition Speech Recognition
— Unverified 0kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech Aug 20, 2024 Retrieval Self-Supervised Learning
— Unverified 0Steganography Beyond Space-Time with Chain of Multimodal AI Feb 25, 2025 Face Swapping Text Generation
— Unverified 0Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System Nov 1, 2022 Face Generation Speech Synthesis
— Unverified 0The AS-NU System for the M2VoC Challenge Apr 7, 2021 Voice Cloning
— Unverified 0The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings Oct 31, 2024 Voice Cloning
— Unverified 0PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset May 14, 2024 DeepFake Detection Face Swapping
Code Code Available 0Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis Oct 14, 2022 Speech Synthesis Voice Cloning
Code Code Available 0