Revisiting IPA-based Cross-lingual Text-to-speech Oct 14, 2021 text-to-speech Text to Speech
— Unverified 00 Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages Jan 24, 2024 Voice Cloning
— Unverified 00 Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Feb 18, 2019 Speech Synthesis Voice Cloning
— Unverified 00 Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis
— Unverified 00 SoK: How Robust is Audio Watermarking in Generative AI models? Mar 24, 2025 Voice Cloning
— Unverified 00 Speech Watermarking with Discrete Intermediate Representations Dec 18, 2024 Voice Cloning
— Unverified 00 Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech Jun 11, 2024 speech-recognition Speech Recognition
— Unverified 00 kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech Aug 20, 2024 Retrieval Self-Supervised Learning
— Unverified 00 Steganography Beyond Space-Time with Chain of Multimodal AI Feb 25, 2025 Face Swapping Text Generation
— Unverified 00 Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System Nov 1, 2022 Face Generation Speech Synthesis
— Unverified 00 The AS-NU System for the M2VoC Challenge Apr 7, 2021 Voice Cloning
— Unverified 00 The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings Oct 31, 2024 Voice Cloning
— Unverified 00 The Multi-speaker Multi-style Voice Cloning Challenge 2021 Apr 5, 2021 Benchmarking Voice Cloning
— Unverified 00 Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance Oct 23, 2023 Face Swapping Voice Cloning
— Unverified 00 Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement Jan 15, 2025 Computational Efficiency CPU
— Unverified 00 Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Jul 19, 2021 Data Augmentation Decoder
— Unverified 00 TRAVID: An End-to-End Video Translation Framework Sep 20, 2023 Translation Voice Cloning
— Unverified 00 Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE Jun 6, 2022 Representation Learning Speech Representation Learning
— Unverified 00 V2C: Visual Voice Cloning Nov 25, 2021 Voice Cloning
— Unverified 00 Voice Adaptation for Swiss German May 28, 2025 Voice Cloning
— Unverified 00 VoiceCloak: A Multi-Dimensional Defense Framework against Unauthorized Diffusion-based Voice Cloning May 18, 2025 Representation Learning Voice Cloning
— Unverified 00 Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 00 Voice Cloning: Comprehensive Survey May 1, 2025 Survey Voice Cloning
— Unverified 00 VoiceMark: Zero-Shot Voice Cloning-Resistant Watermarking Approach Leveraging Speaker-Specific Latents May 27, 2025 Voice Cloning
— Unverified 00 Xiaomingbot: A Multilingual Robot News Reporter Jul 12, 2020 Articles News Generation
— Unverified 00 Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention Jan 25, 2022 Form Speech Synthesis
— Unverified 00 Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems Jul 18, 2024 Speech-to-Speech Translation Voice Cloning
— Unverified 00 Voice Cloning for Dysarthric Speech Synthesis: Addressing Data Scarcity in Speech-Language Pathology Mar 3, 2025 Speech Synthesis Voice Cloning
— Unverified 00 Adapting TTS models For New Speakers using Transfer Learning Oct 12, 2021 text-to-speech Text to Speech
— Unverified 00 Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Jan 22, 2024 Speaker Verification Speech Synthesis
— Unverified 00 Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset Dec 25, 2024 text-to-speech Text to Speech
— Unverified 00 Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language Aug 19, 2024 Transfer Learning Voice Cloning
— Unverified 00 AI based Presentation Creator With Customized Audio Content Delivery Jun 27, 2021 Generative Adversarial Network Voice Cloning
— Unverified 00 Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems Oct 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge Jun 22, 2024 Speech Synthesis text-to-speech
— Unverified 00 Augmentation through Laundering Attacks for Audio Spoof Detection Oct 1, 2024 Data Augmentation Face Swapping
— Unverified 00 Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection May 22, 2025 DeepFake Detection Face Swapping
— Unverified 00 Can DeepFake Speech be Reliably Detected? Oct 9, 2024 Face Swapping Misinformation
— Unverified 00 CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning May 25, 2025 text-to-speech Text to Speech
— Unverified 00 Collaborative Watermarking for Adversarial Speech Synthesis Sep 26, 2023 Speaker Verification Speech Synthesis
— Unverified 00 Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers Nov 26, 2019 Speech Synthesis text-to-speech
— Unverified 00 CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge Mar 8, 2021 Voice Cloning
— Unverified 00 Data Efficient Voice Cloning for Neural Singing Synthesis Feb 19, 2019 text-to-speech Text to Speech
— Unverified 00 De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks Jul 3, 2025 Voice Cloning
— Unverified 00 Deepfake Technology Unveiled: The Commoditization of AI and Its Impact on Digital Trust Jan 24, 2025 Face Swapping Misinformation
— Unverified 00 DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis Oct 14, 2024 Denoising Speaker Verification
— Unverified 00 DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing Jun 13, 2024 Language Modeling Language Modelling
— Unverified 00 Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis Apr 10, 2025 Speech Synthesis text-to-speech
— Unverified 00 Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space Sep 19, 2024 Automatic Speech Recognition Data Augmentation
— Unverified 00 Evaluating Voice Conversion-based Privacy Protection against Informed Attackers Nov 10, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00