Can DeepFake Speech be Reliably Detected? Oct 9, 2024 Face Swapping Misinformation
— Unverified 0Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems Oct 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Augmentation through Laundering Attacks for Audio Spoof Detection Oct 1, 2024 Data Augmentation Face Swapping
— Unverified 0Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space Sep 19, 2024 Automatic Speech Recognition Data Augmentation
— Unverified 0Multi-modal Adversarial Training for Zero-Shot Voice Cloning Aug 28, 2024 Decoder text-to-speech
— Unverified 0Is Audio Spoof Detection Robust to Laundering Attacks? Aug 27, 2024 Voice Cloning
Code Code Available 0kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech Aug 20, 2024 Retrieval Self-Supervised Learning
— Unverified 0Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language Aug 19, 2024 Transfer Learning Voice Cloning
— Unverified 0WavLM model ensemble for audio deepfake detection Aug 14, 2024 Audio Deepfake Detection Data Augmentation
Code Code Available 0Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems Jul 18, 2024 Speech-to-Speech Translation Voice Cloning
— Unverified 0A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge Jun 22, 2024 Speech Synthesis text-to-speech
— Unverified 0DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing Jun 13, 2024 Language Modeling Language Modelling
— Unverified 0Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech Jun 11, 2024 speech-recognition Speech Recognition
— Unverified 0Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Jun 11, 2024 Ethics Fairness
— Unverified 0Non-autoregressive real-time Accent Conversion model with voice cloning May 21, 2024 Speech Enhancement speech-recognition
— Unverified 0PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset May 14, 2024 DeepFake Detection Face Swapping
Code Code Available 0MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech Feb 14, 2024 Decoder GPU
— Unverified 0Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages Jan 24, 2024 Voice Cloning
— Unverified 0Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Jan 22, 2024 Speaker Verification Speech Synthesis
— Unverified 0MemoryCompanion: A Smart Healthcare Solution to Empower Efficient Alzheimer's Care Via Unleashing Generative AI Nov 20, 2023 Chatbot Prompt Engineering
— Unverified 0Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance Oct 23, 2023 Face Swapping Voice Cloning
— Unverified 0High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models Sep 27, 2023 All Speech Synthesis
— Unverified 0Collaborative Watermarking for Adversarial Speech Synthesis Sep 26, 2023 Speaker Verification Speech Synthesis
— Unverified 0TRAVID: An End-to-End Video Translation Framework Sep 20, 2023 Translation Voice Cloning
— Unverified 0Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion Aug 24, 2023 Audio Classification Binary Classification
— Unverified 0Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis Jul 14, 2023 In-Context Learning Language Modelling
— Unverified 0Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System Nov 1, 2022 Face Generation Speech Synthesis
— Unverified 0Low-Resource Multilingual and Zero-Shot Multispeaker TTS Oct 21, 2022 Meta-Learning text-to-speech
Code Code Available 0Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis Oct 14, 2022 Speech Synthesis Voice Cloning
Code Code Available 0Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS) Jul 4, 2022 Speech Synthesis text-to-speech
— Unverified 0Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE Jun 6, 2022 Representation Learning Speech Representation Learning
— Unverified 0Dictionary Attacks on Speaker Verification Apr 24, 2022 Speaker Verification Voice Cloning
Code Code Available 0Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis
— Unverified 0Improve few-shot voice cloning using multi-modal learning Mar 18, 2022 text-to-speech Text to Speech
— Unverified 0Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention Jan 25, 2022 Form Speech Synthesis
— Unverified 0V2C: Visual Voice Cloning Nov 25, 2021 Voice Cloning
— Unverified 0Meta-Voice: Fast few-shot style transfer for expressive voice cloning using meta learning Nov 14, 2021 Disentanglement Meta-Learning
— Unverified 0SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Nov 6, 2021 Disentanglement Speaker Verification
Code Code Available 0Revisiting IPA-based Cross-lingual Text-to-speech Oct 14, 2021 text-to-speech Text to Speech
— Unverified 0Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data Oct 14, 2021 text-to-speech Text to Speech
— Unverified 0Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech Oct 14, 2021 Disentanglement text-to-speech
— Unverified 0Adapting TTS models For New Speakers using Transfer Learning Oct 12, 2021 text-to-speech Text to Speech
— Unverified 0Discovery of Single Independent Latent Variable Oct 12, 2021 Image Generation Voice Cloning
Code Code Available 0Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Jul 19, 2021 Data Augmentation Decoder
— Unverified 0AI based Presentation Creator With Customized Audio Content Delivery Jun 27, 2021 Generative Adversarial Network Voice Cloning
— Unverified 0Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Jun 25, 2021 Quantization Speaker anonymization
— Unverified 0The AS-NU System for the M2VoC Challenge Apr 7, 2021 Voice Cloning
— Unverified 0The Multi-speaker Multi-style Voice Cloning Challenge 2021 Apr 5, 2021 Benchmarking Voice Cloning
— Unverified 0CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge Mar 8, 2021 Voice Cloning
— Unverified 0Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech Mar 6, 2021 text-to-speech Text to Speech
Code Code Available 0