Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices Jun 11, 2024 Ethics Fairness
— Unverified 0XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model Jun 7, 2024 text-to-speech Text to Speech
Code Code Available 1Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis Jun 6, 2024 Decoder Inductive Bias
Code Code Available 2Non-autoregressive real-time Accent Conversion model with voice cloning May 21, 2024 Speech Enhancement speech-recognition
— Unverified 0PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset May 14, 2024 DeepFake Detection Face Swapping
Code Code Available 0StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing Feb 20, 2024 Voice Cloning
Code Code Available 2MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech Feb 14, 2024 Decoder GPU
— Unverified 0Proactive Detection of Voice Cloning with Localized Watermarking Jan 30, 2024 Voice Cloning
Code Code Available 4Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages Jan 24, 2024 Voice Cloning
— Unverified 0Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Jan 22, 2024 Speaker Verification Speech Synthesis
— Unverified 0OpenVoice: Versatile Instant Voice Cloning Dec 3, 2023 Rhythm Voice Cloning
Code Code Available 7MemoryCompanion: A Smart Healthcare Solution to Empower Efficient Alzheimer's Care Via Unleashing Generative AI Nov 20, 2023 Chatbot Prompt Engineering
— Unverified 0Learning Through AI-Clones: Enhancing Self-Perception and Presentation Performance Oct 23, 2023 Face Swapping Voice Cloning
— Unverified 0High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models Sep 27, 2023 All Speech Synthesis
— Unverified 0Collaborative Watermarking for Adversarial Speech Synthesis Sep 26, 2023 Speaker Verification Speech Synthesis
— Unverified 0TRAVID: An End-to-End Video Translation Framework Sep 20, 2023 Translation Voice Cloning
— Unverified 0Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion Aug 24, 2023 Audio Classification Binary Classification
— Unverified 0Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques Aug 5, 2023 Quantization Speaker anonymization
Code Code Available 1Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features Jul 15, 2023 Voice Cloning
Code Code Available 1Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis Jul 14, 2023 In-Context Learning Language Modelling
— Unverified 0Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert Apr 18, 2023 Audio Generation Expressive Speech Synthesis
Code Code Available 4ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech Nov 7, 2022 Representation Learning Speech Representation Learning
Code Code Available 6Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System Nov 1, 2022 Face Generation Speech Synthesis
— Unverified 0Low-Resource Multilingual and Zero-Shot Multispeaker TTS Oct 21, 2022 Meta-Learning text-to-speech
Code Code Available 0Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis Oct 14, 2022 Speech Synthesis Voice Cloning
Code Code Available 0Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS) Jul 4, 2022 Speech Synthesis text-to-speech
— Unverified 0Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE Jun 6, 2022 Representation Learning Speech Representation Learning
— Unverified 0Dictionary Attacks on Speaker Verification Apr 24, 2022 Speaker Verification Voice Cloning
Code Code Available 0Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis
— Unverified 0Improve few-shot voice cloning using multi-modal learning Mar 18, 2022 text-to-speech Text to Speech
— Unverified 0Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention Jan 25, 2022 Form Speech Synthesis
— Unverified 0V2C: Visual Voice Cloning Nov 25, 2021 Voice Cloning
— Unverified 0Meta-Voice: Fast few-shot style transfer for expressive voice cloning using meta learning Nov 14, 2021 Disentanglement Meta-Learning
— Unverified 0SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Nov 6, 2021 Disentanglement Speaker Verification
Code Code Available 0Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech Oct 14, 2021 Disentanglement text-to-speech
— Unverified 0Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data Oct 14, 2021 text-to-speech Text to Speech
— Unverified 0Revisiting IPA-based Cross-lingual Text-to-speech Oct 14, 2021 text-to-speech Text to Speech
— Unverified 0Discovery of Single Independent Latent Variable Oct 12, 2021 Image Generation Voice Cloning
Code Code Available 0Adapting TTS models For New Speakers using Transfer Learning Oct 12, 2021 text-to-speech Text to Speech
— Unverified 0Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Jul 19, 2021 Data Augmentation Decoder
— Unverified 0AI based Presentation Creator With Customized Audio Content Delivery Jun 27, 2021 Generative Adversarial Network Voice Cloning
— Unverified 0Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text Jun 26, 2021 Talking Face Generation Talking Head Generation
Code Code Available 1Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Jun 25, 2021 Quantization Speaker anonymization
— Unverified 0Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss Apr 22, 2021 Voice Cloning Voice Conversion
Code Code Available 1The AS-NU System for the M2VoC Challenge Apr 7, 2021 Voice Cloning
— Unverified 0The Multi-speaker Multi-style Voice Cloning Challenge 2021 Apr 5, 2021 Benchmarking Voice Cloning
— Unverified 0CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge Mar 8, 2021 Voice Cloning
— Unverified 0Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech Mar 6, 2021 text-to-speech Text to Speech
Code Code Available 0Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Expressive Neural Voice Cloning Jan 30, 2021 Speech Synthesis Style Transfer
— Unverified 0