Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements Apr 27, 2025 Generative Adversarial Network Speech Synthesis
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data Feb 29, 2024 Representation Learning Speech Synthesis
Generative Pre-training for Speech with Flow Matching Oct 25, 2023 Speech Enhancement Speech Synthesis
F0 Modeling In Hmm-Based Speech Synthesis System Using Deep Belief Network Feb 18, 2015 Clustering Speaker Verification
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization Oct 30, 2018 Data Augmentation Disentanglement
Face-StyleSpeech: Enhancing Zero-shot Speech Synthesis from Face Images with Improved Face-to-Speech Mapping Sep 25, 2023 Speech Synthesis text-to-speech
Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech Sep 24, 2024 Emotional Speech Synthesis Speech Synthesis
FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning Apr 22, 2025 Deep Learning Speaker Verification
FA-GAN: Artifacts-free and Phase-aware High-fidelity GAN-based Vocoder Jul 5, 2024 Generative Adversarial Network Speech Synthesis
Collaborative Watermarking for Adversarial Speech Synthesis Sep 26, 2023 Speaker Verification Speech Synthesis
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit Nov 1, 2021 Speech Synthesis text-to-speech
Fast and Accurate Decision Trees for Natural Language Processing Tasks Sep 1, 2017 Attribute BIG-bench Machine Learning
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS Oct 9, 2024 Diversity Speech Synthesis
Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages Feb 13, 2023 Speech Synthesis text-to-speech
Fast Bootstrapping of Grapheme to Phoneme System for Under-resourced Languages - Application to the Iban Language Oct 1, 2013 Speech Recognition Speech Synthesis
Fast, Compact, and High Quality LSTM-RNN Based Statistical Parametric Speech Synthesizers for Mobile Devices Jun 20, 2016 Quantization Speech Synthesis
BAD: An Assistant tool for making verses in Basque Apr 1, 2012 Speech Synthesis Text-To-Speech Synthesis
Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention Oct 15, 2021 Simultaneous Speech-to-Speech Translation Speech Synthesis
An In-depth Analysis of the Effect of Text Normalization in Social Media May 1, 2015 Dependency Parsing named-entity-recognition
Fast Labeling and Transcription with the Speechalyzer Toolkit May 1, 2012 Audio Classification Benchmarking
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters Jun 19, 2021 Speech Synthesis text-to-speech
Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks Aug 20, 2018 speech-recognition Speech Recognition
A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis Oct 6, 2015 Speech Synthesis Vocal Bursts Intensity Prediction
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions Feb 18, 2025 Speech Synthesis
An explainability study of the constant Q cepstral coefficient spoofing countermeasure for automatic speaker verification Apr 19, 2020 Speaker Verification Speech Synthesis
Accurate synthesis of Dysarthric Speech for ASR data augmentation Aug 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis Dec 14, 2020 Cultural Vocal Bursts Intensity Prediction Speech Synthesis
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis Jun 15, 2023 Denoising Speech Synthesis
Fine-grained Noise Control for Multispeaker Speech Synthesis Apr 11, 2022 Expressive Speech Synthesis Speech Synthesis
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech Apr 3, 2021 Denoising GPU
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis
Fitting New Speakers Based on a Short Untranscribed Sample Feb 20, 2018 Speech Synthesis text-to-speech
An Expert System for Automatic Reading of A Text Written in Standard Arabic May 8, 2014 Speech Synthesis text-to-speech
Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features Apr 8, 2021 Decoder Speech Synthesis
FleSpeech: Flexibly Controllable Speech Generation with Various Prompts Jan 8, 2025 Speech Synthesis
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis Dec 8, 2023 Benchmarking Quantization
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs Jan 28, 2022 Denoising Speech Synthesis
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis Jun 30, 2024 CPU Decoder
AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling Mar 21, 2022 Decoder Speech Synthesis
A Variational EM Method for Pole-Zero Modeling of Speech with Mixed Block Sparse and Gaussian Excitation Jun 24, 2017 speech-recognition Speech Recognition
DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models Feb 27, 2025 Diversity Language Modeling
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis Jul 18, 2018 Acoustic Modelling Decoder
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model Mar 6, 2023 Language Modeling Language Modelling
From 'Solved Problems' to New Challenges: A Report on LDC Activities May 1, 2018 Dialogue Management Language Identification
From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech Mar 21, 2025 Speech Synthesis
From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars Jun 16, 2025 GPU Speech Synthesis
From Speaker Identification to Affective Analysis: A Multi-Step System for Analyzing Children's Stories Apr 1, 2014 Age Estimation Speaker Identification
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech May 26, 2025 Attribute Emotional Speech Synthesis
Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement Aug 27, 2021 Audio Signal Processing Speech Enhancement
AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis Apr 14, 2025 RAG Retrieval-augmented Generation