Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis Feb 6, 2020 Disentanglement Speech Synthesis
— Unverified 00 Fully Unsupervised Training of Few-shot Keyword Spotting Oct 6, 2022 Keyword Spotting Metric Learning
— Unverified 00 Empirical Analysis of Oral and Nasal Vowels of Konkani May 17, 2023 Speech Synthesis
— Unverified 00 Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis Dec 1, 2014 Speech Synthesis text-to-speech
— Unverified 00 GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis Jun 29, 2021 Speech Synthesis text-to-speech
— Unverified 00 GANtron: Emotional Speech Synthesis with Generative Adversarial Networks Oct 6, 2021 Emotional Speech Synthesis Speech Synthesis
— Unverified 00 Building Text-to-Speech Systems for Resource Poor Languages May 1, 2012 Clustering Speech Synthesis
— Unverified 00 Applying Automated Machine Translation to Educational Video Courses Jan 9, 2023 Machine Translation Speech Synthesis
— Unverified 00 Gender Bias in Instruction-Guided Speech Synthesis Models Feb 8, 2025 Expressive Speech Synthesis Speech Synthesis
— Unverified 00 Generacion de voces artificiales infantiles en castellano con acento costarricense Feb 2, 2021 Speech Synthesis
— Unverified 00 A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis Apr 7, 2018 Speech Synthesis
— Unverified 00 EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System Jun 26, 2018 Emotional Speech Synthesis Parameter Prediction
— Unverified 00 EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model Jun 17, 2021 Emotional Speech Synthesis Emotion Classification
— Unverified 00 Building Synthetic Voices in the META-NET Framework May 1, 2012 Speech Synthesis Voice Conversion
— Unverified 00 Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition Oct 26, 2020 Emotion Recognition Speech Emotion Recognition
— Unverified 00 Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments Jun 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture Apr 12, 2022 Speech Synthesis
— Unverified 00 EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis Sep 27, 2024 Speech Synthesis
— Unverified 00 Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech May 1, 2018 Automatic Speech Recognition (ASR) Speech Recognition
— Unverified 00 Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization Sep 16, 2024 Emotional Speech Synthesis In-Context Learning
— Unverified 00 Building A User-Centric and Content-Driven Socialbot May 6, 2020 Articles Management
— Unverified 00 Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS Aug 3, 2023 Denoising Speech Synthesis
— Unverified 00 Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis May 1, 2012 Audio-Visual Speech Recognition Speech Recognition
— Unverified 00 Eigenresiduals for improved Parametric Speech Synthesis Jan 2, 2020 Speech Synthesis
— Unverified 00 ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks May 27, 2019 Domain Adaptation Generative Adversarial Network
— Unverified 00 Building and using language resources and infrastructure to develop e-learning programs for a minority language May 1, 2017 Language Acquisition Speech Synthesis
— Unverified 00 基於字元階層之語音合成用文脈訊息擷取 (Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese] Dec 1, 2016 Feature Engineering Speech Synthesis
— Unverified 00 Efficient training strategies for natural sounding speech synthesis and speaker adaptation based on FastPitch Oct 9, 2024 Speech Synthesis text-to-speech
— Unverified 00 Efficient neural speech synthesis for low-resource languages through multilingual modeling Aug 20, 2020 Speech Synthesis
— Unverified 00 BUCEADOR, a multi-language search engine for digital libraries May 1, 2012 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models Nov 17, 2022 Speech Synthesis text-to-speech
— Unverified 00 Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise Apr 28, 2020 Clustering Data Augmentation
— Unverified 00 A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities Sep 1, 2015 Speech Synthesis
— Unverified 00 A Bengali HMM Based Speech Synthesis System Jun 16, 2014 Speech Synthesis text-to-speech
— Unverified 00 Voice Cloning for Dysarthric Speech Synthesis: Addressing Data Scarcity in Speech-Language Pathology Mar 3, 2025 Speech Synthesis Voice Cloning
— Unverified 00 Efficient Incremental Text-to-Speech on GPUs Nov 25, 2022 GPU Speech Synthesis
— Unverified 00 Efficient Generative Modeling with Residual Vector Quantization-Based Tokens Dec 13, 2024 Conditional Image Generation Image Generation
— Unverified 00 Effect of data reduction on sequence-to-sequence neural TTS Nov 15, 2018 Speech Synthesis
— Unverified 00 BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0 Dec 21, 2023 Speech Synthesis Transfer Learning
— Unverified 00 An Overview of BPPT's Indonesian Language Resources Dec 1, 2016 Machine Translation speech-recognition
— Unverified 00 Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment Oct 28, 2019 Hard Attention Speech Synthesis
— Unverified 00 Boosting Large Language Model for Speech Synthesis: An Empirical Study Dec 30, 2023 Language Modeling Language Modelling
— Unverified 00 Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion Jun 9, 2023 Denoising Speech Synthesis
— Unverified 00 An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era Oct 6, 2022 Speech Synthesis text-to-speech
— Unverified 00 ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis Jan 16, 2024 Denoising Emotional Speech Synthesis
— Unverified 00 Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks Oct 26, 2022 Image Captioning Language Modeling
— Unverified 00 Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection Dec 2, 2019 Speech Synthesis text-to-speech
— Unverified 00 BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models May 28, 2025 Speech Synthesis
— Unverified 00 A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions Jun 4, 2025 Data Augmentation Diversity
— Unverified 00 DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis Sep 22, 2023 Denoising Speech Synthesis
— Unverified 00