An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era Oct 6, 2022 Speech Synthesis text-to-speech
A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality Apr 5, 2022 Benchmarking Self-Supervised Learning
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks Oct 26, 2022 Image Captioning Language Modeling
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions Jun 4, 2025 Data Augmentation Diversity
完全基於類神經網路之語音合成系統初步研究 (A Preliminary Study on Fully Neural Network-based Speech Synthesis System) [In Chinese] Nov 1, 2017 Speech Synthesis
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis Sep 22, 2023 Denoising Speech Synthesis
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis Oct 17, 2024 Speech Synthesis text-to-speech
A Non-autoregressive Model for Joint STT and TTS Jan 15, 2025 Automatic Speech Recognition speech-recognition
Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System Oct 5, 2024 Adversarial Purification Speech Synthesis
Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback Jun 2, 2024 Speech Synthesis text-to-speech
Duration Modeling by Multi-Models based on Vowel Production characteristics Dec 1, 2014 Speech Synthesis Text-To-Speech Synthesis
Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design Feb 6, 2023 Drug Discovery Learning Theory
DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction Mar 1, 2023 Dynamic Time Warping Metric Learning
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech Jun 25, 2023 Speech Synthesis text-to-speech
Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR Oct 16, 2024 Denoising Speech Synthesis
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis Jun 3, 2021 Speaker Verification Speech Synthesis
DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis May 14, 2025 Audio Generation Audio Synthesis
Do Prosody Transfer Models Transfer Prosody? Mar 7, 2023 Speech Synthesis text-to-speech
On Error Propagation of Diffusion Models Aug 9, 2023 Denoising Image Generation
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection Dec 2, 2019 Speech Synthesis text-to-speech
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models May 28, 2025 Speech Synthesis
DNN Filter Bank Cepstral Coefficients for Spoofing Detection Feb 13, 2017 Speaker Verification Speech Synthesis
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis Jan 16, 2024 Denoising Emotional Speech Synthesis
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling
An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
Advancing Speech Synthesis using EEG Apr 9, 2020 EEG Electroencephalogram (EEG)
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment Oct 28, 2019 Hard Attention Speech Synthesis
Effect of data reduction on sequence-to-sequence neural TTS Nov 15, 2018 Speech Synthesis
Efficient Generative Modeling with Residual Vector Quantization-Based Tokens Dec 13, 2024 Conditional Image Generation Image Generation
DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus May 1, 2020 Speech Synthesis
DNN-based Speech Synthesis for Indian Languages from ASCII text Aug 18, 2016 Speech Synthesis text-to-speech
DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis Jul 19, 2019 Speech Synthesis
Bayesian Subspace HMM for the Zerospeech 2020 Challenge May 19, 2020 Speech Synthesis
結合ANN、全域變異數與真實軌跡挑選之基週軌跡產生方法 (A Pitch-contour Generation Method Combining ANN Prediction, Global Variance Matching, and Real-contour Selection) [In Chinese] Oct 1, 2015 Speech Synthesis
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis Oct 14, 2024 Denoising Speaker Verification
ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks May 27, 2019 Domain Adaptation Generative Adversarial Network
Eigenresiduals for improved Parametric Speech Synthesis Jan 2, 2020 Speech Synthesis
An Initial study on Birdsong Re-synthesis Using Neural Vocoders Sep 21, 2022 Resynthesis Speech Synthesis
Enhancing Kurdish Text-to-Speech with Native Corpus Training: A High-Quality WaveGlow Vocoder Approach Sep 10, 2024 Speech Synthesis text-to-speech
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Jan 22, 2024 Speaker Verification Speech Synthesis
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM Feb 24, 2025 Automatic Speech Recognition Language Modeling
EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis Sep 27, 2024 Speech Synthesis
A Challenge Set and Methods for Noun-Verb Ambiguity Oct 1, 2018 Speech Synthesis text-to-speech
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition Oct 26, 2020 Emotion Recognition Speech Emotion Recognition
Energy-Based Models For Speech Synthesis Oct 19, 2023 Speech Synthesis
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model Jun 17, 2021 Emotional Speech Synthesis Emotion Classification
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System Jun 26, 2018 Emotional Speech Synthesis Parameter Prediction
Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis Dec 1, 2014 Speech Synthesis text-to-speech
Enhancing audio quality for expressive Neural Text-to-Speech Aug 13, 2021 Acoustic Modelling Speech Synthesis