DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis Oct 17, 2024 Speech Synthesis text-to-speech
A Non-autoregressive Model for Joint STT and TTS Jan 15, 2025 Automatic Speech Recognition speech-recognition
Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System Oct 5, 2024 Adversarial Purification Speech Synthesis
A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality Apr 5, 2022 Benchmarking Self-Supervised Learning
Duration Modeling by Multi-Models based on Vowel Production characteristics Dec 1, 2014 Speech Synthesis Text-To-Speech Synthesis
Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design Feb 6, 2023 Drug Discovery Learning Theory
DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction Mar 1, 2023 Dynamic Time Warping Metric Learning
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech Jun 25, 2023 Speech Synthesis text-to-speech
Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR Oct 16, 2024 Denoising Speech Synthesis
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis Jun 3, 2021 Speaker Verification Speech Synthesis
DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis May 14, 2025 Audio Generation Audio Synthesis
Do Prosody Transfer Models Transfer Prosody? Mar 7, 2023 Speech Synthesis text-to-speech
On Error Propagation of Diffusion Models Aug 9, 2023 Denoising Image Generation
DNN Filter Bank Cepstral Coefficients for Spoofing Detection Feb 13, 2017 Speaker Verification Speech Synthesis
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling
An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
Advancing Speech Synthesis using EEG Apr 9, 2020 EEG Electroencephalogram (EEG)
DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus May 1, 2020 Speech Synthesis
DNN-based Speech Synthesis for Indian Languages from ASCII text Aug 18, 2016 Speech Synthesis text-to-speech
DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis Jul 19, 2019 Speech Synthesis
Bayesian Subspace HMM for the Zerospeech 2020 Challenge May 19, 2020 Speech Synthesis
結合ANN、全域變異數與真實軌跡挑選之基週軌跡產生方法 (A Pitch-contour Generation Method Combining ANN Prediction, Global Variance Matching, and Real-contour Selection) [In Chinese] Oct 1, 2015 Speech Synthesis
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis Oct 14, 2024 Denoising Speaker Verification
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM Feb 24, 2025 Automatic Speech Recognition Language Modeling
An Initial study on Birdsong Re-synthesis Using Neural Vocoders Sep 21, 2022 Resynthesis Speech Synthesis
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Jan 22, 2024 Speaker Verification Speech Synthesis
A Challenge Set and Methods for Noun-Verb Ambiguity Oct 1, 2018 Speech Synthesis text-to-speech
完全基於類神經網路之語音合成系統初步研究 (A Preliminary Study on Fully Neural Network-based Speech Synthesis System) [In Chinese] Nov 1, 2017 Speech Synthesis
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization Oct 30, 2018 Data Augmentation Disentanglement
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS Oct 9, 2024 Diversity Speech Synthesis
Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention Oct 15, 2021 Simultaneous Speech-to-Speech Translation Speech Synthesis
BAD: An Assistant tool for making verses in Basque Apr 1, 2012 Speech Synthesis Text-To-Speech Synthesis
An In-depth Analysis of the Effect of Text Normalization in Social Media May 1, 2015 Dependency Parsing named-entity-recognition
A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis Oct 6, 2015 Speech Synthesis Vocal Bursts Intensity Prediction
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions Feb 18, 2025 Speech Synthesis
An explainability study of the constant Q cepstral coefficient spoofing countermeasure for automatic speaker verification Apr 19, 2020 Speaker Verification Speech Synthesis
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters Jun 19, 2021 Speech Synthesis text-to-speech
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis Jun 15, 2023 Denoising Speech Synthesis
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech Apr 3, 2021 Denoising GPU
An Expert System for Automatic Reading of A Text Written in Standard Arabic May 8, 2014 Speech Synthesis text-to-speech
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs Jan 28, 2022 Denoising Speech Synthesis
AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling Mar 21, 2022 Decoder Speech Synthesis
A Variational EM Method for Pole-Zero Modeling of Speech with Mixed Block Sparse and Gaussian Excitation Jun 24, 2017 speech-recognition Speech Recognition
DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models Feb 27, 2025 Diversity Language Modeling
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech May 26, 2025 Attribute Emotional Speech Synthesis
AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis Apr 14, 2025 RAG Retrieval-augmented Generation
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis Dec 8, 2023 Benchmarking Quantization
A distributed cloud-based dialog system for conversational application development Sep 1, 2015 Speech Recognition Speech Synthesis
Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network Oct 13, 2016 Decoder Speech Enhancement