Whispered and Lombard Neural Speech Synthesis Jan 13, 2021 Speaker Verification Speech Synthesis
Whither the Priors for (Vocal) Interactivity? Mar 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
WinkTalk: a demonstration of a multimodal speech synthesis platform linking facial expressions to expressive synthetic voices Jun 1, 2012 Speech Synthesis
WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis Jun 20, 2022 CPU Speech Synthesis
Word-Level Style Control for Expressive, Non-attentive Speech Synthesis Nov 19, 2021 Expressive Speech Synthesis Speech Synthesis
應用文脈分析於中英夾雜語音合成系統(Linguistic Analysis for English/Mandarin Speech Synthesis System) Oct 1, 2019 Speech Synthesis
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation May 14, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention Jan 25, 2022 Form Speech Synthesis
Zero-Shot Mono-to-Binaural Speech Synthesis Dec 11, 2024 Audio Synthesis Denoising
Zero-shot personalized lip-to-speech synthesis with face image based voice control May 9, 2023 Lip to Speech Synthesis Representation Learning
Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling May 26, 2025 Sentence Speech Synthesis
Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model Apr 24, 2023 Rhythm Self-Supervised Learning
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models May 23, 2023 Speech Synthesis text-to-speech
整合語者嵌入向量與後置濾波器於提升個人化合成語音之語者相似度 (Incorporating Speaker Embedding and Post-Filter Network for Improving Speaker Similarity of Personalized Speech Synthesis System) Dec 1, 2021 Speech Synthesis
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions Jun 3, 2025 Expressive Speech Synthesis Prompt Learning
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations Jun 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis Nov 19, 2021 Clustering Decoder
Prosodic Prominence and Boundaries in Sequence-to-Sequence Speech Synthesis Jun 29, 2020 Sentence Speech Synthesis
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech Nov 4, 2020 Graph Attention Representation Learning
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis Dec 16, 2024 Speech Synthesis text-to-speech
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit Aug 13, 2020 Language Modeling Language Modelling
Prosody-TTS: An end-to-end speech synthesis system with prosody control Oct 6, 2021 Rhythm Speech Synthesis
Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis Apr 14, 2025 Language Modeling Language Modelling
Punjabi Text-To-Speech Synthesis System Dec 1, 2012 Speech Synthesis text-to-speech
PyDial: A Multi-domain Statistical Dialogue System Toolkit Jul 1, 2017 Dialogue Management Speech Recognition
pyiwn: A Python based API to access Indian Language WordNets Jan 1, 2018 Speech Synthesis
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis Mar 14, 2023 Emotional Speech Synthesis Sentence
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective Sep 29, 2024 Audio-Visual Speech Recognition Lip Reading
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis Apr 4, 2024 Language Modeling Language Modelling
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations May 24, 2025 Expressive Speech Synthesis Speech Synthesis
Real-time Incremental Speech-to-Speech Translation of Dialogs Jun 1, 2012 Machine Translation Speech Recognition
Real-Time Single-Speaker Taiwanese-Accented Mandarin Speech Synthesis System Sep 1, 2020 Speech Synthesis
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence May 9, 2022 Speech Synthesis text-to-speech
Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis Jan 26, 2016 General Classification regression
RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech Oct 26, 2022 Speech Synthesis
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis Sep 8, 2021 Expressive Speech Synthesis Sentence
Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images Sep 1, 2017 Referring Expression Referring expression generation
Regeneration Learning: A Learning Paradigm for Data Generation Jan 21, 2023 Image Generation Representation Learning
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability Apr 3, 2021 Emotion Recognition reinforcement-learning
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models May 23, 2024 Image Generation reinforcement-learning
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling Apr 1, 2024 Speaker Identification Speech Synthesis
Replay Spoofing Countermeasure Using Autoencoder and Siamese Network on ASVspoof 2019 Challenge Oct 29, 2019 Speaker Verification Speech Synthesis
Residual-guided Personalized Speech Synthesis based on Face Image Apr 1, 2022 Speech Synthesis
Response Generation Based on Hierarchical Semantic Structure with POMDP Re-ranking for Conversational Dialogue Systems Oct 1, 2013 Dialogue Management Information Retrieval
Retrieval-Augmented Audio Deepfake Detection Apr 22, 2024 Audio Deepfake Detection DeepFake Detection
Review of end-to-end speech synthesis technology based on deep learning Apr 20, 2021 Speech Synthesis
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement Dec 21, 2022 Audio-Visual Speech Recognition Resynthesis
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration Jan 1, 2023 Audio-Visual Speech Recognition Resynthesis
Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis May 25, 2025 Speech Synthesis text-to-speech
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis Jun 5, 2023 Rhythm Sentence