POS-Tag Based Poetry Generation with WordNet Aug 1, 2013 POS Speech Synthesis
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems May 1, 2012 Speech Synthesis
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis Aug 4, 2018 Speech Synthesis text-to-speech
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis Nov 2, 2022 Expressive Speech Synthesis Speech Synthesis
Predicting Phrase Breaks in Classical and Modern Standard Arabic Text May 1, 2012 Chunking Human Parsing
Predicting Romanian Stress Assignment Apr 1, 2014 Speech Synthesis Text-To-Speech Synthesis
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Jun 25, 2021 Quantization Speaker anonymization
Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis Nov 10, 2020 Speech Synthesis
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Jun 11, 2021 Audio Generation Denoising
Privacy-oriented manipulation of speaker representations Oct 10, 2023 Speaker Recognition Speech Synthesis
Probabilistic Dialogue Models with Prior Domain Knowledge Jul 1, 2012 Dialogue Management Semantic Parsing
Probing Speaker-specific Features in Speaker Representations Jan 9, 2025 Self-Supervised Learning Speaker Verification
Probing the Feasibility of Multilingual Speaker Anonymization Jul 3, 2024 Speaker anonymization Speech Synthesis
PROEMO: Prompt-Driven Text-to-Speech Synthesis Based on Emotion and Intensity Control Jan 10, 2025 Speech Synthesis text-to-speech
PSCodec: A Series of High-Fidelity Low-bitrate Neural Speech Codecs Leveraging Prompt Encoders Apr 3, 2024 Representation Learning Speaker Verification
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions Jun 3, 2025 Expressive Speech Synthesis Prompt Learning
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations Jun 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis Nov 19, 2021 Clustering Decoder
Prosodic Prominence and Boundaries in Sequence-to-Sequence Speech Synthesis Jun 29, 2020 Sentence Speech Synthesis
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech Nov 4, 2020 Graph Attention Representation Learning
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis Dec 16, 2024 Speech Synthesis text-to-speech
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit Aug 13, 2020 Language Modeling Language Modelling
Prosody-TTS: An end-to-end speech synthesis system with prosody control Oct 6, 2021 Rhythm Speech Synthesis
Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis Apr 14, 2025 Language Modeling Language Modelling
Punjabi Text-To-Speech Synthesis System Dec 1, 2012 Speech Synthesis text-to-speech
PyDial: A Multi-domain Statistical Dialogue System Toolkit Jul 1, 2017 Dialogue Management Speech Recognition
pyiwn: A Python based API to access Indian Language WordNets Jan 1, 2018 Speech Synthesis
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis Mar 14, 2023 Emotional Speech Synthesis Sentence
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective Sep 29, 2024 Audio-Visual Speech Recognition Lip Reading
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis Apr 4, 2024 Language Modeling Language Modelling
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations May 24, 2025 Expressive Speech Synthesis Speech Synthesis
Real-time Incremental Speech-to-Speech Translation of Dialogs Jun 1, 2012 Machine Translation Speech Recognition
Real-Time Single-Speaker Taiwanese-Accented Mandarin Speech Synthesis System Sep 1, 2020 Speech Synthesis
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence May 9, 2022 Speech Synthesis text-to-speech
Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis Jan 26, 2016 General Classification regression
RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech Oct 26, 2022 Speech Synthesis
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis Sep 8, 2021 Expressive Speech Synthesis Sentence
Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images Sep 1, 2017 Referring Expression Referring expression generation
Regeneration Learning: A Learning Paradigm for Data Generation Jan 21, 2023 Image Generation Representation Learning
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability Apr 3, 2021 Emotion Recognition reinforcement-learning
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models May 23, 2024 Image Generation reinforcement-learning
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling Apr 1, 2024 Speaker Identification Speech Synthesis
Replay Spoofing Countermeasure Using Autoencoder and Siamese Network on ASVspoof 2019 Challenge Oct 29, 2019 Speaker Verification Speech Synthesis
Residual-guided Personalized Speech Synthesis based on Face Image Apr 1, 2022 Speech Synthesis
Response Generation Based on Hierarchical Semantic Structure with POMDP Re-ranking for Conversational Dialogue Systems Oct 1, 2013 Dialogue Management Information Retrieval
Retrieval-Augmented Audio Deepfake Detection Apr 22, 2024 Audio Deepfake Detection DeepFake Detection
Review of end-to-end speech synthesis technology based on deep learning Apr 20, 2021 Speech Synthesis
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement Dec 21, 2022 Audio-Visual Speech Recognition Resynthesis
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration Jan 1, 2023 Audio-Visual Speech Recognition Resynthesis
Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis May 25, 2025 Speech Synthesis text-to-speech