SOTAVerified

Prosody Prediction

Predicting prosodic prominence from text. This is a 2-way classification task, assigning each word in a sentence a label 1 (prominent) or 0 (non-prominent).

( Image credit: Helsinki Prosody Corpus )

Papers

Showing 115 of 15 papers

TitleStatusHype
PRESENT: Zero-Shot Text-to-Prosody ControlCode1
On the Utility of Self-supervised Models for Prosody-related TasksCode1
VisualSpeech: Enhance Prosody with Visual Context in TTS0
DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles0
Word-wise intonation model for cross-language TTS systems0
Prosody Analysis of AudiobooksCode0
A Comparative Analysis of Pretrained Language Models for Text-to-Speech0
Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data0
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model0
Ensemble prosody prediction for expressive speech synthesis0
Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis0
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit0
Controllable Sequence-To-Sequence Neural TTS with LPCNET Backend for Real-time Speech Synthesis on CPU0
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word RepresentationsCode0
Automatic Prosody Prediction for Chinese Speech Synthesis using BLSTM-RNN and Embedding Features0
Show:102550

No leaderboard results yet.