SOTAVerified

Polyphone disambiguation

A component of the text-to-speech (TTS) front end that predicts the correct pronunciation of polyphonic characters in the input text, i.e. characters that have more than one possible reading.
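To make the task concrete, here is a minimal sketch of what a disambiguator takes in and returns. It is a toy, dictionary-based lookup and not the method of any paper listed on this page; the names `WORD_READINGS`, `DEFAULT_READING`, and `disambiguate` are hypothetical, and the tiny lexicon is an assumption made only for illustration. It relies on the fact that the Mandarin character 行 is read "hang2" in 银行 (bank) and "xing2" in 行走 (to walk).

```python
from typing import Optional

# Hypothetical toy lexicon (assumption, for illustration only).
# Readings are pinyin with tone numbers, one per character of the word.
WORD_READINGS = {
    "银行": ["yin2", "hang2"],  # "bank": 行 is read hang2
    "行走": ["xing2", "zou3"],  # "to walk": 行 is read xing2
}

# Fallback reading used when no lexicon word covers the character.
DEFAULT_READING = {"行": "xing2", "银": "yin2", "走": "zou3"}


def disambiguate(text: str, index: int) -> Optional[str]:
    """Return a reading for text[index], using the first lexicon word that covers it."""
    for word, readings in WORD_READINGS.items():
        start = text.find(word)
        while start != -1:
            if start <= index < start + len(word):
                # The character sits inside a known word: use that word's reading.
                return readings[index - start]
            start = text.find(word, start + 1)
    # No covering word found: fall back to the default reading, if any.
    return DEFAULT_READING.get(text[index])


print(disambiguate("他在银行工作", 3))  # reading of 行 inside 银行
print(disambiguate("他在行走", 2))      # reading of 行 inside 行走
```

The systems listed below typically replace this kind of fixed lookup with a learned model that scores the candidate readings from the surrounding context.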

Papers

Showing 1-10 of 15 papers

Title | Status | Hype
g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin | Code | 2
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Code | 1
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset | Code | 1
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation | | 0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | | 0
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT | | 0
External Knowledge Augmented Polyphone Disambiguation Using Large Language Model | | 0
Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data | | 0
Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end | | 0
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | g2pW | Accuracy | 99.08 | | Unverified
2 | g2pM (BERT) | Accuracy | 97.85 | | Unverified
3 | g2pM (BiLSTM) | Accuracy | 97.31 | | Unverified