SOTAVerified

Polyphone disambiguation

Polyphone disambiguation is the part of a text-to-speech (TTS) front end that predicts the correct pronunciation of polyphonic characters in the input text, that is, characters with more than one possible reading.
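To make the task concrete, here is a minimal sketch of a dictionary-lookup baseline for Mandarin. It is not taken from any of the papers listed on this page: the `POLYPHONES` table, its entries, and the `disambiguate` function are hypothetical names chosen for illustration, and the listed systems use learned models rather than a hand-written lexicon. Readings are written as pinyin with tone numbers.

```python
# Toy lexicon (illustrative assumption, not a real resource): each polyphonic
# character has a default reading and a few context words that force another.
POLYPHONES = {
    "行": {"default": "xing2", "by_word": {"银行": "hang2", "行业": "hang2"}},
    "长": {"default": "chang2", "by_word": {"校长": "zhang3", "成长": "zhang3"}},
}


def disambiguate(text):
    """Return (index, character, reading) for each polyphonic character in text."""
    results = []
    for i, ch in enumerate(text):
        entry = POLYPHONES.get(ch)
        if entry is None:
            continue  # not a polyphonic character in the toy lexicon
        reading = entry["default"]
        for word, pinyin in entry["by_word"].items():
            # Scan every occurrence of the context word and check whether
            # it covers the character at position i.
            start = text.find(word)
            while start != -1:
                if start <= i < start + len(word):
                    reading = pinyin
                start = text.find(word, start + 1)
        results.append((i, ch, reading))
    return results


if __name__ == "__main__":
    for sentence in ["银行", "行走", "校长在银行"]:
        print(sentence, disambiguate(sentence))
```

A lookup like this fails whenever the surrounding word is not in the lexicon or the reading depends on wider sentence meaning, which is the gap that context-aware models such as the BERT-based systems listed below aim to close.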

Papers

Showing 1-10 of 15 papers

| Title | Status | Hype |
|---|---|---|
| BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | | 0 |
| Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT | | 0 |
| Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling | | 0 |
| External Knowledge Augmented Polyphone Disambiguation Using Large Language Model | | 0 |
| Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation | | 0 |
| A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese | | 0 |
| Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Code | 1 |
| g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin | Code | 2 |
| Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end | | 0 |
| Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | g2pW | Accuracy | 99.08 | | Unverified |
| 2 | g2pM (BERT) | Accuracy | 97.85 | | Unverified |
| 3 | g2pM (BiLSTM) | Accuracy | 97.31 | | Unverified |
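The page does not state how the Accuracy figures are computed or on which dataset. Assuming the usual convention, the percentage of polyphonic characters whose predicted reading exactly matches the reference reading, the metric could be sketched as follows. The function name `accuracy` and the sample readings are illustrative assumptions, not data from the benchmark.

```python
def accuracy(predicted, reference):
    """Percentage of positions where the predicted reading equals the reference.

    Both arguments are equal-length sequences of readings, one per
    polyphonic character in the evaluation set.
    """
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("cannot compute accuracy on an empty evaluation set")
    correct = sum(p == r for p, r in zip(predicted, reference))
    return 100.0 * correct / len(reference)


if __name__ == "__main__":
    # Made-up readings for four polyphonic characters; three of four match.
    pred = ["hang2", "xing2", "zhang3", "chang2"]
    gold = ["hang2", "xing2", "zhang3", "zhang3"]
    print(accuracy(pred, gold))
```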