SOTAVerified

Polyphone disambiguation

A part of the TTS-front end framework which serves to predict the correct pronunciation for the input polyphone characters.

Papers

Showing 110 of 15 papers

TitleStatusHype
g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in MandarinCode2
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-SpeechCode1
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark DatasetCode1
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights0
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT0
Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling0
External Knowledge Augmented Polyphone Disambiguation Using Large Language Model0
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese0
Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1g2pWAccuracy99.08Unverified
2g2pM (BERT)Accuracy97.85Unverified
3g2pM (BiLSTM)Accuracy97.31Unverified