Garnishing a phonetic dictionary for ASR intake

2019-09-01 · WS (NoDaLiDa) 2019

Iben Nyholm Debess, Sandra Saxov Lamhauge, Peter Juel Henrichsen

Abstract

We present a new method for preparing a lexical-phonetic database as a resource for acoustic model training. The research is an offshoot of the ongoing Project Ravnur (Speech Recognition for Faroese), but the method is language-independent. At NODALIDA 2019 we demonstrate the method (called SHARP) online, showing how a traditional lexical-phonetic dictionary (with a very rich phone inventory) is transformed into an ASR-friendly database (with reduced phonetics, preventing data sparseness). The mapping procedure is informed by a corpus of speech transcripts. We conclude with a discussion on the benefits of a well-thought-out BLARK design (Basic Language Resource Kit), making tools like SHARP possible.
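The abstract does not spell out how SHARP performs the reduction, so the following is only an illustrative sketch of the general idea: a rich phone inventory is collapsed where a transcript corpus shows too few occurrences of a phone. The function name `reduce_inventory`, the `merge_map` fallback table, the `min_count` threshold, and the toy lexicon are all assumptions made for this example and are not taken from the paper or from Project Ravnur.

```python
from collections import Counter


def reduce_inventory(lexicon, corpus_words, merge_map, min_count):
    """Return a copy of `lexicon` with rare phones replaced by fallbacks.

    lexicon      -- dict mapping a word to its list of phones (rich inventory)
    corpus_words -- list of word tokens taken from speech transcripts
    merge_map    -- hypothetical hand-written table: rare phone -> fallback phone
    min_count    -- phones seen fewer times than this in the corpus are merged
    """
    # Count phone occurrences in the corpus by looking each token up
    # in the lexicon; tokens missing from the lexicon are skipped.
    counts = Counter()
    for word in corpus_words:
        counts.update(lexicon.get(word, []))

    def map_phone(phone):
        # Merge only when the phone is rare AND a fallback is defined.
        if counts[phone] < min_count and phone in merge_map:
            return merge_map[phone]
        return phone

    return {
        word: [map_phone(p) for p in phones]
        for word, phones in lexicon.items()
    }


# Toy data (invented for illustration, not real Faroese entries).
toy_lexicon = {
    "aa": ["a:", "b"],
    "ab": ["a", "b"],
    "ac": ["a", "k"],
}
toy_corpus = ["ab", "ab", "ac", "aa"]
toy_merge_map = {"a:": "a"}  # assumed fallback: long vowel -> short vowel

reduced = reduce_inventory(toy_lexicon, toy_corpus, toy_merge_map, min_count=2)
print(reduced)
```

In this toy run the long vowel `a:` occurs only once in the corpus and has a fallback, so it is merged into `a`; the phone `k` is equally rare but has no fallback in `merge_map`, so it is left unchanged. A real system would need a linguistically motivated merge table and threshold.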
