Non-Autoregressive Chinese ASR Error Correction with Phonological Training

2022-07-01NAACL 2022Unverified0· sign in to hype

Zheng Fang, Ruiqing Zhang, Zhongjun He, Hua Wu, Yanan Cao

Unverified — Be the first to reproduce this paper.

Abstract

Automatic Speech Recognition (ASR) is an efficient and widely used input method that transcribes speech signals into text. As the errors introduced by ASR systems will impair the performance of downstream tasks, we introduce a post-processing error correction method, PhVEC, to correct errors in text space. For the errors in ASR result, existing works mainly focus on fixed-length corrections, modifying each wrong token to a correct one (one-to-one correction), but rarely consider the variable-length correction (one-to-many or many-to-one correction). In this paper, we propose an efficient non-autoregressive (NAR) method for Chinese ASR error correction for both cases. Instead of conventionally predicting the sentence length in NAR methods, we propose a novel approach that uses phonological tokens to extend the source sentence for variable-length correction, enabling our model to generate phonetically similar corrections. Experimental results on datasets of different domains show that our method achieves significant improvement in word error rate reduction and speeds up the inference by 6.2 times compared with the autoregressive model.

Tasks

Automatic Speech Recognition Automatic Speech Recognition (ASR)Sentence speech-recognition Speech Recognition

Non-Autoregressive Chinese ASR Error Correction with Phonological Training

Abstract

Tasks

Reproductions