Hierarchical Transformer Encoders for Vietnamese Spelling Correction
2021-05-28
Hieu Tran, Cuong V. Dinh, Long Phan, Son T. Nguyen
- Code (official): github.com/heraclex12/Viwiki-spelling
Abstract
In this paper, we propose a Hierarchical Transformer model for the Vietnamese spelling correction problem. The model consists of multiple Transformer encoders and uses both character-level and word-level information to detect errors and make corrections. In addition, to facilitate future work on Vietnamese spelling correction, we propose a realistic dataset collected from real-life texts. We compare our method with other methods and with publicly available systems. The proposed method outperforms all of the contemporary methods in terms of recall, precision, and F1-score. A demo version is publicly available.
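The abstract reports recall, precision, and F1-score but does not spell out how they are computed. As a rough illustration only, the sketch below assumes a token-level error-detection setting in which each token carries a binary label (1 = misspelled, 0 = correct). The function name `detection_scores` and the label encoding are hypothetical and are not taken from the paper or its repository.

```python
from typing import List, Tuple


def detection_scores(gold: List[int], pred: List[int]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for binary token-level error detection.

    Assumption for illustration: gold[i] and pred[i] are 1 when token i is
    (marked as) misspelled and 0 otherwise.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    # Count true positives, false positives, and false negatives.
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)

    # Guard against division by zero when nothing is predicted or nothing is gold.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return precision, recall, f1


# Toy labels for a six-token sentence (made up for this example).
gold_labels = [0, 1, 0, 1, 1, 0]
pred_labels = [0, 1, 1, 1, 0, 0]
precision, recall, f1 = detection_scores(gold_labels, pred_labels)
```

On the toy labels there are two true positives, one false positive, and one false negative, so all three scores come out to 2/3. Correction quality would need a separate measure that also checks whether the proposed replacement matches the reference word.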