SOTAVerified

Chinese Grammatical Error Diagnosis with Long Short-Term Memory Networks

2016-12-01WS 2016Unverified0· sign in to hype

Bo Zheng, Wanxiang Che, Jiang Guo, Ting Liu

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Grammatical error diagnosis is an important task in natural language processing. This paper introduces our Chinese Grammatical Error Diagnosis (CGED) system in the NLP-TEA-3 shared task for CGED. The CGED system can diagnose four types of grammatical errors which are redundant words (R), missing words (M), bad word selection (S) and disordered words (W). We treat the CGED task as a sequence labeling task and describe three models, including a CRF-based model, an LSTM-based model and an ensemble model using stacking. We also show in details how we build and train the models. Evaluation includes three levels, which are detection level, identification level and position level. On the CGED-HSK dataset of NLP-TEA-3 shared task, our system presents the best F1-scores in all the three levels and also the best recall in the last two levels.

Tasks

Reproductions