A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning

2019-07-02 · WS 2019 · Code Available

Yo Joong Choe, Jiyeon Ham, Kyubyong Park, Yeoil Yoon

Code

github.com/kakaobrain/helo_word
Official · In paper · pytorch · ★ 0
github.com/kakaobrain/helo-word
pytorch · ★ 0

Abstract

Grammatical error correction can be viewed as a low-resource sequence-to-sequence task, because publicly available parallel corpora are limited. To tackle this challenge, we first generate erroneous versions of large unannotated corpora using a realistic noising function. The resulting parallel corpora are subsequently used to pre-train Transformer models. Then, by sequentially applying transfer learning, we adapt these models to the domain and style of the test set. Combined with a context-aware neural spellchecker, our system achieves competitive results in both the restricted and low-resource tracks of the ACL 2019 BEA Shared Task. We release all of our code and materials for reproducibility.
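
To make the first step of the abstract concrete, here is a minimal sketch of how a noising function can turn clean text into (erroneous, clean) sentence pairs for pre-training. It is not the authors' implementation: the `CONFUSIONS` table, the function names `add_noise` and `make_parallel`, and the probabilities `replace_prob` and `drop_prob` are all hypothetical and chosen only for illustration.

```python
import random

# Hypothetical confusion sets, for illustration only (not taken from the paper).
CONFUSIONS = {
    "a": ["an", "the"],
    "an": ["a", "the"],
    "the": ["a", "an"],
    "in": ["on", "at"],
    "on": ["in", "at"],
    "at": ["in", "on"],
    "is": ["are", "be"],
    "are": ["is", "be"],
}


def add_noise(tokens, rng, replace_prob=0.3, drop_prob=0.05):
    """Return a corrupted copy of `tokens`; the input list is left unchanged."""
    noisy = []
    for tok in tokens:
        key = tok.lower()
        if key in CONFUSIONS and rng.random() < replace_prob:
            # Swap a confusable word for one of its alternatives.
            noisy.append(rng.choice(CONFUSIONS[key]))
        elif rng.random() < drop_prob:
            # Occasionally drop a token to simulate a missing word.
            continue
        else:
            noisy.append(tok)
    return noisy


def make_parallel(sentences, seed=0, **noise_kwargs):
    """Build (noisy source, clean target) pairs from clean sentences."""
    rng = random.Random(seed)  # seeded so the corpus is reproducible
    pairs = []
    for sent in sentences:
        clean = sent.split()
        noisy = add_noise(clean, rng, **noise_kwargs)
        pairs.append((" ".join(noisy), " ".join(clean)))
    return pairs
```

In a sketch like this, the noisy side would serve as the model input and the clean side as the target, so a sequence-to-sequence model pre-trained on the pairs learns to undo the injected errors.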

Tasks

Grammatical Error Correction, Transfer Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
BEA-2019 (test)	Transformer	F0.5	69	—	Unverified
BEA-2019 (test)	Transformer	F0.5	69.5	—	Unverified
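
For readers unfamiliar with the metric column, F0.5 is the F-beta score with beta = 0.5, which weights precision more heavily than recall. The sketch below computes it from precision and recall given as fractions; the function name `f_beta` and the example values in the usage note are illustrative assumptions and are not taken from the paper. The scores in the table appear to be on a 0 to 100 scale.

```python
def f_beta(precision, recall, beta=0.5):
    """F-beta score from precision and recall given as fractions in [0, 1]."""
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when nothing is correct
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)
```

With beta = 0.5, a hypothetical system with precision 0.8 and recall 0.4 scores higher than one with precision 0.4 and recall 0.8, which reflects the preference for corrections that are right over corrections that are numerous.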
