Grammatical Error Correction

Grammatical Error Correction (GEC) is the task of correcting errors in text, including spelling, punctuation, grammar, and word choice errors.

GEC is typically formulated as a sentence correction task. A GEC system takes a potentially erroneous sentence as input and is expected to transform it into its corrected version, as in the following example:

| Input (Erroneous) | Output (Corrected) |
| ------------------------- | ---------------------- |
| She see Tom is catched by policeman in park at last night. | She saw Tom caught by a policeman in the park last night. |
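The sentence-level formulation can also be viewed as a set of token-level edits between the input and the output. The sketch below is only an illustration: it aligns the example pair with Python's standard `difflib` and lists the differing spans. The helper names `extract_edits` and `apply_edits` are hypothetical, whitespace tokenization is an assumption, and dedicated GEC tools align and classify edits with linguistic information rather than a plain diff.

```python
from difflib import SequenceMatcher


def extract_edits(source: str, corrected: str):
    """Return (tag, start, end, replacement) edits over whitespace tokens.

    start/end index into the source tokens; replacement is the corrected text.
    This is a plain diff, not a linguistically informed alignment.
    """
    src, tgt = source.split(), corrected.split()
    matcher = SequenceMatcher(a=src, b=tgt, autojunk=False)
    edits = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # keep only replace / delete / insert spans
            edits.append((tag, i1, i2, " ".join(tgt[j1:j2])))
    return edits


def apply_edits(source: str, edits):
    """Apply edits right to left so earlier token indices stay valid."""
    tokens = source.split()
    for _tag, start, end, replacement in reversed(edits):
        tokens[start:end] = replacement.split()
    return " ".join(tokens)


source = "She see Tom is catched by policeman in park at last night."
corrected = "She saw Tom caught by a policeman in the park last night."

edits = extract_edits(source, corrected)
for tag, start, end, replacement in edits:
    original = " ".join(source.split()[start:end])
    print(f"{tag:8} [{start}:{end}] {original!r} -> {replacement!r}")
```

Applying the extracted edits to the input reproduces the corrected sentence, which is a quick sanity check that the edit list is complete.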

Papers


Showing 201–250 of 415 papers

Title	Date	Tasks	Status
Phrase Structure Annotation and Parsing for Learner English	Aug 1, 2016	Grammatical Error Correction, Part-Of-Speech Tagging	Unverified
Position Offset Label Prediction for Grammatical Error Correction	Oct 1, 2022	Data Augmentation, Decoder	Unverified
POSTECH Grammatical Error Correction System in the CoNLL-2014 Shared Task	Jun 1, 2014	Grammatical Error Correction, Language Modelling	Unverified
Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing	Jan 23, 2025	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Proficiency Matters Quality Estimation in Grammatical Error Correction	Jan 17, 2022	Grammatical Error Correction	Unverified
Prompting open-source and commercial language models for grammatical error correction of English learner text	Jan 15, 2024	Grammatical Error Correction	Unverified
Proofread Sentence Generation as Multi-Task Learning with Editing Operation Prediction	Nov 1, 2017	Articles, Grammatical Error Correction	Unverified
ProQE: Proficiency-wise Quality Estimation dataset for Grammatical Error Correction	Jun 1, 2022	Grammatical Error Correction	Unverified
Pseudo-Bidirectional Decoding for Local Sequence Transduction	Jan 31, 2020	Decoder, Grammatical Error Correction	Unverified
Pseudo-Error Generation for Grammatical Error Correction Based on Learner’s First Language	Sep 17, 2021	Grammatical Error Correction, Translation	Unverified
RACAI GEC -- A hybrid approach to Grammatical Error Correction	Jun 1, 2014	Grammatical Error Correction, Grammatical Error Detection	Unverified
Reducing Sequence Length by Predicting Edit Operations with Large Language Models	May 19, 2023	Formality Style Transfer, Grammatical Error Correction	Unverified
Reference-based Metrics can be Replaced with Reference-less Metrics in Evaluating Grammatical Error Correction Systems	Nov 1, 2017	Grammatical Error Correction, Machine Translation	Unverified
Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction	Feb 18, 2024	Grammatical Error Correction	Unverified
Robust and Effective Grammatical Error Correction with Simple Cycle Self-Augmenting	Nov 16, 2021	Adversarial Attack, Grammatical Error Correction	Unverified
Robust Systems for Preposition Error Correction Using Wikipedia Revisions	Jun 1, 2013	Grammatical Error Correction, Paraphrase Generation	Unverified
Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction	May 27, 2025	Grammatical Error Correction	Unverified
Semantic Parsing for English as a Second Language	Jul 1, 2020	Grammatical Error Correction, Language Acquisition	Unverified
Semi-automatically Annotated Learner Corpus for Russian	Jun 1, 2022	Grammatical Error Correction, Grammatical Error Detection	Unverified
Sentential Paraphrasing as Black-Box Machine Translation	Jun 1, 2016	Grammatical Error Correction, Language Modeling	Unverified
Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation	May 22, 2022	Grammatical Error Correction, Sentence	Unverified
Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting	Sep 13, 2019	Data Augmentation, Formality Style Transfer	Unverified
Speak & Improve Challenge 2025: Tasks and Baseline Systems	Dec 16, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Speak & Improve Corpus 2025: an L2 English Speech Corpus for Language Assessment and Feedback	Dec 16, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Spivavtor: An Instruction Tuned Ukrainian Text Editing Model	Apr 29, 2024	Grammatical Error Correction, model	Unverified
Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model	Dec 1, 2020	Decoder, Grammatical Error Correction	Unverified
Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction	Jun 26, 2023	Grammatical Error Correction	Unverified
System Combination for Grammatical Error Correction	Oct 1, 2014	Grammatical Error Correction, Machine Translation	Unverified
Tense and Aspect Error Correction for ESL Learners Using Global Context	Jul 1, 2012	Grammatical Error Correction	Unverified
Text Generation with Text-Editing Models	Jun 14, 2022	Grammatical Error Correction, Hallucination	Unverified
The AIP-Tohoku System at the BEA-2019 Shared Task	Aug 1, 2019	Grammatical Error Correction, Grammatical Error Detection	Unverified
The AMU System in the CoNLL-2014 Shared Task: Grammatical Error Correction by Data-Intensive and Feature-Rich Statistical Machine Translation	Jun 1, 2014	Grammatical Error Correction, Language Modelling	Unverified
The BEA-2019 Shared Task on Grammatical Error Correction	Aug 1, 2019	Grammatical Error Correction	Unverified
The BLCU System in the BEA 2019 Shared Task	Aug 1, 2019	Grammatical Error Correction	Unverified
The Columbia System in the QALB-2014 Shared Task on Arabic Error Correction	Oct 1, 2014	Grammatical Error Correction	Unverified
The CoNLL-2013 Shared Task on Grammatical Error Correction	Aug 1, 2013	Coreference Resolution, Dependency Parsing	Unverified
The CUED's Grammatical Error Correction Systems for BEA-2019	Jun 29, 2019	Grammatical Error Correction, Machine Translation	Unverified
The Effect of Error Rate in Artificially Generated Data for Automatic Preposition and Determiner Correction	Sep 1, 2017	Grammatical Error Correction, Machine Translation	Unverified
The Effect of Learner Corpus Size in Grammatical Error Correction of ESL Writings	Dec 1, 2012	Grammatical Error Correction, Machine Translation	Unverified
The Illinois-Columbia System in the CoNLL-2014 Shared Task	Jun 1, 2014	Grammatical Error Correction	Unverified
The LAIX Systems in the BEA-2019 GEC Shared Task	Aug 1, 2019	Classification, General Classification	Unverified
The Unbearable Weight of Generating Artificial Errors for Grammatical Error Correction	Jul 21, 2019	Grammatical Error Correction	Unverified
The Write & Improve Corpus 2024: Error-annotated and CEFR-labelled essays by learners of English	Oct 23, 2024	Descriptive, Grammatical Error Correction	Unverified
Tibyan Corpus: Balanced and Comprehensive Error Coverage Corpus Using ChatGPT for Arabic Grammatical Error Correction	Nov 7, 2024	Data Augmentation, Grammatical Error Correction	Unverified
TMU-NLP System Using BERT-based Pre-trained Model to the NLP-TEA CGED Shared Task 2020	Dec 1, 2020	Decoder, Grammatical Error Correction	Unverified
TMU Transformer System Using BERT for Re-ranking at BEA 2019 Grammatical Error Correction on Restricted Track	Aug 1, 2019	Grammatical Error Correction, Re-Ranking	Unverified
Toward More Precision in Correction of Grammatical Errors	Aug 1, 2013	Grammatical Error Correction	Unverified
Towards End-to-End Spoken Grammatical Error Correction	Nov 9, 2023	Grammatical Error Correction, speech-recognition	Unverified
Towards Minimal Supervision BERT-based Grammar Error Correction	Jan 10, 2020	Grammatical Error Correction, Language Modeling	Unverified
Towards Universal Dependencies for Learner Chinese	May 1, 2017	Grammatical Error Correction	Unverified


Datasets: CoNLL-2014 Shared Task, BEA-2019 (test), Falko-MERLIN, JFLEG, UA-GEC, CoNLL-2014 Shared Task (10 annotations), Restricted, Unrestricted, _Restricted_, EstGEC-L2, FCGEC, MuCGEC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ensembles of best 7 models + GRECO + GPT-rerank	F0.5	72.8	—	Unverified
2	Majority-voting ensemble on best 7 models	F0.5	71.8	—	Unverified
3	GRECO (voting+ESC)	F0.5	71.12	—	Unverified
4	GEC-DI (LM+GED)	F0.5	69.6	—	Unverified
5	Unsupervised GEC + cLang8	F0.5	69.6	—	Unverified
6	ESC	F0.5	69.51	—	Unverified
7	T5	F0.5	68.87	—	Unverified
8	MoECE	F0.5	67.79	—	Unverified
9	SynGEC	F0.5	67.6	—	Unverified
10	Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)	F0.5	66.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Majority-voting ensemble on best 7 models	F0.5	81.4	—	Unverified
2	GRECO (voting+ESC)	F0.5	80.84	—	Unverified
3	ESC	F0.5	79.9	—	Unverified
4	RedPenNet	F0.5	77.6	—	Unverified
5	clang_large_ft2-gector	F0.5	77.1	—	Unverified
6	Unsupervised GEC + cLang8	F0.5	76.5	—	Unverified
7	DeBERTa + RoBERTa + XLNet	F0.5	76.05	—	Unverified
8	MoECE	F0.5	74.07	—	Unverified
9	Sequence tagging + token-level transformations + two-stage fine-tuning (+RoBERTa, XLNet)	F0.5	73.7	—	Unverified
10	BEA Combination	F0.5	73.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	76.75	—	Unverified
2	mT5-based multimodal MoE	F0.5	76.3	—	Unverified
3	gT5 xxl	F0.5	75.96	—	Unverified
4	Transformer	F0.5	73.71	—	Unverified
5	Transformer - synthetic pretrain only	F0.5	51.41	—	Unverified
6	Multilayer Convolutional Encoder-Decoder	F0.5	43.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VERNet	GLEU	62.1	—	Unverified
2	Transformer + Pre-train with Pseudo Data + BERT	GLEU	62	—	Unverified
3	SMT + BiGRU	GLEU	61.5	—	Unverified
4	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	GLEU	61	—	Unverified
5	Transformer	GLEU	59.9	—	Unverified
6	CNN Seq2Seq	GLEU	57.47	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	74.09	—	Unverified
2	mBART-based model with synthetic data	F0.5	68.17	—	Unverified
3	mT5 large + 10M synth	F0.5	68.09	—	Unverified
4	RedPenNet	F0.5	67.71	—	Unverified
5	ChatGPT (zero-shot)	F0.5	27.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GRECO (vote+ESC)	F0.5	85.21	—	Unverified
2	SMT + BiGRU	F0.5	72.04	—	Unverified
3	CNN Seq2Seq	F0.5	70.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN Seq2Seq + Quality Estimation	F0.5	56.52	—	Unverified
2	Transformer	F0.5	55.8	—	Unverified
3	+ BIFI with no critic	F0.5	18.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN Seq2Seq + Fluency Boost and inference	GLEU	62.37	—	Unverified
2	CNN Seq2Seq + Fluency Boost	F0.5	61.34	—	Unverified
3	+ BIFI (ours)	F0.5	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer	GLEU	59.9	—	Unverified
2	CNN Seq2Seq	GLEU	57.47	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	69.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STG-Joint	exact match	34.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GEC-DI (LM+GED)	F0.5	48.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RedPenNet	F0.5	77.6	—	Unverified
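
Most of the tables above report F0.5, the F-measure with beta = 0.5, which weights precision twice as much as recall. The sketch below shows that calculation from edit-level counts. The function name `f_beta` and the true-positive, false-positive and false-negative counts passed to it are illustrative assumptions. Actual GEC scorers differ in how they extract and match edits against references, so this is not a reimplementation of any particular scorer.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """F-beta from edit-level counts; beta < 1 favours precision.

    tp: proposed edits that match a reference edit
    fp: proposed edits that match no reference edit
    fn: reference edits the system did not propose
    Zero denominators return 0.0 here (an assumption; scorers vary).
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Two hypothetical systems with mirrored precision and recall:
cautious = f_beta(tp=2, fp=0, fn=2)   # precision 1.0, recall 0.5
eager = f_beta(tp=2, fp=2, fn=0)      # precision 0.5, recall 1.0
print(f"cautious: {cautious:.4f}, eager: {eager:.4f}")
```

With these made-up counts the cautious system scores higher. This reflects why GEC evaluation uses beta = 0.5: a wrong correction is treated as more costly than a missed one.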