Grammatical Error Correction

Grammatical Error Correction (GEC) is the task of correcting different kinds of errors in text such as spelling, punctuation, grammatical, and word choice errors.

GEC is typically formulated as a sentence correction task. A GEC system takes a potentially erroneous sentence as input and is expected to transform it to its corrected version. See the example given below:

| Input (Erroneous) | Output (Corrected) | | ------------------------- | ---------------------- | |She see Tom is catched by policeman in park at last night. | She saw Tom caught by a policeman in the park last night.|

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 376–400 of 415 papers

Title	Date	Tasks	Status
Towards a standard evaluation method for grammatical error detection and correction	May 1, 2015	Grammatical Error CorrectionGrammatical Error Detection	CodeCode Available
Neural Quality Estimation of Grammatical Error Correction	Oct 1, 2018	Grammatical Error CorrectionMachine Translation	CodeCode Available
IMPARA: Impact-Based Metric for GEC Using Parallel Data	Oct 1, 2022	Grammatical Error Correction	CodeCode Available
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks	Oct 20, 2023	Grammatical Error CorrectionText Simplification	CodeCode Available
Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora	May 29, 2023	Grammatical Error Correction	CodeCode Available
Improving Explainability of Sentence-level Metrics via Edit-level Attribution for Grammatical Error Correction	Dec 17, 2024	AttributeGrammatical Error Correction	CodeCode Available
A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning	Jul 2, 2019	Grammatical Error CorrectionTransfer Learning	CodeCode Available
Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task	Apr 16, 2018	Domain AdaptationGrammatical Error Correction	CodeCode Available
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data	Mar 1, 2019	DenoisingGrammatical Error Correction	CodeCode Available
Improving Grammatical Error Correction via Contextual Data Augmentation	Jun 25, 2024	Data AugmentationGrammatical Error Correction	CodeCode Available
Improving Grammatical Error Correction with Machine Translation Pairs	Nov 7, 2019	Grammatical Error CorrectionLanguage Modeling	CodeCode Available
Using Wikipedia Edits in Low Resource Grammatical Error Correction	Nov 1, 2018	DecoderGrammatical Error Correction	CodeCode Available
ErAConD : Error Annotated Conversational Dialog Dataset for Grammatical Error Correction	Dec 15, 2021	ChatbotGrammatical Error Correction	CodeCode Available
Seq2Edits: Sequence Transduction Using Span-level Edit Operations	Sep 23, 2020	Grammatical Error CorrectionSentence	CodeCode Available
Towards Lithuanian grammatical error correction	Mar 18, 2022	Grammatical Error Correction	CodeCode Available
Enhancing Grammatical Error Detection using BERT with Cleaned Lang-8 Dataset	Nov 23, 2024	Grammatical Error CorrectionGrammatical Error Detection	CodeCode Available
Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection	Sep 26, 2018	Grammatical Error CorrectionGrammatical Error Detection	CodeCode Available
Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification	Apr 30, 2018	Grammatical Error CorrectionSentence	CodeCode Available
Inherent Biases in Reference-based Evaluation for Grammatical Error Correction	Jul 1, 2018	Grammatical Error CorrectionSentence	CodeCode Available
Efficient and Interpretable Grammatical Error Correction with Mixture of Experts	Oct 30, 2024	Grammatical Error CorrectionMixture-of-Experts	CodeCode Available
DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models	Dec 17, 2024	Grammatical Error CorrectionLanguage Modeling	CodeCode Available
Some Grammatical Errors are Frequent, Others are Important	May 11, 2022	Grammatical Error Correction	CodeCode Available
Is this the end of the gold standard? A straightforward reference-less grammatical error correction metric	Nov 1, 2021	Grammatical Error CorrectionSentence	CodeCode Available
SOME: Reference-less Sub-Metrics Optimized for Manual Evaluations of Grammatical Error Correction	Dec 1, 2020	Grammatical Error CorrectionSentence	CodeCode Available
An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction	Sep 2, 2019	Grammatical Error Correction	CodeCode Available

Show:10 25 50

← PrevPage 16 of 17Next →

All datasets CoNLL-2014 Shared Task BEA-2019 (test)Falko-MERLIN JFLEG UA-GEC CoNLL-2014 Shared Task (10 annotations)Restricted Unrestricted _Restricted_EstGEC-L2 FCGEC MuCGEC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ensembles of best 7 models + GRECO + GTP-rerank	F0.5	72.8	—	Unverified
2	Majority-voting ensemble on best 7 models	F0.5	71.8	—	Unverified
3	GRECO (voting+ESC)	F0.5	71.12	—	Unverified
4	GEC-DI (LM+GED)	F0.5	69.6	—	Unverified
5	Unsupervised GEC + cLang8	F0.5	69.6	—	Unverified
6	ESC	F0.5	69.51	—	Unverified
7	T5	F0.5	68.87	—	Unverified
8	MoECE	F0.5	67.79	—	Unverified
9	SynGEC	F0.5	67.6	—	Unverified
10	Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)	F0.5	66.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Majority-voting ensemble on best 7 models	F0.5	81.4	—	Unverified
2	GRECO (voting+ESC)	F0.5	80.84	—	Unverified
3	ESC	F0.5	79.9	—	Unverified
4	RedPenNet	F0.5	77.6	—	Unverified
5	clang_large_ft2-gector	F0.5	77.1	—	Unverified
6	Unsupervised GEC + cLang8	F0.5	76.5	—	Unverified
7	DeBERTa + RoBERTa + XLNet	F0.5	76.05	—	Unverified
8	MoECE	F0.5	74.07	—	Unverified
9	Sequence tagging + token-level transformations + two-stage fine-tuning (+RoBERTa, XLNet)	F0.5	73.7	—	Unverified
10	BEA Combination	F0.5	73.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	76.75	—	Unverified
2	mT5-based multimodal MoE	F0.5	76.3	—	Unverified
3	gT5 xxl	F0.5	75.96	—	Unverified
4	Transformer	F0.5	73.71	—	Unverified
5	Transformer - synthetic pretrain only	F0.5	51.41	—	Unverified
6	Multilayer Convolutional Encoder-Decoder	F0.5	43.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VERNet	GLEU	62.1	—	Unverified
2	Transformer + Pre-train with Pseudo Data + BERT	GLEU	62	—	Unverified
3	SMT + BiGRU	GLEU	61.5	—	Unverified
4	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	GLEU	61	—	Unverified
5	Transformer	GLEU	59.9	—	Unverified
6	CNN Seq2Seq	GLEU	57.47	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	74.09	—	Unverified
2	mBART-based model with synthetic data	F0.5	68.17	—	Unverified
3	mT5 large + 10M synth	F0.5	68.09	—	Unverified
4	RedPenNet	F0.5	67.71	—	Unverified
5	ChatGPT (zero-shot)	F0.5	27.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GRECO (vote+ESC)	F0.5	85.21	—	Unverified
2	SMT + BiGRU	F0.5	72.04	—	Unverified
3	CNN Seq2Seq	F0.5	70.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN Seq2Seq + Quality Estimation	F0.5	56.52	—	Unverified
2	Transformer	F0.5	55.8	—	Unverified
3	+ BIFI with no critic	F0.5	18.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN Seq2Seq + Fluency Boost and inference	GLEU	62.37	—	Unverified
2	CNN Seq2Seq + Fluency Boost	F0.5	61.34	—	Unverified
3	+ BIFI (ours)	F0.5	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer	GLEU	59.9	—	Unverified
2	CNN Seq2Seq	GLEU	57.47	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	69.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STG-Joint	exact match	34.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GEC-DI (LM+GED)	F0.5	48.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RedPenNet	F0.5	77.6	—	Unverified