Grammatical Error Correction

Grammatical Error Correction (GEC) is the task of correcting different kinds of errors in text such as spelling, punctuation, grammatical, and word choice errors.

GEC is typically formulated as a sentence correction task. A GEC system takes a potentially erroneous sentence as input and is expected to transform it to its corrected version. See the example given below:

| Input (Erroneous) | Output (Corrected) | | ------------------------- | ---------------------- | |She see Tom is catched by policeman in park at last night. | She saw Tom caught by a policeman in the park last night.|

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 415 papers

Title	Date	Tasks	Status
GEE! Grammar Error Explanation with Large Language Models	Nov 16, 2023	Grammatical Error CorrectionSentence	CodeCode Available
GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding	Nov 14, 2023	DecoderDenoising	CodeCode Available
Towards End-to-End Spoken Grammatical Error Correction	Nov 9, 2023	Grammatical Error Correctionspeech-recognition	—Unverified
TLM: Token-Level Masking for Transformers	Oct 28, 2023	Data-to-Text GenerationGrammatical Error Correction	CodeCode Available
Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting	Oct 20, 2023	Adversarial AttackGrammatical Error Correction	CodeCode Available
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks	Oct 20, 2023	Grammatical Error CorrectionText Simplification	CodeCode Available
Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction	Sep 20, 2023	Grammatical Error Correction	CodeCode Available
RedPenNet for Grammatical Error Correction: Outputs to Tokens, Attentions to Spans	Sep 19, 2023	Grammatical Error CorrectionMachine Translation	CodeCode Available
HTEC: Human Transcription Error Correction	Sep 18, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Minimum Bayes' Risk Decoding for System Combination of Grammatical Error Correction Systems	Sep 12, 2023	Grammatical Error Correction	CodeCode Available
Evaluation of really good grammatical error correction	Aug 17, 2023	Grammatical Error CorrectionGrammatical Error Detection	CodeCode Available
ChatGPT for Arabic Grammatical Error Correction	Aug 8, 2023	Few-Shot LearningGrammatical Error Correction	—Unverified
On the (In)Effectiveness of Large Language Models for Chinese Text Correction	Jul 18, 2023	Grammatical Error Correction	—Unverified
On the application of Large Language Models for language teaching and assessment technology	Jul 17, 2023	Grammatical Error CorrectionMisinformation	—Unverified
Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task	Jul 8, 2023	Grammatical Error Correction	—Unverified
Leveraging Denoised Abstract Meaning Representation for Grammatical Error Correction	Jul 5, 2023	Abstract Meaning RepresentationDenoising	—Unverified
A Language Model for Grammatical Error Correction in L2 Russian	Jul 4, 2023	Grammatical Error CorrectionLanguage Modeling	—Unverified
Evaluating GPT-3.5 and GPT-4 on Grammatical Error Correction for Brazilian Portuguese	Jun 27, 2023	Grammatical Error Correction	—Unverified
Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction	Jun 26, 2023	Grammatical Error Correction	—Unverified
Gender-Inclusive Grammatical Error Correction through Augmentation	Jun 12, 2023	Data AugmentationGrammatical Error Correction	CodeCode Available
Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora	May 29, 2023	Grammatical Error Correction	CodeCode Available
Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods	May 29, 2023	Grammatical Error Correction	—Unverified
IdEALS: Idiomatic Expressions for Advancement of Language Skills	May 23, 2023	Grammatical Error CorrectionSentence	—Unverified
Bidirectional Transformer Reranker for Grammatical Error Correction	May 22, 2023	DecoderGrammatical Error Correction	CodeCode Available
Reducing Sequence Length by Predicting Edit Operations with Large Language Models	May 19, 2023	Formality Style TransferGrammatical Error Correction	—Unverified
A Low-Resource Approach to the Grammatical Error Correction of Ukrainian	May 5, 2023	Grammatical Error CorrectionLanguage Modeling	CodeCode Available
Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation	Apr 4, 2023	Grammatical Error CorrectionIn-Context Learning	—Unverified
A BERT-based Unsupervised Grammatical Error Correction Framework	Mar 30, 2023	Grammatical Error CorrectionLanguage Modeling	—Unverified
Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error Correction	Mar 25, 2023	Grammatical Error CorrectionSentence	—Unverified
ChatGPT or Grammarly? Evaluating ChatGPT on Grammatical Error Correction Benchmark	Mar 15, 2023	Grammatical Error CorrectionLanguage Modeling	—Unverified
CSynGEC: Incorporating Constituent-based Syntax for Grammatical Error Correction with a Tailored GEC-Oriented Parser	Nov 15, 2022	Grammatical Error CorrectionSentence	—Unverified
Grammatical Error Correction: A Survey of the State of the Art	Nov 9, 2022	Grammatical Error CorrectionMachine Translation	—Unverified
From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction	Nov 3, 2022	Data AugmentationGrammatical Error Correction	—Unverified
Focus Is What You Need For Chinese Grammatical Error Correction	Oct 23, 2022	Grammatical Error CorrectionSentence	—Unverified
Text Editing as Imitation Game	Oct 21, 2022	Action GenerationGrammatical Error Correction	CodeCode Available
IMPARA: Impact-Based Metric for GEC Using Parallel Data	Oct 1, 2022	Grammatical Error Correction	CodeCode Available
Grammatical Error Correction: Are We There Yet?	Oct 1, 2022	Grammatical Error Correction	—Unverified
Multi-Perspective Document Revision	Oct 1, 2022	Grammatical Error CorrectionRelation Classification	—Unverified
Position Offset Label Prediction for Grammatical Error Correction	Oct 1, 2022	Data AugmentationDecoder	—Unverified
Dynamic Negative Example Construction for Grammatical Error Correction using Contrastive Learning	Oct 1, 2022	Contrastive LearningGrammatical Error Correction	—Unverified
Judge a Sentence by Its Content to Generate Grammatical Errors	Aug 20, 2022	Grammatical Error CorrectionSentence	—Unverified
Gender Bias and Universal Substitution Adversarial Attacks on Grammatical Error Correction Systems for Automated Assessment	Aug 19, 2022	Adversarial AttackGrammatical Error Correction	—Unverified
On Assessing and Developing Spoken ’Grammatical Error Correction’ Systems	Jul 1, 2022	Grammatical Error Correctionspeech-recognition	—Unverified
Text Generation with Text-Editing Models	Jun 14, 2022	Grammatical Error CorrectionHallucination	—Unverified
Developing a Spell and Grammar Checker for Icelandic using an Error Corpus	Jun 1, 2022	Grammatical Error CorrectionSentence	—Unverified
ProQE: Proficiency-wise Quality Estimation dataset for Grammatical Error Correction	Jun 1, 2022	Grammatical Error Correction	—Unverified
Semi-automatically Annotated Learner Corpus for Russian	Jun 1, 2022	Grammatical Error CorrectionGrammatical Error Detection	—Unverified
Automatic Classification of Russian Learner Errors	Jun 1, 2022	ClassificationGrammatical Error Correction	—Unverified
MTee: Open Machine Translation Platform for Estonian Government	Jun 1, 2022	Document TranslationGrammatical Error Correction	—Unverified
Improving Grammatical Error Correction for Multiword Expressions	Jun 1, 2022	DecoderGrammatical Error Correction	—Unverified

Show:10 25 50

← PrevPage 3 of 9Next →

All datasets CoNLL-2014 Shared Task BEA-2019 (test)Falko-MERLIN JFLEG UA-GEC CoNLL-2014 Shared Task (10 annotations)Restricted Unrestricted _Restricted_EstGEC-L2 FCGEC MuCGEC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ensembles of best 7 models + GRECO + GTP-rerank	F0.5	72.8	—	Unverified
2	Majority-voting ensemble on best 7 models	F0.5	71.8	—	Unverified
3	GRECO (voting+ESC)	F0.5	71.12	—	Unverified
4	GEC-DI (LM+GED)	F0.5	69.6	—	Unverified
5	Unsupervised GEC + cLang8	F0.5	69.6	—	Unverified
6	ESC	F0.5	69.51	—	Unverified
7	T5	F0.5	68.87	—	Unverified
8	MoECE	F0.5	67.79	—	Unverified
9	SynGEC	F0.5	67.6	—	Unverified
10	Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)	F0.5	66.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Majority-voting ensemble on best 7 models	F0.5	81.4	—	Unverified
2	GRECO (voting+ESC)	F0.5	80.84	—	Unverified
3	ESC	F0.5	79.9	—	Unverified
4	RedPenNet	F0.5	77.6	—	Unverified
5	clang_large_ft2-gector	F0.5	77.1	—	Unverified
6	Unsupervised GEC + cLang8	F0.5	76.5	—	Unverified
7	DeBERTa + RoBERTa + XLNet	F0.5	76.05	—	Unverified
8	MoECE	F0.5	74.07	—	Unverified
9	Sequence tagging + token-level transformations + two-stage fine-tuning (+RoBERTa, XLNet)	F0.5	73.7	—	Unverified
10	BEA Combination	F0.5	73.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	76.75	—	Unverified
2	mT5-based multimodal MoE	F0.5	76.3	—	Unverified
3	gT5 xxl	F0.5	75.96	—	Unverified
4	Transformer	F0.5	73.71	—	Unverified
5	Transformer - synthetic pretrain only	F0.5	51.41	—	Unverified
6	Multilayer Convolutional Encoder-Decoder	F0.5	43.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VERNet	GLEU	62.1	—	Unverified
2	Transformer + Pre-train with Pseudo Data + BERT	GLEU	62	—	Unverified
3	SMT + BiGRU	GLEU	61.5	—	Unverified
4	Copy-augmented Model (4 Ensemble +Denoising Autoencoder)	GLEU	61	—	Unverified
5	Transformer	GLEU	59.9	—	Unverified
6	CNN Seq2Seq	GLEU	57.47	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	74.09	—	Unverified
2	mBART-based model with synthetic data	F0.5	68.17	—	Unverified
3	mT5 large + 10M synth	F0.5	68.09	—	Unverified
4	RedPenNet	F0.5	67.71	—	Unverified
5	ChatGPT (zero-shot)	F0.5	27.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GRECO (vote+ESC)	F0.5	85.21	—	Unverified
2	SMT + BiGRU	F0.5	72.04	—	Unverified
3	CNN Seq2Seq	F0.5	70.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN Seq2Seq + Quality Estimation	F0.5	56.52	—	Unverified
2	Transformer	F0.5	55.8	—	Unverified
3	+ BIFI with no critic	F0.5	18.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN Seq2Seq + Fluency Boost and inference	GLEU	62.37	—	Unverified
2	CNN Seq2Seq + Fluency Boost	F0.5	61.34	—	Unverified
3	+ BIFI (ours)	F0.5	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer	GLEU	59.9	—	Unverified
2	CNN Seq2Seq	GLEU	57.47	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Llama + 1M BT + gold	F0.5	69.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STG-Joint	exact match	34.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GEC-DI (LM+GED)	F0.5	48.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RedPenNet	F0.5	77.6	—	Unverified