SOTAVerified

Grammatical Error Correction

Grammatical Error Correction (GEC) is the task of correcting different kinds of errors in text such as spelling, punctuation, grammatical, and word choice errors.

GEC is typically formulated as a sentence correction task. A GEC system takes a potentially erroneous sentence as input and is expected to transform it to its corrected version. See the example given below:

| Input (Erroneous) | Output (Corrected) | | ------------------------- | ---------------------- | |She see Tom is catched by policeman in park at last night. | She saw Tom caught by a policeman in the park last night.|

Papers

Showing 251300 of 415 papers

TitleStatusHype
The Effect of Error Rate in Artificially Generated Data for Automatic Preposition and Determiner Correction0
The Effect of Learner Corpus Size in Grammatical Error Correction of ESL Writings0
The Illinois-Columbia System in the CoNLL-2014 Shared Task0
The LAIX Systems in the BEA-2019 GEC Shared Task0
The Unbearable Weight of Generating Artificial Errors for Grammatical Error Correction0
The Write & Improve Corpus 2024: Error-annotated and CEFR-labelled essays by learners of English0
Tibyan Corpus: Balanced and Comprehensive Error Coverage Corpus Using ChatGPT for Arabic Grammatical Error Correction0
TMU-NLP System Using BERT-based Pre-trained Model to the NLP-TEA CGED Shared Task 20200
TMU Transformer System Using BERT for Re-ranking at BEA 2019 Grammatical Error Correction on Restricted Track0
Toward More Precision in Correction of Grammatical Errors0
Towards End-to-End Spoken Grammatical Error Correction0
Towards Minimal Supervision BERT-based Grammar Error Correction0
Towards Universal Dependencies for Learner Chinese0
Towards Unsupervised Grammatical Error Correction using Statistical Machine Translation with Synthetic Comparable Corpus0
Treelet Probabilities for HPSG Parsing and Error Correction0
Tuning a Grammar Correction System for Increased Precision0
UdS at CoNLL 2013 Shared Task0
UM-Checker: A Hybrid System for English Grammatical Error Correction0
Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence Models0
Ungrammatical-syntax-based In-context Example Selection for Grammatical Error Correction0
Using Context in Neural Machine Translation Training Objectives0
UW-Stanford System Description for AESW 2016 Shared Task on Grammatical Error Detection0
Weakly Supervised Grammatical Error Correction using Iterative Decoding0
LFG-based Features for Noun Number and Article Grammatical Errors0
A BERT-based Unsupervised Grammatical Error Correction Framework0
A Chinese Writing Correction System for Learning Chinese as a Foreign Language0
A Comparative Study of Synthetic Data Generation Methods for Grammatical Error Correction0
A Crash Course in Automatic Grammatical Error Correction0
Adapting Grammatical Error Correction Based on the Native Language of Writers with Neural Network Joint Models0
Addressing Class Imbalance in Grammatical Error Detection with Evaluation Metric Optimization0
Adversarial Grammatical Error Correction0
A Hybrid Model For Grammatical Error Correction0
A Hybrid System for Chinese Grammatical Error Diagnosis and Correction0
A Language Model for Grammatical Error Correction in L2 Russian0
(Almost) Unsupervised Grammatical Error Correction using Synthetic Comparable Corpus0
A Meta Learning Approach to Grammatical Error Correction0
Analyzing the Impact of Spelling Errors on POS-Tagging and Chunking in Learner English0
Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error Correction0
An Annotated Corpus of Picture Stories Retold by Language Learners0
An Automatic Error Tagger for German0
A Nested Attention Neural Hybrid Model for Grammatical Error Correction0
A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammar Error Correction0
A Pipeline Approach to Supervised Error Correction for the QALB-2014 Shared Task0
A POS Tagging Model Adapted to Learner English0
A Report on the 2017 Native Language Identification Shared Task0
A Report on the Automatic Evaluation of Scientific Writing Shared Task0
Artificial Error Generation with Fluency Filtering0
A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction0
A Simple but Effective Classification Model for Grammatical Error Correction0
A Syntax-Guided Grammatical Error Correction Model with Dependency Tree Correction0
Show:102550
← PrevPage 6 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Ensembles of best 7 models + GRECO + GTP-rerankF0.572.8Unverified
2Majority-voting ensemble on best 7 modelsF0.571.8Unverified
3GRECO (voting+ESC)F0.571.12Unverified
4GEC-DI (LM+GED)F0.569.6Unverified
5Unsupervised GEC + cLang8F0.569.6Unverified
6ESCF0.569.51Unverified
7T5F0.568.87Unverified
8MoECEF0.567.79Unverified
9SynGECF0.567.6Unverified
10Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)F0.566.5Unverified
#ModelMetricClaimedVerifiedStatus
1Majority-voting ensemble on best 7 modelsF0.581.4Unverified
2GRECO (voting+ESC)F0.580.84Unverified
3ESCF0.579.9Unverified
4RedPenNetF0.577.6Unverified
5clang_large_ft2-gectorF0.577.1Unverified
6Unsupervised GEC + cLang8F0.576.5Unverified
7DeBERTa + RoBERTa + XLNetF0.576.05Unverified
8MoECEF0.574.07Unverified
9Sequence tagging + token-level transformations + two-stage fine-tuning (+RoBERTa, XLNet)F0.573.7Unverified
10BEA CombinationF0.573.2Unverified
#ModelMetricClaimedVerifiedStatus
1Llama + 1M BT + goldF0.576.75Unverified
2mT5-based multimodal MoEF0.576.3Unverified
3gT5 xxlF0.575.96Unverified
4TransformerF0.573.71Unverified
5Transformer - synthetic pretrain onlyF0.551.41Unverified
6Multilayer Convolutional Encoder-DecoderF0.543.35Unverified
#ModelMetricClaimedVerifiedStatus
1VERNetGLEU62.1Unverified
2Transformer + Pre-train with Pseudo Data + BERTGLEU62Unverified
3SMT + BiGRUGLEU61.5Unverified
4Copy-augmented Model (4 Ensemble +Denoising Autoencoder)GLEU61Unverified
5TransformerGLEU59.9Unverified
6CNN Seq2SeqGLEU57.47Unverified
#ModelMetricClaimedVerifiedStatus
1Llama + 1M BT + goldF0.574.09Unverified
2mBART-based model with synthetic dataF0.568.17Unverified
3mT5 large + 10M synthF0.568.09Unverified
4RedPenNetF0.567.71Unverified
5ChatGPT (zero-shot)F0.527.4Unverified
#ModelMetricClaimedVerifiedStatus
1GRECO (vote+ESC)F0.585.21Unverified
2SMT + BiGRUF0.572.04Unverified
3CNN Seq2SeqF0.570.14Unverified
#ModelMetricClaimedVerifiedStatus
1CNN Seq2Seq + Quality EstimationF0.556.52Unverified
2TransformerF0.555.8Unverified
3+ BIFI with no criticF0.518.7Unverified
#ModelMetricClaimedVerifiedStatus
1CNN Seq2Seq + Fluency Boost and inferenceGLEU62.37Unverified
2CNN Seq2Seq + Fluency BoostF0.561.34Unverified
3+ BIFI (ours)F0.542.4Unverified
#ModelMetricClaimedVerifiedStatus
1TransformerGLEU59.9Unverified
2CNN Seq2SeqGLEU57.47Unverified
#ModelMetricClaimedVerifiedStatus
1Llama + 1M BT + goldF0.569.97Unverified
#ModelMetricClaimedVerifiedStatus
1STG-Jointexact match34.1Unverified
#ModelMetricClaimedVerifiedStatus
1GEC-DI (LM+GED)F0.548.61Unverified
#ModelMetricClaimedVerifiedStatus
1RedPenNetF0.577.6Unverified