SOTAVerified

To Err Is Human, but Llamas Can Learn It Too

2024-03-08 · Code Available

Agnes Luhtaru, Taido Purason, Martin Vainikko, Maksym Del, Mark Fishel

Abstract

This study explores enhancing grammatical error correction (GEC) through artificial error generation (AEG) with language models (LMs). Specifically, we fine-tune Llama 2-based LMs for error generation and find that this approach yields synthetic errors akin to human ones. Next, we train GEC Llama models with the help of these artificial errors and outperform previous state-of-the-art error correction models, with gains ranging between 0.8 and 6 F0.5 points across all tested languages (German, Ukrainian, and Estonian). Moreover, we demonstrate that generating errors by fine-tuning smaller sequence-to-sequence models and by prompting large commercial LMs (GPT-3.5 and GPT-4) also produces synthetic errors that beneficially affect error correction models.
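The AEG idea above boils down to producing (erroneous source, clean target) pairs for GEC training. The paper does this by fine-tuning Llama 2-based models to generate human-like errors; the sketch below only illustrates the resulting data format with a toy rule-based noiser (word drops, duplications, and adjacent swaps), which is an assumption for illustration and not the paper's method.

```python
import random

def inject_errors(sentence, error_rate=0.3, seed=0):
    """Toy artificial-error generator: corrupt a clean sentence by
    randomly dropping, duplicating, or swapping adjacent words.
    (The paper instead fine-tunes LMs to produce human-like errors;
    this rule-based noiser only illustrates the data format.)"""
    rng = random.Random(seed)
    words = sentence.split()
    out, i = [], 0
    while i < len(words):
        r = rng.random()
        if r < error_rate / 3:
            i += 1                              # drop this word
            continue
        if r < 2 * error_rate / 3:
            out.extend([words[i], words[i]])    # duplicate it
        elif r < error_rate and i + 1 < len(words):
            out.extend([words[i + 1], words[i]])  # swap with next word
            i += 1
        else:
            out.append(words[i])                # keep it unchanged
        i += 1
    return " ".join(out)

def make_training_pairs(clean_sentences, seed=0):
    """Build GEC training pairs: (synthetic erroneous source, gold target)."""
    return [(inject_errors(s, seed=seed + k), s)
            for k, s in enumerate(clean_sentences)]
```

A GEC model is then trained to map the corrupted source back to the clean target, exactly as if the pairs came from human learner corpora.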

Benchmark Results

Dataset | Model | Metric | Claimed | Verified | Status
EstGEC-L2 | Llama + 1M BT + gold | F0.5 | 69.97 | — | Unverified
Falko-MERLIN | Llama + 1M BT + gold | F0.5 | 76.75 | — | Unverified
UA-GEC | Llama + 1M BT + gold | F0.5 | 74.09 | — | Unverified

Reproductions