Lexical Normalization

Lexical normalization is the task of transforming non-standard text into a standard register.

Example:

new pix comming tomoroe
new pictures coming tomorrow
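
The example above can be reproduced with a minimal lookup-based sketch. The table `NORMALIZATION_TABLE` and the function `normalize` are hypothetical names introduced only for illustration; the systems listed on this page are considerably more sophisticated, and whitespace tokenization is a simplifying assumption.

```python
# Hypothetical lookup table covering only the example sentence (an assumption,
# not a resource from any of the listed papers).
NORMALIZATION_TABLE = {
    "pix": "pictures",
    "comming": "coming",
    "tomoroe": "tomorrow",
}


def normalize(text: str) -> str:
    """Replace each whitespace-separated token that has a table entry.

    Tokens without an entry are copied unchanged, so the output keeps the
    same word order as the input.
    """
    tokens = text.split()
    return " ".join(NORMALIZATION_TABLE.get(tok, tok) for tok in tokens)


print(normalize("new pix comming tomoroe"))  # new pictures coming tomorrow
```

A plain dictionary lookup cannot handle unseen misspellings or ambiguous tokens, which is one reason this is treated as a learning task.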

Datasets usually consist of tweets, since tweets naturally contain a fair amount of non-standard language.

For lexical normalization, only word-level replacements are annotated. Some corpora include annotations for 1-N and N-1 replacements. However, word insertion, deletion, and reordering are not part of the task.
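
One way to picture these constraints is as an ordered list of aligned spans, sketched below. The `Alignment` type, the sample sentence, and the helper functions are assumptions made for illustration; actual corpora each define their own annotation format.

```python
from typing import List, Tuple

# Each pair is (source tokens, target tokens). Illustrative data only.
Alignment = List[Tuple[List[str], List[str]]]

annotated: Alignment = [
    (["im"], ["i'm"]),                 # 1-1 replacement
    (["gonna"], ["going", "to"]),      # 1-N: one source word, several target words
    (["c"], ["see"]),                  # 1-1 replacement
    (["u"], ["you"]),                  # 1-1 replacement
    (["to", "morrow"], ["tomorrow"]),  # N-1: several source words, one target word
]


def source_text(alignment: Alignment) -> str:
    """Join the source side of every span in order."""
    return " ".join(tok for src, _ in alignment for tok in src)


def target_text(alignment: Alignment) -> str:
    """Join the target side of every span in order."""
    return " ".join(tok for _, tgt in alignment for tok in tgt)


def is_valid(alignment: Alignment) -> bool:
    """A span with an empty side would be an insertion or a deletion,
    which the task excludes; reordering is ruled out because spans are
    read strictly in sequence."""
    return all(src and tgt for src, tgt in alignment)


print(source_text(annotated))  # im gonna c u to morrow
print(target_text(annotated))  # i'm going to see you tomorrow
print(is_valid(annotated))     # True
```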

Papers

Showing 41-47 of 47 papers

Title | Status | Hype
Contrastive String Representation Learning using Synthetic Data | | 0
Enhancing BERT for Lexical Normalization | | 0
Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text | | 0
IHS_RD: Lexical Normalization for English Tweets | | 0
Lexical Normalization for Code-switched Data and its Effect on POS-tagging | | 0
Lexical Normalization of User-Generated Medical Text | | 0
Multilingual Sequence Labeling Approach to solve Lexical Normalization | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MoNoise | Accuracy | 87.63 | | Unverified
2 | Syllable based | Accuracy | 86.08 | | Unverified
3 | TextNorm | Accuracy | 83.94 | | Unverified
4 | unLOL | Accuracy | 82.06 | | Unverified