SOTAVerified

Lexical Normalization

Lexical normalization is the task of transforming non-standard text into a standard register.

Example:

new pix comming tomoroe
new pictures coming tomorrow

Datasets usually consist of tweets, since these naturally contain a fair amount of non-standard language.

For lexical normalization, only word-level replacements are annotated. Some corpora also annotate 1-N and N-1 replacements (one word split into several, or several words merged into one). However, word insertion, deletion, and reordering are not part of the task.
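The word-level setup can be illustrated with a minimal Python sketch. The `normalize` function and the tiny `lexicon` are hypothetical and exist only for illustration; real systems are far more elaborate. A 1-N replacement is stored as several words in a single slot, so the output keeps one slot per input token and nothing is inserted, deleted, or reordered. N-1 replacements are handled differently across corpora and are not shown here.

```python
# Hypothetical toy lexicon mapping non-standard tokens to their standard form.
# "gonna" -> "going to" is a 1-N replacement kept in a single slot.
lexicon = {
    "pix": "pictures",
    "comming": "coming",
    "tomoroe": "tomorrow",
    "gonna": "going to",
}


def normalize(tokens, lexicon):
    """Replace each token by its lexicon entry; unknown tokens stay unchanged."""
    return [lexicon.get(token.lower(), token) for token in tokens]


source = ["new", "pix", "comming", "tomoroe"]
predicted = normalize(source, lexicon)

# One output slot per input token: the alignment is preserved.
assert len(predicted) == len(source)
print(" ".join(predicted))  # prints "new pictures coming tomorrow"
```

Keeping one output slot per input token is what makes word-aligned annotation and evaluation straightforward.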

Papers

Showing 21-30 of 47 papers

Title | Status | Hype
Norm It! Lexical Normalization for Italian and Its Downstream Effects for Dependency Parsing | - | 0
A Clustering Framework for Lexical Normalization of Roman Urdu | Code | 0
Adapting Deep Learning for Sentiment Classification of Code-Switched Informal Short Text | Code | 0
A Multi-cascaded Deep Model for Bilingual SMS Classification | Code | 0
Enhancing BERT for Lexical Normalization | - | 0
An In-depth Analysis of the Effect of Lexical Normalization on the Dependency Parsing of Social Media | - | 0
Normalization of Indonesian-English Code-Mixed Twitter Data | - | 0
Lexical Normalization of User-Generated Medical Text | - | 0
MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization Tool | Code | 0
Adapting Sequence to Sequence models for Text Normalization in Social Media | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MoNoise | Accuracy | 87.63 | - | Unverified
2 | Syllable based | Accuracy | 86.08 | - | Unverified
3 | TextNorm | Accuracy | 83.94 | - | Unverified
4 | unLOL | Accuracy | 82.06 | - | Unverified
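
The listing does not say how Accuracy is computed. As a rough sketch, assuming it means token-level accuracy (the share of input words whose predicted normalization matches the gold annotation), it could be calculated as below. The `word_accuracy` function is hypothetical and is not taken from any of the listed systems.

```python
def word_accuracy(predicted, gold):
    """Fraction of word slots where the prediction equals the gold normalization.

    Assumes both sequences are aligned one slot per input token.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same number of slots")
    if not gold:
        return 0.0
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)


gold = ["new", "pictures", "coming", "tomorrow"]
predicted = ["new", "pictures", "coming", "tomoroe"]  # last word left unnormalized
print(word_accuracy(predicted, gold))  # prints 0.75
```

Some evaluations count only words that need normalization, or use other metrics altogether, so scores from different sources may not be directly comparable.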