SOTAVerified

Lexical Normalization

Lexical normalization is the task of translating/transforming a non standard text to a standard register.

Example:

new pix comming tomoroe
new pictures coming tomorrow

Datasets usually consists of tweets, since these naturally contain a fair amount of these phenomena.

For lexical normalization, only replacements on the word-level are annotated. Some corpora include annotation for 1-N and N-1 replacements. However, word insertion/deletion and reordering is not part of the task.

Papers

Showing 4147 of 47 papers

TitleStatusHype
Automatic Textual Normalization for Hate Speech DetectionCode0
Modeling Input Uncertainty in Neural Network Dependency ParsingCode0
MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization ToolCode0
MoNoise: Modeling Noise Using a Modular Normalization SystemCode0
DaN+: Danish Nested Named Entities and Lexical NormalizationCode0
MultiLexNorm: A Shared Task on Multilingual Lexical NormalizationCode0
User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical NormalizationCode0
Show:102550
← PrevPage 5 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MoNoiseAccuracy87.63Unverified
2Syllable basedAccuracy86.08Unverified
3TextNormAccuracy83.94Unverified
4unLOLAccuracy82.06Unverified