SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 721730 of 794 papers

TitleStatusHype
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish0
SwissAdmin: A multilingual tagged parallel corpus of press releases0
TweetCaT: a tool for building Twitter corpora of smaller languagesCode0
The MERLIN corpus: Learner language and the CEFR0
Improving the exploitation of linguistic annotations in ELAN0
TLAXCALA: a multilingual corpus of independent news0
Finding Romanized Arabic Dialect in Code-Mixed Tweets0
Native Language Identification Using Large, Longitudinal Data0
Statistical Analysis of Multilingual Text Corpus and Development of Language Models0
GlobalPhone: Pronunciation Dictionaries in 20 Languages0
Show:102550
← PrevPage 73 of 80Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified