SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 1120 of 794 papers

TitleStatusHype
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data FiltersCode1
Improving Spoken Language Identification with Map-MixCode1
Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languagesCode1
BERT-LID: Leveraging BERT to Improve Spoken Language IdentificationCode1
DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed TextCode1
An Open Dataset and Model for Language IdentificationCode1
AfroLID: A Neural Language Identification Tool for African LanguagesCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
FastSpell: the LangId Magic SpellCode1
Hyperseed: Unsupervised Learning with Vector Symbolic ArchitecturesCode1
Show:102550
← PrevPage 2 of 80Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified