SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 101-110 of 794 papers

Title | Status | Hype
Investigating model performance in language identification: beyond simple error statistics | Code | 0
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization | Code | 0
Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities | Code | 0
Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages | Code | 1
An Open Dataset and Model for Language Identification | Code | 1
Multilingual Large Language Models Are Not (Yet) Code-Switchers | | 0
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages | Code | 0
Scaling Speech Technology to 1,000+ Languages | Code | 1
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark | | 0
DocLangID: Improving Few-Shot Training to Identify the Language of Historical Documents | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified