SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 110 of 794 papers

TitleStatusHype
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-trainingCode11
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource ScenariosCode2
MathPile: A Billion-Token-Scale Pretraining Corpus for MathCode2
TweetNLP: Cutting-Edge Natural Language Processing for Social MediaCode2
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority LanguagesCode1
Language-Informed Beam Search Decoding for Multilingual Machine TranslationCode1
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and BeyondCode1
MaskLID: Code-Switching Language Identification through Iterative MaskingCode1
FastSpell: the LangId Magic SpellCode1
Language and Speech Technology for Central Kurdish VarietiesCode1
Show:102550
← PrevPage 1 of 80Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified