SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 101125 of 794 papers

TitleStatusHype
AfriHuBERT: A self-supervised speech representation model for African languagesCode0
Automatic Language Identification in Texts: A SurveyCode0
Enhance Language Identification using Dual-mode Model with Knowledge DistillationCode0
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarizationCode0
From N-grams to Pre-trained Multilingual Models For Language IdentificationCode0
Multi-label Scandinavian Language Identification (SLIDE)Code0
Aggressive Language Identification Using Word Embeddings and Sentiment FeaturesCode0
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation BenchmarksCode0
Discriminating Between Similar Nordic LanguagesCode0
Offensive Language Identification in GreekCode0
On the Language Neutrality of Pre-trained Multilingual RepresentationsCode0
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuningCode0
Discriminating between Similar Languages using Weighted Subword FeaturesCode0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesCode0
Predicting the Type and Target of Offensive Social Media Posts in MarathiCode0
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approachesCode0
Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of MarathiCode0
Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic LanguagesCode0
CyberTronics at SemEval-2020 Task 12: Multilingual Offensive Language Identification over Social MediaCode0
Short Text Language Identification for Under Resourced LanguagesCode0
DocLangID: Improving Few-Shot Training to Identify the Language of Historical DocumentsCode0
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and PostsCode0
Comparing the Performance of CNNs and Shallow Models for Language IdentificationCode0
Word-level Embeddings for Cross-Task Transfer Learning in Speech ProcessingCode0
Code-Switched Language Identification is Harder Than You ThinkCode0
Show:102550
← PrevPage 5 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified