SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 551-575 of 794 papers

Title | Status | Hype
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification | | 0
Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses | | 0
String Kernels for Native Language Identification: Insights from Behind the Curtains | | 0
Subdialectal Differences in Sorani Kurdish | | 0
Subsegmental language detection in Celtic language text | | 0
Subword-Level Language Identification for Intra-Word Code-Switching | | 0
SU-NLP at SemEval-2020 Task 12: Offensive Language IdentifiCation in Turkish Tweets | | 0
SwissAdmin: A multilingual tagged parallel corpus of press releases | | 0
Tübingen-Oslo at SemEval-2018 Task 2: SVMs perform better than RNNs in Emoji Prediction | | 0
Tübingen-Oslo Team at the VarDial 2018 Evaluation Campaign: An Analysis of N-gram Features in Language Variety Identification | | 0
Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing | | 0
Tackling the Score Shift in Cross-Lingual Speaker Verification by Exploiting Language Information | | 0
TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge | | 0
Team Rouges at SemEval-2020 Task 12: Cross-lingual Inductive Transfer to Detect Offensive Language | | 0
TECHSSN at SemEval-2020 Task 12: Offensive Language Detection Using BERT Embeddings | | 0
TechSSN at SemEval-2022 Task 6: Intended Sarcasm Detection using Transformer Models | | 0
Text Normalization Infrastructure that Scales to Hundreds of Language Varieties | | 0
Text segmentation for Language Identification in Greek Forums | | 0
The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results | | 0
The CMU Submission for the Shared Task on Language Identification in Code-Switched Data | | 0
The French-Algerian Code-Switching Triggered audio corpus (FACST) | | 0
The futility of STILTs for the classification of lexical borrowings in Spanish | | 0
The Howard University System Submission for the Shared Task in Language Identification in Spanish-English Codeswitching | | 0
The ILSP/ARC submission to the WMT 2016 Bilingual Document Alignment Shared Task | | 0
The IUCL+ System: Word-Level Language Identification via Extended Markov Models | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified