SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 551600 of 794 papers

TitleStatusHype
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification0
Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses0
String Kernels for Native Language Identification: Insights from Behind the Curtains0
Subdialectal Differences in Sorani Kurdish0
Subsegmental language detection in Celtic language text0
Subword-Level Language Identification for Intra-Word Code-Switching0
SU-NLP at SemEval-2020 Task 12: Offensive Language IdentifiCation in Turkish Tweets0
SwissAdmin: A multilingual tagged parallel corpus of press releases0
T\"ubingen-Oslo at SemEval-2018 Task 2: SVMs perform better than RNNs in Emoji Prediction0
T\"ubingen-Oslo Team at the VarDial 2018 Evaluation Campaign: An Analysis of N-gram Features in Language Variety Identification0
T\"ubingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing0
Tackling the Score Shift in Cross-Lingual Speaker Verification by Exploiting Language Information0
TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge0
Team Rouges at SemEval-2020 Task 12: Cross-lingual Inductive Transfer to Detect Offensive Language0
TECHSSN at SemEval-2020 Task 12: Offensive Language Detection Using BERT Embeddings0
TechSSN at SemEval-2022 Task 6: Intended Sarcasm Detection using Transformer Models0
Text Normalization Infrastructure that Scales to Hundreds of Language Varieties0
Text segmentation for Language Identification in Greek Forums0
The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results0
The CMU Submission for the Shared Task on Language Identification in Code-Switched Data0
The French-Algerian Code-Switching Triggered audio corpus (FACST)0
The futility of STILTs for the classification of lexical borrowings in Spanish0
The Howard University System Submission for the Shared Task in Language Identification in Spanish-English Codeswitching0
The ILSP/ARC submission to the WMT 2016 Bilingual Document Alignment Shared Task0
The IUCL+ System: Word-Level Language Identification via Extended Markov Models0
The Jinan Chinese Learner Corpus0
The MERLIN corpus: Learner language and the CEFR0
The Mysterious Letter J0
The NRC System for Discriminating Similar Languages0
The Obscure Limitation of Modular Multilingual Language Models0
The Power of Character N-grams in Native Language Identification0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
The Role of Emotions in Native Language Identification0
The Story of the Characters, the DNA and the Native Language0
The Titans at SemEval-2019 Task 6: Offensive Language Identification, Categorization and Target Identification0
TLAXCALA: a multilingual corpus of independent news0
Token Masking Improves Transformer-Based Text Classification0
Towards a Common Speech Analysis Engine0
Towards End-to-End Code-Switching Speech Recognition0
Towards Generalized Offensive Language Identification0
Towards Language Technology for Mi'kmaq0
Towards Relevance and Sequence Modeling in Language Recognition0
Towards spoken dialect identification of Irish0
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer0
Transducer-based language embedding for spoken language identification0
Transductive Learning with String Kernels for Cross-Domain Text Classification0
Transformer-based Model for Word Level Language Identification in Code-mixed Kannada-English Texts0
Translated Texts Under the Lens: From Machine Translation Detection to Source Language Identification0
Translationese: Between Human and Machine Translation0
Transliteration Better than Translation? Answering Code-mixed Questions over a Knowledge Base0
Show:102550
← PrevPage 12 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified