SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 151200 of 794 papers

TitleStatusHype
Code Mixing: A Challenge for Language Identification in the Language of Social Media0
Code Switched and Code Mixed Speech Recognition for Indic languages0
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification0
Code-Switched Named Entity Recognition with Embedding Attention0
Codeswitching language identification using Subword Information Enriched Word Vectors0
Code-Switching Ubique Est - Language Identification and Part-of-Speech Tagging for Historical Mixed Text0
Codewithzichao@DravidianLangTech-EACL2021: Exploring Multilingual Transformers for Offensive Language Identification on Code Mixing Text0
Cognate and Misspelling Features for Natural Language Identification0
Cognitive Computing to Optimize IT Services0
CoLi at UdS at SemEval-2020 Task 12: Offensive Tweet Detection with Ensembling0
CoLI-Machine Learning Approaches for Code-mixed Language Identification at the Word Level in Kannada-English Texts0
Collecting Code-Switched Data from Social Media0
Columbia-Jadavpur submission for EMNLP 2016 Code-Switching Workshop Shared Task: System description0
Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech0
Combining Shallow and Linguistically Motivated Features in Native Language Identification0
Combining Textual and Speech Features in the NLI Task Using State-of-the-Art Machine Learning Techniques0
COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing0
ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Task at ICON-20210
DeepAnalyzer at SemEval-2019 Task 6: A deep learning-based ensemble method for identifying offensive tweets0
Comparing Approaches to Dravidian Language Identification0
Comparing Approaches to the Identification of Similar Languages0
Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition0
Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties0
Computational Approaches to Arabic-English Code-Switching0
Computationally efficient discrimination between language varieties with large feature vectors and regularized classifiers0
Confidence-based Ensembles of End-to-End Speech Recognition Models0
ConvAI at SemEval-2019 Task 6: Offensive Language Identification and Categorization with Perspective and BERT0
Coreference Resolution in FreeLing 4.00
Corpora of social media in minority Uralic languages0
Corpus Creation and Language Identification in Low-Resource Code-Mixed Telugu-English Text0
CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages0
Challenges of Computational Processing of Code-Switching0
Automatic Detection of Code-switching Style from Acoustics0
Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages0
Cross-Corpora Spoken Language Identification with Domain Diversification and Generalization0
Cross-corpus Native Language Identification via Statistical Embedding0
Automatic Detection of Sentence Fragments0
Cross-domain Feature Selection for Language Identification0
Cross-lingual Inductive Transfer to Detect Offensive Language0
An Exploratory Analysis of the Relation Between Offensive Language and Mental Health0
Challenges in Neural Language Identification: NRC at VarDial 20200
cs@DravidianLangTech-EACL2021: Offensive Language Identification Based On Multilingual BERT Model0
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages0
Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks0
A Report on the VarDial Evaluation Campaign 20200
CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020:Identifying Offensive Language from ManglishTweets0
Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach0
Data Filtering using Cross-Lingual Word Embeddings0
DCU-UVT: Word-Level Language Classification with Code-Mixed Data0
Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language0
Show:102550
← PrevPage 4 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified