SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 151200 of 794 papers

TitleStatusHype
AlexU-BackTranslation-TL at SemEval-2020 Task 12: Improving Offensive Language Detection Using Data Augmentation and Transfer Learning0
BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media0
Bootstrapping a historical commodities lexicon with SKOS and DBpedia0
Arabic Dialect Identification in the Context of Bivalency and Code-Switching0
BNU-HKBU UIC NLP Team 2 at SemEval-2019 Task 6: Detecting Offensive Language Using BERT model0
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss0
A Pre-trained Transformer and CNN Model with Joint Language ID and Part-of-Speech Tagging for Code-Mixed Social-Media Text0
Albanian Language Identification in Text Documents0
A deep-learning based native-language classification by using a latent semantic analysis for the NLI Shared Task 20170
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition0
BhamNLP at SemEval-2020 Task 12: An Ensemble of Different Word Embeddings and Emotion Transfer Learning for Arabic Offensive Language Identification in Social Media0
BFCAI at ComMA@ICON 2021: Support Vector Machines for Multilingual Gender Biased and Communal Language Identification0
A Portuguese Native Language Identification Dataset0
SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification0
Beware Haters at ComMA@ICON: Sequence and Ensemble Classifiers for Aggression, Gender Bias and Communal Bias Identification in Indian Languages0
A Perplexity-Based Method for Similar Languages Discrimination0
BERT-based Multi-Task Model for Country and Province Level MSA and Dialectal Arabic Identification0
BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification0
An Unsupervised Morphological Criterion for Discriminating Similar Languages0
A language model based approach towards large scale and lightweight language identification systems0
A Deep Generative Approach to Native Language Identification0
A Code-Switching Corpus of Turkish-German Conversations0
Beefmoves: Dissemination, Diversity, and Dynamics of English Borrowings in a German Hip Hop Forum0
Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search0
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective0
Automatic Token and Turn Level Language Identification for Code-Switched Text Dialog: An Analysis Across Language Pairs and Corpora0
Automatic Spoken Language Identification using a Time-Delay Neural Network0
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification0
Automatic Spoken Language Identification Utilizing Acoustic and Phonetic Speech Information0
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish0
AUTOMATIC LANGUAGE IDENTIFICATION USING DEEP NEURAL NETWORKS0
Automatic Language Identification System for Hindi and Magahi0
Annotation Efficient Language Identification from Weak Labels0
Addition of Code Mixed Features to Enhance the Sentiment Prediction of Song Lyrics0
Automatic Language Identification for Romance Languages using Stop Words and Diacritics0
Automatic Language Identification for Celtic Texts0
DCU-UVT: Word-Level Language Classification with Code-Mixed Data0
Automatic language identification0
Anlirika: An LSTM–CNN Flow Twister for Spoken Language Identification0
Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach0
CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020:Identifying Offensive Language from ManglishTweets0
CUSATNLP@DravidianLangTech-EACL2021:Language Agnostic Classification of Offensive Content in Tweets0
Automatic Identification of Learners' Language Background Based on Their Writing in Czech0
Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks0
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages0
Automatic Identification of Closely-related Indian Languages: Resources and Experiments0
cs@DravidianLangTech-EACL2021: Offensive Language Identification Based On Multilingual BERT Model0
Data Filtering using Cross-Lingual Word Embeddings0
Cross-Linguistic Offensive Language Detection: BERT-Based Analysis of Bengali, Assamese, & Bodo Conversational Hateful Content from Social Media0
Automatic discovery of Latin syntactic changes0
Show:102550
← PrevPage 4 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified