SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 51100 of 794 papers

TitleStatusHype
A Federated Learning Approach to Privacy Preserving Offensive Language Identification0
A Dataset and Classifier for Recognizing Social Media English0
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification0
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective0
An Unsupervised Morphological Criterion for Discriminating Similar Languages0
A Perplexity-Based Method for Similar Languages Discrimination0
A Portuguese Native Language Identification Dataset0
SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification0
A Pre-trained Transformer and CNN Model with Joint Language ID and Part-of-Speech Tagging for Code-Mixed Social-Media Text0
Arabic Dialect Identification in the Context of Bivalency and Code-Switching0
Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts0
Arabic Native Language Identification0
Automatic Identification of Learners' Language Background Based on Their Writing in Czech0
Automatic discovery of Latin syntactic changes0
Anglicized Words and Misspelled Cognates in Native Language Identification0
Automatic Spoken Language Identification Utilizing Acoustic and Phonetic Speech Information0
Automatic Spoken Language Identification using a Time-Delay Neural Network0
Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search0
A Fast, Compact, Accurate Model for Language Identification of Codemixed Text0
A Neural Model for Language Identification in Code-Switched Tweets0
Adaptation de domaine non supervis\'ee pour la reconnaissance de la langue par r\'egularisation d'un r\'eseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network)0
An Attention Based Neural Network for Code Switching Detection: English & Roman Urdu0
An Assessment of Language Identification Methods on Tweets and Wikipedia Articles0
Adversarial Training for Multilingual Acoustic Modeling0
Automatic Language Identification System for Hindi and Magahi0
Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance0
Analysis of Named Entity Recognition and Linking for Tweets0
Adversarial synthesis based data-augmentation for code-switched spoken language identification0
A Turkish-German Code-Switching Corpus0
A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets0
Active learning and negative evidence for language identification0
Attention-Guided Adaptation for Code-Switching Speech Recognition0
A Twitter BERT Approach for Offensive Language Detection in Marathi0
A Two-level Classifier for Discriminating Similar Languages0
Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition0
Automated essay scoring with string kernels and word embeddings0
Automated speech tools for helping communities process restricted-access corpora for language revival efforts0
Automatic Classification of Spoken Languages using Diverse Acoustic Features0
Automatic Detection and Language Identification of Multilingual Documents0
Automatic Detection of Arabicized Berber and Arabic Varieties0
Automatic Detection of Code-switching Style from Acoustics0
Automatic Detection of Intra-Word Code-Switching0
Automatic Detection of Sentence Fragments0
An Exploratory Analysis of the Relation Between Offensive Language and Mental Health0
A Text-to-Text Model for Multilingual Offensive Language Identification0
Automatic Identification of Closely-related Indian Languages: Resources and Experiments0
Amrita_CEN_NLP@DravidianLangTech-EACL2021: Deep Learning-based Offensive Language Identification in Malayalam, Tamil and Kannada0
Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach0
Automatic language identification0
Accurate Language Identification of Twitter Messages0
Show:102550
← PrevPage 2 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified