SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 651-700 of 794 papers

Title | Status | Hype
Wordnet-Based Cross-Language Identification of Semantic Relations | | 0
XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language Identification in Social Media Using Transformer Encoders | | 0
Yet Another Language Identifier | | 0
"ye word kis lang ka hai bhai?" Testing the Limits of Word level Language Identification | | 0
mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks | | 0
ZYJ123@DravidianLangTech-EACL2021: Offensive Language Identification based on XLM-RoBERTa with DPCNN | | 0
Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world? | | 0
Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models | | 0
Accurate Language Identification of Twitter Messages | | 0
Accurate Pinyin-English Codeswitched Language Identification | | 0
A Code-Switching Corpus of Turkish-German Conversations | | 0
A Comparison of Character Neural Language Model and Bootstrapping for Language Identification in Multilingual Noisy Texts | | 0
Acoustic characterization of speech rhythm: going beyond metrics with recurrent neural networks | | 0
Active learning and negative evidence for language identification | | 0
Adaptation de domaine non supervisée pour la reconnaissance de la langue par régularisation d'un réseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network) | | 0
A Dataset and Classifier for Recognizing Social Media English | | 0
Addition of Code Mixed Features to Enhance the Sentiment Prediction of Song Lyrics | | 0
A Deep Generative Approach to Native Language Identification | | 0
A deep-learning based native-language classification by using a latent semantic analysis for the NLI Shared Task 2017 | | 0
A Dual-Decoder Conformer for Multilingual Speech Recognition | | 0
Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition | | 0
Advancing Linguistic Features and Insights by Label-informed Feature Grouping: An Exploration in the Context of Native Language Identification | | 0
Adversarial synthesis based data-augmentation for code-switched spoken language identification | | 0
Adversarial Training for Multilingual Acoustic Modeling | | 0
A Fast, Compact, Accurate Model for Language Identification of Codemixed Text | | 0
A Federated Learning Approach to Privacy Preserving Offensive Language Identification | | 0
A language model based approach towards large scale and lightweight language identification systems | | 0
SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification | | 0
Albanian Language Identification in Text Documents | | 0
AlexU-BackTranslation-TL at SemEval-2020 Task 12: Improving Offensive Language Detection Using Data Augmentation and Transfer Learning | | 0
Alibaba Submission to the WMT20 Parallel Corpus Filtering Task | | 0
Aligning Speech to Languages to Enhance Code-switching Speech Recognition | | 0
All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media | | 0
All that is English may be Hindi: Enhancing language identification through automatic ranking of the likeliness of word borrowing in social media | | 0
ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media | | 0
A Mandarin-English Code-Switching Corpus | | 0
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification | | 0
American Sign Language Identification Using Hand Trackpoint Analysis | | 0
Amrita_CEN_NLP@DravidianLangTech-EACL2021: Deep Learning-based Offensive Language Identification in Malayalam, Tamil and Kannada | | 0
A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets | | 0
Analysis of Named Entity Recognition and Linking for Tweets | | 0
Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance | | 0
An Assessment of Language Identification Methods on Tweets and Wikipedia Articles | | 0
An Attention Based Neural Network for Code Switching Detection: English & Roman Urdu | | 0
A Neural Model for Language Identification in Code-Switched Tweets | | 0
An Exploratory Analysis of the Relation Between Offensive Language and Mental Health | | 0
Anglicized Words and Misspelled Cognates in Native Language Identification | | 0
Anlirika: An LSTM–CNN Flow Twister for Spoken Language Identification | | 0
Annotation Efficient Language Identification from Weak Labels | | 0
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified