SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 251300 of 794 papers

TitleStatusHype
Ensemble Methods for Native Language Identification0
Evaluating HeLI with Non-Linear Mappings0
Evaluating Input Representation for Language Identification in Hindi-English Code Mixed Text0
Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance0
Cross-Corpora Spoken Language Identification with Domain Diversification and Generalization0
Automatic Detection of Intra-Word Code-Switching0
Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages0
Automatic Detection of Code-switching Style from Acoustics0
A Neural Model for Language Identification in Code-Switched Tweets0
A Fast, Compact, Accurate Model for Language Identification of Codemixed Text0
CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages0
Automatic Detection of Arabicized Berber and Arabic Varieties0
Corpus Creation and Language Identification in Low-Resource Code-Mixed Telugu-English Text0
Corpora of social media in minority Uralic languages0
Automatic Detection and Language Identification of Multilingual Documents0
An Attention Based Neural Network for Code Switching Detection: English & Roman Urdu0
Coreference Resolution in FreeLing 4.00
ConvAI at SemEval-2019 Task 6: Offensive Language Identification and Categorization with Perspective and BERT0
Automatic Classification of Spoken Languages using Diverse Acoustic Features0
Confidence-based Ensembles of End-to-End Speech Recognition Models0
Computationally efficient discrimination between language varieties with large feature vectors and regularized classifiers0
Automated speech tools for helping communities process restricted-access corpora for language revival efforts0
An Assessment of Language Identification Methods on Tweets and Wikipedia Articles0
Adversarial Training for Multilingual Acoustic Modeling0
Adaptation de domaine non supervis\'ee pour la reconnaissance de la langue par r\'egularisation d'un r\'eseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network)0
Computational Approaches to Arabic-English Code-Switching0
Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties0
Automated essay scoring with string kernels and word embeddings0
Comparing Approaches to the Identification of Similar Languages0
Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition0
Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance0
Comparing Approaches to Dravidian Language Identification0
A Two-level Classifier for Discriminating Similar Languages0
ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Task at ICON-20210
COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing0
A Twitter BERT Approach for Offensive Language Detection in Marathi0
Analysis of Named Entity Recognition and Linking for Tweets0
Adversarial synthesis based data-augmentation for code-switched spoken language identification0
Combining Textual and Speech Features in the NLI Task Using State-of-the-Art Machine Learning Techniques0
Combining Shallow and Linguistically Motivated Features in Native Language Identification0
A Turkish-German Code-Switching Corpus0
Columbia-Jadavpur submission for EMNLP 2016 Code-Switching Workshop Shared Task: System description0
A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets0
Collecting Code-Switched Data from Social Media0
CoLI-Machine Learning Approaches for Code-mixed Language Identification at the Word Level in Kannada-English Texts0
Attention-Guided Adaptation for Code-Switching Speech Recognition0
CoLi at UdS at SemEval-2020 Task 12: Offensive Tweet Detection with Ensembling0
Cognitive Computing to Optimize IT Services0
A Text-to-Text Model for Multilingual Offensive Language Identification0
Amrita_CEN_NLP@DravidianLangTech-EACL2021: Deep Learning-based Offensive Language Identification in Malayalam, Tamil and Kannada0
Show:102550
← PrevPage 6 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified