SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.
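
As a rough illustration of the task, here is a minimal sketch that compares character trigram counts by cosine similarity. It is not the method of any paper listed on this page. The toy training sentences and the names `char_ngrams`, `train`, `cosine`, and `identify` are assumptions made up for this example, and a real system would need far more training text per language.

```python
from collections import Counter
import math

# Toy training text per language (hypothetical; real systems use large corpora).
TOY_SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and the cat",
    "de": "der schnelle braune fuchs springt über den faulen hund und die katze",
    "fr": "le renard brun rapide saute par dessus le chien paresseux et le chat",
}


def char_ngrams(text, n=3):
    """Return the character n-grams of a lowercased, space-padded text."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def train(samples):
    """Build one n-gram count profile per language."""
    return {lang: Counter(char_ngrams(text)) for lang, text in samples.items()}


def cosine(a, b):
    """Cosine similarity between two n-gram count profiles."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def identify(text, profiles):
    """Return the language whose profile is most similar to the text."""
    query = Counter(char_ngrams(text))
    return max(profiles, key=lambda lang: cosine(query, profiles[lang]))


profiles = train(TOY_SAMPLES)
print(identify("the dog and the cat", profiles))
```

With so little training text the sketch only separates inputs that share many trigrams with one of the toy sentences; it says nothing about closely related languages or dialects, which several of the papers below address.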

Papers

Showing 551–600 of 794 papers

| Title | Status | Hype |
| --- | --- | --- |
| Fewer features perform well at Native Language Identification task | | 0 |
| Combining Textual and Speech Features in the NLI Task Using State-of-the-Art Machine Learning Techniques | | 0 |
| Stacked Sentence-Document Classifier Approach for Improving Native Language Identification | | 0 |
| A Dataset and Classifier for Recognizing Social Media English | | 0 |
| A Report on the 2017 Native Language Identification Shared Task | | 0 |
| A Shallow Neural Network for Native Language Identification with Character N-grams | | 0 |
| A study of N-gram and Embedding Representations for Native Language Identification | Code | 0 |
| Classifier Stacking for Native Language Identification | | 0 |
| Ensemble Methods for Native Language Identification | | 0 |
| Exploring Optimal Voting in Native Language Identification | | 0 |
| Fusion of Simple Models for Native Language Identification | | 0 |
| All that is English may be Hindi: Enhancing language identification through automatic ranking of the likeliness of word borrowing in social media | | 0 |
| Vector Space Model as Cognitive Space for Text Classification | | 0 |
| Language Identification Using Deep Convolutional Recurrent Neural Networks | Code | 0 |
| Lump at SemEval-2017 Task 1: Towards an Interlingua Semantic Similarity | | 0 |
| Can string kernels pass the test of time in Native Language Identification? | | 0 |
| All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media | | 0 |
| Native Language Identification on Text and Speech | | 0 |
| Open-Set Language Identification | | 0 |
| Feature Hashing for Language and Dialect Identification | | 0 |
| Improving Native Language Identification by Using Spelling Errors | | 0 |
| Incorporating Dialectal Variability for Socially Equitable Language Identification | | 0 |
| Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English | | 0 |
| Phone-aware Neural Language Identification | | 0 |
| Phonetic Temporal Neural Model for Language Identification | | 0 |
| Machine Learning for Rhetorical Figure Detection: More Chiasmus with Less Annotation | | 0 |
| Evaluation of language identification methods using 285 languages | | 0 |
| Learning with learner corpora: Using the TLE for native language identification | | 0 |
| Joint UD Parsing of Norwegian Bokmål and Nynorsk | Code | 0 |
| Identification of Languages in Algerian Arabic Multilingual Documents | | 0 |
| CLUZH at VarDial GDI 2017: Testing a Variety of Machine Learning Tools for the Classification of Swiss German Dialects | | 0 |
| Improving the Character Ngram Model for the DSL Task with BM25 Weighting and Less Frequently Used Feature Sets | | 0 |
| Exploring Lexical and Syntactic Features for Language Variety Identification | | 0 |
| Discriminating between Similar Languages using Weighted Subword Features | Code | 0 |
| When Sparse Traditional Models Outperform Dense Neural Networks: the Curious Case of Discriminating between Similar Languages | | 0 |
| Twitter Language Identification Of Similar Languages And Dialects Without Ground Truth | | 0 |
| Discriminating between Similar Languages with Word-level Convolutional Neural Networks | | 0 |
| Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing | | 0 |
| A Code-Switching Corpus of Turkish-German Conversations | | 0 |
| A Perplexity-Based Method for Similar Languages Discrimination | | 0 |
| Findings of the VarDial Evaluation Campaign 2017 | | 0 |
| Evaluating HeLI with Non-Linear Mappings | | 0 |
| URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors | | 0 |
| Native Language Identification using Stacked Generalization | | 0 |
| Machine Learning Based Source Code Classification Using Syntax Oriented Features | | 0 |
| LIDE: Language Identification from Text Documents | | 0 |
| LanideNN: Multilingual Language Identification on Character Window | Code | 0 |
| Translationese: Between Human and Machine Translation | | 0 |
| Advancing Linguistic Features and Insights by Label-informed Feature Grouping: An Exploration in the Context of Native Language Identification | | 0 |
| LILI: A Simple Language Independent Approach for Language Identification | | 0 |

Benchmark Results

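| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified |
| 2 | XLS-R | Error rate | 5.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GlotLID | Macro F1 | 0.98 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | FastText | Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ConformerG-P | Accuracy | 99.8 | | Unverified |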