SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 701750 of 794 papers

TitleStatusHype
Language Transfer Hypotheses with Linear SVM Weights0
Non-linear Mapping for Improved Identification of 1300+ Languages0
Exploring Syntactic Features for Native Language Identification: A Variationist Perspective on Feature Encoding and Ensemble Optimization0
Language Family Relationship Preserved in Non-native English0
A Report on the DSL Shared Task 20140
Experiments in Sentence Language Identification with Groups of Similar Languages0
Using Maximum Entropy Models to Discriminate between Similar Languages and Varieties0
The NRC System for Discriminating Similar Languages0
Exploring Methods and Resources for Discriminating Similar Languages0
Subsegmental language detection in Celtic language text0
A survey on phrase structure learning methods for text classification0
Short-Term Projects, Long-Term Benefits: Four Student NLP Projects for Low-Resource Languages0
Unsupervised Feature Learning for Visual Sign Language Identification0
DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data0
Does the Phonology of L1 Show Up in L2 Texts?0
AUTOMATIC LANGUAGE IDENTIFICATION USING DEEP NEURAL NETWORKS0
Facing the Identification Problem in Language-Related Scientific Data Analysis.0
VarClass: An Open-source Language Identification Tool for Language Varieties0
KoKo: an L1 Learner Corpus for German0
Vocabulary-Based Language Similarity using Web Corpora0
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish0
SwissAdmin: A multilingual tagged parallel corpus of press releases0
TweetCaT: a tool for building Twitter corpora of smaller languagesCode0
The MERLIN corpus: Learner language and the CEFR0
Improving the exploitation of linguistic annotations in ELAN0
TLAXCALA: a multilingual corpus of independent news0
Finding Romanized Arabic Dialect in Code-Mixed Tweets0
Native Language Identification Using Large, Longitudinal Data0
Statistical Analysis of Multilingual Text Corpus and Development of Language Models0
GlobalPhone: Pronunciation Dictionaries in 20 Languages0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
bs,hr,srWaC - Web Corpora of Bosnian, Croatian and Serbian0
Accurate Language Identification of Twitter Messages0
Bootstrapping a historical commodities lexicon with SKOS and DBpedia0
Chinese Native Language Identification0
Automatic Detection and Language Identification of Multilingual Documents0
Automatic Identification of Learners' Language Background Based on Their Writing in Czech0
Word Level Language Identification in Online Multilingual Communication0
Linguistic Profiling of Texts Across Textual Genres and Readability Levels. An Exploratory Study on Italian Fictional Prose0
Text segmentation for Language Identification in Greek Forums0
The Mysterious Letter J0
Wordnet-Based Cross-Language Identification of Semantic Relations0
Reconstructing an Indo-European Family Tree from Non-native English Texts0
Categorization of Turkish News Documents with Morphological Analysis0
Crawling microblogging services to gather language-classified URLs. Workflow and case studyCode0
NAIST at the NLI 2013 Shared Task0
NLI Shared Task 2013: MQ Submission0
Cognate and Misspelling Features for Natural Language Identification0
Native Language Identification with PPM0
Discriminating Non-Native English with 350 Words0
Show:102550
← PrevPage 15 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified