SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 701750 of 794 papers

TitleStatusHype
Incremental N-gram Approach for Language Identification in Code-Switched Text0
Language Identification in Code-Switching Scenario0
Language Family Relationship Preserved in Non-native English0
A Report on the DSL Shared Task 20140
Exploring Syntactic Features for Native Language Identification: A Variationist Perspective on Feature Encoding and Ensemble Optimization0
The NRC System for Discriminating Similar Languages0
Exploring Methods and Resources for Discriminating Similar Languages0
Experiments in Sentence Language Identification with Groups of Similar Languages0
Subsegmental language detection in Celtic language text0
Using Maximum Entropy Models to Discriminate between Similar Languages and Varieties0
A survey on phrase structure learning methods for text classification0
DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data0
Does the Phonology of L1 Show Up in L2 Texts?0
Short-Term Projects, Long-Term Benefits: Four Student NLP Projects for Low-Resource Languages0
Unsupervised Feature Learning for Visual Sign Language Identification0
AUTOMATIC LANGUAGE IDENTIFICATION USING DEEP NEURAL NETWORKS0
SwissAdmin: A multilingual tagged parallel corpus of press releases0
Improving the exploitation of linguistic annotations in ELAN0
KoKo: an L1 Learner Corpus for German0
Statistical Analysis of Multilingual Text Corpus and Development of Language Models0
The MERLIN corpus: Learner language and the CEFR0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
TLAXCALA: a multilingual corpus of independent news0
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish0
GlobalPhone: Pronunciation Dictionaries in 20 Languages0
TweetCaT: a tool for building Twitter corpora of smaller languagesCode0
Native Language Identification Using Large, Longitudinal Data0
Finding Romanized Arabic Dialect in Code-Mixed Tweets0
VarClass: An Open-source Language Identification Tool for Language Varieties0
Vocabulary-Based Language Similarity using Web Corpora0
Facing the Identification Problem in Language-Related Scientific Data Analysis.0
Bootstrapping a historical commodities lexicon with SKOS and DBpedia0
Chinese Native Language Identification0
bs,hr,srWaC - Web Corpora of Bosnian, Croatian and Serbian0
Accurate Language Identification of Twitter Messages0
Automatic Detection and Language Identification of Multilingual Documents0
Automatic Identification of Learners' Language Background Based on Their Writing in Czech0
Word Level Language Identification in Online Multilingual Communication0
The Mysterious Letter J0
Linguistic Profiling of Texts Across Textual Genres and Readability Levels. An Exploratory Study on Italian Fictional Prose0
Text segmentation for Language Identification in Greek Forums0
Categorization of Turkish News Documents with Morphological Analysis0
Crawling microblogging services to gather language-classified URLs. Workflow and case studyCode0
Wordnet-Based Cross-Language Identification of Semantic Relations0
Reconstructing an Indo-European Family Tree from Non-native English Texts0
Extracting the Native Language Signal for Second Language Acquisition0
Combining Shallow and Linguistically Motivated Features in Native Language Identification0
Discriminating Non-Native English with 350 Words0
From Language to Family and Back: Native Language and Language Family Identification from English Text0
Native Language Identification with PPM0
Show:102550
← PrevPage 15 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified