SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 501-550 of 794 papers

| Title | Status | Hype |
|---|---|---|
| Automatic Language Identification for Romance Languages using Stop Words and Diacritics | | 0 |
| Gender Prediction in English-Hindi Code-Mixed Social Media Content: Corpus and Baseline System | | 0 |
| Addition of Code Mixed Features to Enhance the Sentiment Prediction of Song Lyrics | | 0 |
| A Comparison of Character Neural Language Model and Bootstrapping for Language Identification in Multilingual Noisy Texts | | 0 |
| Using Language Learner Data for Metaphor Detection | Code | 0 |
| Cross-corpus Native Language Identification via Statistical Embedding | | 0 |
| Linguistic Features of Sarcasm and Metaphor Production Quality | | 0 |
| Tübingen-Oslo at SemEval-2018 Task 2: SVMs perform better than RNNs in Emoji Prediction | | 0 |
| Predicting Foreign Language Usage from English-Only Social Media Posts | | 0 |
| Using Classifier Features to Determine Language Transfer on Morphemes | | 0 |
| What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training | Code | 0 |
| A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech | | 0 |
| Building a TOCFL Learner Corpus for Chinese Grammatical Error Diagnosis | | 0 |
| Towards Language Technology for Mi'kmaq | | 0 |
| From 'Solved Problems' to New Challenges: A Report on LDC Activities | | 0 |
| Discriminating between Similar Languages on Imbalanced Conversational Texts | | 0 |
| Discovering Parallel Language Resources for Training MT Engines | | 0 |
| Arabic Dialect Identification in the Context of Bivalency and Code-Switching | | 0 |
| Classification of Closely Related Sub-dialects of Arabic Using Support-Vector Machines | | 0 |
| Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach | | 0 |
| Shami: A Corpus of Levantine Arabic Dialects | | 0 |
| VAST: A Corpus of Video Annotation for Speech Technologies | | 0 |
| Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks | Code | 0 |
| Building Parallel Monolingual Gan Chinese Dialects Corpus | | 0 |
| The French-Algerian Code-Switching Triggered audio corpus (FACST) | | 0 |
| Text Normalization Infrastructure that Scales to Hundreds of Language Varieties | | 0 |
| Collecting Code-Switched Data from Social Media | | 0 |
| Coreference Resolution in FreeLing 4.0 | | 0 |
| A Portuguese Native Language Identification Dataset | | 0 |
| Staircase Network: structural language identification via hierarchical attentive units | | 0 |
| Automatic Language Identification in Texts: A Survey | Code | 0 |
| Automated essay scoring with string kernels and word embeddings | | 0 |
| Universal Dependency Parsing for Hindi-English Code-switching | Code | 0 |
| Automatic Language Identification System for Hindi and Magahi | | 0 |
| A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification | | 0 |
| Insights into End-to-End Learning Scheme for Language Identification | | 0 |
| Automatic Identification of Closely-related Indian Languages: Resources and Experiments | | 0 |
| Language Identification of Bengali-English Code-Mixed data using Character & Phonetic based LSTM Models | | 0 |
| The WiLI benchmark dataset for written language identification | Code | 0 |
| Methods for Spoken Language Identification | | 0 |
| Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks | | 0 |
| Using Social Networks to Improve Language Variety Identification with Neural Networks | | 0 |
| Improved Text Language Identification for the South African Languages | Code | 0 |
| Building Dialectal Arabic Corpora | | 0 |
| Native Language Identification using Phonetic Algorithms | | 0 |
| Neural Networks and Spelling Features for Native Language Identification | | 0 |
| Native Language Identification Using a Mixture of Character and Word N-grams | | 0 |
| The Power of Character N-grams in Native Language Identification | | 0 |
| CIC-FBK Approach to Native Language Identification | | 0 |
| A deep-learning based native-language classification by using a latent semantic analysis for the NLI Shared Task 2017 | | 0 |
Page 11 of 16

Benchmark Results

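| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified |
| 2 | XLS-R | Error rate | 5.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GlotLID | Macro F1 | 0.98 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FastText | Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConformerG-P | Accuracy | 99.8 | | Unverified |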