SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 351400 of 794 papers

TitleStatusHype
Language Identification in Code-Switched Text Using Conditional Random Fields and Babelnet0
Language Identification in Code-Switching Scenario0
Language Identification of Bengali-English Code-Mixed data using Character & Phonetic based LSTM Models0
Language Identification of Devanagari Poems0
Language Identification of Hindi-English tweets using code-mixed BERT0
Language Identification on Massive Datasets of Short Message using an Attention Mechanism CNN0
Language Identification using Classifier Ensembles0
Language Identification with Deep Bottleneck Features0
Language ID Prediction from Speech Using Self-Attentive Pooling and 1D-Convolutions0
Language ID Prediction from Speech Using Self-Attentive Pooling0
Language Lexicons for Hindi-English Multilingual Text Processing0
Language Model Adaptation for Language and Dialect Identification of Text0
Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data0
Language Modeling with Functional Head Constraint for Code Switching Speech Recognition0
Language Transfer Hypotheses with Linear SVM Weights0
Language variety identification in Spanish tweets0
Large Scale Lexical Analysis0
Large-Scale Native Language Identification with Cross-Corpus Evaluation0
Learning Multilingual Meta-Embeddings for Code-Switching Named Entity Recognition0
Learning with learner corpora: Using the TLE for native language identification0
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding0
Leveraging Data-Driven Methods in Word-Level Language Identification for a Multilingual Alpine Heritage Corpus0
Leveraging Language Identification to Enhance Code-Mixed Text Classification0
Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition0
Leveraging Latent Representations of Speech for Indian Language Identification0
Leveraging Open-Source Large Language Models for Native Language Identification0
Lexical Normalization for Code-switched Data and its Effect on POS-tagging0
LIDE: Language Identification from Text Documents0
LIIR at SemEval-2020 Task 12: A Cross-Lingual Augmentation Approach for Multilingual Offensive Language Identification0
LILI: A Simple Language Independent Approach for Language Identification0
LIMSI's participation to the 2013 shared task on Native Language Identification0
LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation0
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.0
Linguistic Features of Sarcasm and Metaphor Production Quality0
Linguistic Profiling based on General--purpose Features and Native Language Identification0
Linguistic Profiling of Texts Across Textual Genres and Readability Levels. An Exploratory Study on Italian Fictional Prose0
LISAC FSDM-USMBA Team at SemEval-2020 Task 12: Overcoming AraBERT's pretrain-finetune discrepancy for Arabic offensive language identification0
Listen, Read, and Identify: Multimodal Singing Language Identification of Music0
Literary and Colloquial Dialect Identification for Tamil using Acoustic Features0
Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions0
LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?0
LUC at ComMA-2021 Shared Task: Multilingual Gender Biased and Communal Language Identification without using linguistic features0
Lump at SemEval-2017 Task 1: Towards an Interlingua Semantic Similarity0
Machine Learning Based Source Code Classification Using Syntax Oriented Features0
Machine Learning for Rhetorical Figure Detection: More Chiasmus with Less Annotation0
Malayalam Sign Language Identification using Finetuned YOLOv8 and Computer Vision Techniques0
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models0
Mapping Languages: The Corpus of Global Language Use0
MASR: Multi-label Aware Speech Representation0
Maximizing Classification Accuracy in Native Language Identification0
Show:102550
← PrevPage 8 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified