SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 601–650 of 794 papers

Title | Status | Hype
OLR 2021 Challenge: Datasets, Rules and Baselines | | 0
On-Device Language Identification of Text in Images using Diacritic Characters | | 0
On The Performance of Time-Pooling Strategies for End-to-End Spoken Language Identification | | 0
On the use of Performer and Agent Attention for Spoken Language Identification | | 0
Open-Set Language Identification | | 0
Optimizing a Supervised Classifier for a Difficult Language Identification Problem | | 0
OpusFilter: A Configurable Parallel Corpus Filtering Toolbox | | 0
OpusTools and Parallel Corpus Diagnostics | | 0
Oracle and Human Baselines for Native Language Identification | | 0
Oriental Language Recognition (OLR) 2020: Summary and Analysis | | 0
Overview for the First Shared Task on Language Identification in Code-Switched Data | | 0
Overview for the Second Shared Task on Language Identification in Code-Switched Data | | 0
Overview of the DSL Shared Task 2015 | | 0
Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi | | 0
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | | 0
Parsing Learner Text: to Shoehorn or not to Shoehorn | | 0
Partial Coupling of Optimal Transport for Spoken Language Identification | | 0
Part of Speech Annotation of a Turkish-German Code-Switching Corpus | | 0
Part-of-Speech Tagging for Code-Switched, Transliterated Texts without Explicit Language Identification | | 0
Part-of-speech Tagging of Code-mixed Social Media Content: Pipeline, Stacking and Joint Modelling | | 0
Part-of-speech Tagging of Code-Mixed Social Media Text | | 0
PGSG at SemEval-2020 Task 12: BERT-LSTM with Tweets' Pretrained Model and Noisy Student Training Method | | 0
Phone-aware Neural Language Identification | | 0
Phonetic Temporal Neural Model for Language Identification | | 0
Pin_cod_ at SemEval-2020 Task 12: Injecting Lexicons into Bidirectional Long Short-Term Memory Networks to Detect Turkish Offensive Tweets | | 0
POS Tagging of English-Hindi Code-Mixed Social Media Content | | 0
POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments | | 0
Predicting Code-switching in Multilingual Communication for Immigrant Communities | | 0
Predicting Foreign Language Usage from English-Only Social Media Posts | | 0
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge | | 0
PRHLT-UPV at SemEval-2020 Task 12: BERT for Multilingual Offensive Language Detection | | 0
professionals@DravidianLangTech-EACL2021: Malayalam Offensive Language Identification - A Minimalistic Approach | | 0
Prompt Engineering Using GPT for Word-Level Code-Mixed Language Identification in Low-Resource Dravidian Languages | | 0
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition | | 0
Punctuation as Native Language Interference | | 0
Query log analysis with LangLog | | 0
Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English | | 0
Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting | | 0
Recognizing English Learners' Native Language from Their Writings | | 0
Reconstructing an Indo-European Family Tree from Non-native English Texts | | 0
Recursive Semantic Anchoring in ISO 639:2023: A Structural Extension to ISO/TC 37 Frameworks | | 0
Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification | | 0
Reference Scope Identification in Citing Sentences | | 0
Regression or classification? Automated Essay Scoring for Norwegian | | 0
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition | | 0
RoBERTweet: A BERT Language Model for Romanian Tweets | | 0
Robust, Lexicalized Native Language Identification | | 0
Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset | | 0
Robust Speech Representation Learning via Flow-based Embedding Regularization | | 0
Romanized Berber and Romanized Arabic Automatic Language Identification Using Machine Learning | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified