SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.
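
As a rough illustration of the task, the sketch below labels a text by comparing its character trigrams against per-language trigram counts. It is a toy baseline under stated assumptions, not the method of any paper listed here: the function names (`trigrams`, `train`, `identify`), the two-language `SAMPLES` data, and the overlap score are all hypothetical choices made for the example.

```python
from collections import Counter

# Hypothetical toy training data: two sentences per language.
SAMPLES = {
    "en": [
        "the quick brown fox jumps over the lazy dog",
        "this is a short sentence in the english language",
    ],
    "de": [
        "der schnelle braune fuchs springt über den faulen hund",
        "dies ist ein kurzer satz in der deutschen sprache",
    ],
}


def trigrams(text):
    """Return the character trigrams of a lowercased, space-padded text."""
    padded = f"  {text.lower()} "
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def train(samples):
    """Build one trigram-count profile per language."""
    return {
        lang: Counter(g for sentence in sentences for g in trigrams(sentence))
        for lang, sentences in samples.items()
    }


def identify(text, profiles):
    """Return the language whose profile overlaps most with the text."""
    grams = Counter(trigrams(text))

    def score(profile):
        total = sum(profile.values())
        # Weight each trigram of the text by its relative frequency in the profile.
        return sum(count * profile[g] / total for g, count in grams.items())

    return max(profiles, key=lambda lang: score(profiles[lang]))


profiles = train(SAMPLES)
print(identify("the dog is in the house", profiles))     # en
print(identify("der hund ist in der schule", profiles))  # de
```

With this little training data the sketch only separates inputs that share many trigrams with the samples; the systems in the tables below rely on far larger corpora and learned models.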

Papers

Showing 201-250 of 794 papers

Title | Status | Hype
DeepAnalyzer at SemEval-2019 Task 6: A deep learning-based ensemble method for identifying offensive tweets | | 0
Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario | | 0
CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020:Identifying Offensive Language from ManglishTweets | | 0
DELab@IIITSM at ICON-2021 Shared Task: Identification of Aggression and Biasness Using Decision Tree | | 0
CUSATNLP@DravidianLangTech-EACL2021:Language Agnostic Classification of Offensive Content in Tweets | | 0
Detecting Code-Switching in a Multilingual Alpine Heritage Corpus | | 0
Automatic Identification of Learners' Language Background Based on Their Writing in Czech | | 0
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoder | | 0
Detect Language of Transliterated Texts | | 0
Developing Language-tagged Corpora for Code-switching Tweets | | 0
Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks | | 0
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment | | 0
Dialect Diversity in Text Summarization on Twitter | | 0
Dialects Identification of Armenian Language | | 0
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages | | 0
Discriminating between Indo-Aryan Languages Using SVM Ensembles | | 0
Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models | | 0
Discriminating between Similar Languages Using PPM | | 0
Automatic Identification of Closely-related Indian Languages: Resources and Experiments | | 0
Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task | | 0
Discriminating between Similar Languages on Imbalanced Conversational Texts | | 0
Discriminating between Similar Languages with Word-level Convolutional Neural Networks | | 0
cs@DravidianLangTech-EACL2021: Offensive Language Identification Based On Multilingual BERT Model | | 0
Discriminating Non-Native English with 350 Words | | 0
Discriminating Similar Languages with Linear SVMs and Neural Networks | | 0
Discriminating Similar Languages with Token-Based Backoff | | 0
Cross-Linguistic Offensive Language Detection: BERT-Based Analysis of Bengali, Assamese, & Bodo Conversational Hateful Content from Social Media | | 0
Automatic discovery of Latin syntactic changes | | 0
Distinguishing Literal and Non-Literal Usage of German Particle Verbs | | 0
Distributed Representations of Words and Documents for Discriminating Similar Languages | | 0
Distributional Interaction of Concreteness and Abstractness in Verb--Noun Subcategorisation | | 0
DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data | | 0
DLRG@DravidianLangTech-EACL2021: Transformer based approachfor Offensive Language Identification on Code-Mixed Tamil | | 0
Do Characters Abuse More Than Words? | | 0
Anglicized Words and Misspelled Cognates in Native Language Identification | | 0
A Federated Learning Approach to Privacy Preserving Offensive Language Identification | | 0
A Dataset and Classifier for Recognizing Social Media English | | 0
Accurate Pinyin-English Codeswitched Language Identification | | 0
Cross-lingual Inductive Transfer to Detect Offensive Language | | 0
Duluth at SemEval-2020 Task 12: Offensive Tweet Identification in English with Logistic Regression | | 0
Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification | | 0
Efficient Discrimination Between Closely Related Languages | | 0
Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset | | 0
Emad at SemEval-2019 Task 6: Offensive Language Identification using Traditional Machine Learning and Deep Learning approaches | | 0
Cross-domain Feature Selection for Language Identification | | 0
Automatic Detection of Sentence Fragments | | 0
An Exploratory Analysis of the Relation Between Offensive Language and Mental Health | | 0
Cross-corpus Native Language Identification via Statistical Embedding | | 0
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection | | 0
Cross-Corpora Spoken Language Identification with Domain Diversification and Generalization | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified