SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 201250 of 794 papers

TitleStatusHype
DeepAnalyzer at SemEval-2019 Task 6: A deep learning-based ensemble method for identifying offensive tweets0
Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario0
Deep Models for Arabic Dialect Identification on Benchmarked Data0
DELab@IIITSM at ICON-2021 Shared Task: Identification of Aggression and Biasness Using Decision Tree0
Demographic Dialectal Variation in Social Media: A Case Study of African-American English0
Detecting Code-Switching in a Multilingual Alpine Heritage Corpus0
Ensemble Methods for Native Language Identification0
Detection of Similar Languages and Dialects Using Deep Supervised Autoencoder0
Detect Language of Transliterated Texts0
Developing Language-tagged Corpora for Code-switching Tweets0
Challenges of Computational Processing of Code-Switching0
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment0
Dialect Diversity in Text Summarization on Twitter0
Dialects Identification of Armenian Language0
Discovering Parallel Language Resources for Training MT Engines0
Discriminating between Indo-Aryan Languages Using SVM Ensembles0
Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models0
Discriminating between Similar Languages Using PPM0
Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search0
Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task0
Discriminating between Similar Languages on Imbalanced Conversational Texts0
Discriminating between Similar Languages with Word-level Convolutional Neural Networks0
BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification0
Discriminating Non-Native English with 350 Words0
Discriminating Similar Languages with Linear SVMs and Neural Networks0
Discriminating Similar Languages with Token-Based Backoff0
Challenges in Neural Language Identification: NRC at VarDial 20200
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks0
Distinguishing Literal and Non-Literal Usage of German Particle Verbs0
Distributed Representations of Words and Documents for Discriminating Similar Languages0
Distributional Interaction of Concreteness and Abstractness in Verb--Noun Subcategorisation0
DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data0
DLRG@DravidianLangTech-EACL2021: Transformer based approachfor Offensive Language Identification on Code-Mixed Tamil0
Do Characters Abuse More Than Words?0
A Report on the VarDial Evaluation Campaign 20200
Does the Phonology of L1 Show Up in L2 Texts?0
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain0
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition0
Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language0
Duluth at SemEval-2020 Task 12: Offensive Tweet Identification in English with Logistic Regression0
Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification0
Efficient Discrimination Between Closely Related Languages0
Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset0
Emad at SemEval-2019 Task 6: Offensive Language Identification using Traditional Machine Learning and Deep Learning approaches0
BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media0
Arabic Dialect Identification in the Context of Bivalency and Code-Switching0
BRUMS at SemEval-2020 Task 12: Transformer Based Multilingual Offensive Language Identification in Social Media0
Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts0
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection0
Categorization of Turkish News Documents with Morphological Analysis0
Show:102550
← PrevPage 5 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified