SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.
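
As a rough illustration of the task, the sketch below guesses a language by comparing character trigram counts against small per-language profiles using cosine similarity. It is a toy example and not the method of any paper or benchmark listed on this page; the `SAMPLES` sentences, the function names, and the three-language setup are assumptions made only for illustration.

```python
from collections import Counter
import math


def char_ngrams(text, n=3):
    """Count character n-grams of the lowercased text, padded with spaces."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical toy training data: one sentence per language.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and the cat sleeps in the house",
    "de": "der schnelle braune fuchs springt über den faulen hund und die katze schläft im haus",
    "fr": "le renard brun rapide saute par-dessus le chien paresseux et le chat dort dans la maison",
}
PROFILES = {lang: char_ngrams(text) for lang, text in SAMPLES.items()}


def identify(text):
    """Return the language whose trigram profile is most similar to the text."""
    query = char_ngrams(text)
    return max(PROFILES, key=lambda lang: cosine(query, PROFILES[lang]))
```

With these toy profiles, `identify("the cat sleeps in the house")` returns `"en"`. Real systems train on far more data and many more languages; this only shows the shape of the problem.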

Papers

Showing 226–250 of 794 papers

| Title | Status | Hype |
|---|---|---|
| Discriminating Similar Languages with Token-Based Backoff | | 0 |
| Challenges in Neural Language Identification: NRC at VarDial 2020 | | 0 |
| Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks | | 0 |
| Distinguishing Literal and Non-Literal Usage of German Particle Verbs | | 0 |
| Distributed Representations of Words and Documents for Discriminating Similar Languages | | 0 |
| Distributional Interaction of Concreteness and Abstractness in Verb--Noun Subcategorisation | | 0 |
| DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data | | 0 |
| DLRG@DravidianLangTech-EACL2021: Transformer based approach for Offensive Language Identification on Code-Mixed Tamil | | 0 |
| Do Characters Abuse More Than Words? | | 0 |
| A Report on the VarDial Evaluation Campaign 2020 | | 0 |
| Does the Phonology of L1 Show Up in L2 Texts? | | 0 |
| Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain | | 0 |
| BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition | | 0 |
| Ceasing hate with MoH: Hate Speech Detection in Hindi-English Code-Switched Language | | 0 |
| Duluth at SemEval-2020 Task 12: Offensive Tweet Identification in English with Logistic Regression | | 0 |
| Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification | | 0 |
| Efficient Discrimination Between Closely Related Languages | | 0 |
| Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset | | 0 |
| Emad at SemEval-2019 Task 6: Offensive Language Identification using Traditional Machine Learning and Deep Learning approaches | | 0 |
| BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media | | 0 |
| Arabic Dialect Identification in the Context of Bivalency and Code-Switching | | 0 |
| BRUMS at SemEval-2020 Task 12: Transformer Based Multilingual Offensive Language Identification in Social Media | | 0 |
| Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts | | 0 |
| Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection | | 0 |
| Categorization of Turkish News Documents with Morphological Analysis | | 0 |

Benchmark Results

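| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified |
| 2 | XLS-R | Error rate | 5.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GlotLID | Macro F1 | 0.98 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FastText | Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified |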

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConformerG-P | Accuracy | 99.8 | | Unverified |