SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 51100 of 794 papers

TitleStatusHype
Language Identification Using Deep Convolutional Recurrent Neural NetworksCode0
Language Identification with a Reciprocal Rank ClassifierCode0
Aggressive Language Identification Using Word Embeddings and Sentiment FeaturesCode0
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task LearningCode0
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarizationCode0
MERLIon CCS Challenge Evaluation PlanCode0
Multilingual Multi-class Sentiment Classification Using Convolutional Neural NetworksCode0
Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and LakiCode0
Native Language Identification with Big Bird EmbeddingsCode0
Language Variety Identification with True LabelsCode0
Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labelingCode0
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language IdentificationCode0
Measuring language distance among historical varieties using perplexity. Application to European Portuguese.Code0
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuningCode0
Joint UD Parsing of Norwegian Bokm and NynorskCode0
Is It Navajo? Accurate Language Detection in Endangered Athabaskan LanguagesCode0
JU\_ETCE\_17\_21 at SemEval-2019 Task 6: Efficient Machine Learning and Neural Network Approaches for Identifying and Categorizing Offensive Language in TweetsCode0
indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian LanguagesCode0
IIITK@DravidianLangTech-EACL2021: Offensive Language Identification and Meme Classification in Tamil, Malayalam and KannadaCode0
Improved Text Language Identification for the South African LanguagesCode0
indicnlp@ kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian LanguagesCode0
HeLI, a Word-Based Backoff Method for Language IdentificationCode0
Geographically-Informed Language IdentificationCode0
Hierarchical Character-Word Models for Language IdentificationCode0
Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form SpeechCode0
From English to Code-Switching: Transfer Learning with Strong Morphological CluesCode0
Advancing Uto-Aztecan Language Technologies: A Case Study on the Endangered Comanche LanguageCode0
From N-grams to Pre-trained Multilingual Models For Language IdentificationCode0
Geographic Adaptation of Pretrained Language ModelsCode0
KréyoLID From Language Identification Towards Language MiningCode0
Enhance Language Identification using Dual-mode Model with Knowledge DistillationCode0
GeezSwitch: Language Identification in Typologically Related Low-resourced East African LanguagesCode0
FBK-DH at SemEval-2020 Task 12: Using Multi-channel BERT for Multilingual Offensive Language DetectionCode0
Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language IdentificationCode0
A study of N-gram and Embedding Representations for Native Language IdentificationCode0
Hate-Alert@DravidianLangTech-EACL2021: Ensembling strategies for Transformer-based Offensive language DetectionCode0
End-to-end Language Identification using NetFV and NetVLADCode0
IIITT@DravidianLangTech-EACL2021: Transfer Learning for Offensive Language Detection in Dravidian LanguagesCode0
Improving Multilingual ASR in the Wild Using Simple N-best Re-rankingCode0
English Please: Evaluating Machine Translation with Large Language Models for Multilingual Bug ReportsCode0
Finding Structure in Text, Genome and Other Symbolic SequencesCode0
Investigating model performance in language identification: beyond simple error statisticsCode0
Italian Language and Dialect Identification and Regional French Variety Detection using Adaptive Naive BayesCode0
Automatic Dialect Detection in Arabic Broadcast SpeechCode0
DOSA: Dravidian Code-Mixed Offensive Span Identification DatasetCode0
DocLangID: Improving Few-Shot Training to Identify the Language of Historical DocumentsCode0
A Semisupervised Approach for Language Identification based on Ladder NetworksCode0
Language Identification for Austronesian LanguagesCode0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesCode0
Embeddia at SemEval-2019 Task 6: Detecting Hate with Neural Network and Transfer Learning ApproachesCode0
Show:102550
← PrevPage 2 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified