SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 451500 of 794 papers

TitleStatusHype
Norwegian Native Language Identification0
NTU\_NLP at SemEval-2020 Task 12: Identifying Offensive Tweets Using Hierarchical Multi-Task Learning Approach0
NUIG at SemEval-2020 Task 12: Pseudo Labelling for Offensive Content Classification0
NULI at SemEval-2019 Task 6: Transfer Learning for Offensive Language Detection using Bidirectional Transformers0
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts0
OcWikiDisc: a Corpus of Wikipedia Talk Pages in Occitan0
Absit invidia verbo: Comparing Deep Learning methods for offensive language0
Offensive language identification in Dravidian code mixed social media text0
Offensive Language Identification in Transliterated and Code-Mixed Bangla0
OFFLangOne@DravidianLangTech-EACL2021: Transformers with the Class Balanced Loss for Offensive Language Identification in Dravidian Code-Mixed text.0
OffTamil@DravideanLangTech-EASL2021: Offensive Language Identification in Tamil Text0
OLR 2021 Challenge: Datasets, Rules and Baselines0
On-Device Language Identification of Text in Images using Diacritic Characters0
On The Performance of Time-Pooling Strategies for End-to-End Spoken Language Identification0
On the use of Performer and Agent Attention for Spoken Language Identification0
Open-Set Language Identification0
Optimizing a Supervised Classifier for a Difficult Language Identification Problem0
OpusFilter: A Configurable Parallel Corpus Filtering Toolbox0
OpusTools and Parallel Corpus Diagnostics0
Oracle and Human Baselines for Native Language Identification0
Oriental Language Recognition (OLR) 2020: Summary and Analysis0
Overview for the First Shared Task on Language Identification in Code-Switched Data0
Overview for the Second Shared Task on Language Identification in Code-Switched Data0
Overview of the DSL Shared Task 20150
Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi0
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification0
Parsing Learner Text: to Shoehorn or not to Shoehorn0
Partial Coupling of Optimal Transport for Spoken Language Identification0
Part of Speech Annotation of a Turkish-German Code-Switching Corpus0
Part-of-Speech Tagging for Code-Switched, Transliterated Texts without Explicit Language Identification0
Part-of-speech Tagging of Code-mixed Social Media Content: Pipeline, Stacking and Joint Modelling0
Part-of-speech Tagging of Code-Mixed Social Media Text0
PGSG at SemEval-2020 Task 12: BERT-LSTM with Tweets' Pretrained Model and Noisy Student Training Method0
Phone-aware Neural Language Identification0
Phonetic Temporal Neural Model for Language Identification0
Pin\_cod\_ at SemEval-2020 Task 12: Injecting Lexicons into Bidirectional Long Short-Term Memory Networks to Detect Turkish Offensive Tweets0
POS Tagging of English-Hindi Code-Mixed Social Media Content0
POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments0
Predicting Code-switching in Multilingual Communication for Immigrant Communities0
Predicting Foreign Language Usage from English-Only Social Media Posts0
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge0
PRHLT-UPV at SemEval-2020 Task 12: BERT for Multilingual Offensive Language Detection0
professionals@DravidianLangTech-EACL2021: Malayalam Offensive Language Identification - A Minimalistic Approach0
Prompt Engineering Using GPT for Word-Level Code-Mixed Language Identification in Low-Resource Dravidian Languages0
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition0
Punctuation as Native Language Interference0
Query log analysis with LangLog0
Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English0
Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting0
Recognizing English Learners' Native Language from Their Writings0
Show:102550
← PrevPage 10 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified