SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 201225 of 794 papers

TitleStatusHype
Hyperseed: Unsupervised Learning with Vector Symbolic ArchitecturesCode1
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models0
Pretrained Transformers for Offensive Language Identification in TanglishCode0
Is Attention always needed? A Case Study on Language Identification from Speech0
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition0
Language Identification with a Reciprocal Rank ClassifierCode0
UPV at CheckThat! 2021: Mitigating Cultural Differences for Identifying Multilingual Check-worthy ClaimsCode0
Unsupervised Personality-Aware Language Identification0
The futility of STILTs for the classification of lexical borrowings in Spanish0
On the Language-specificity of Multilingual BERT and the Impact of Fine-tuningCode0
FBERT: A Neural Transformer for Identifying Offensive Content0
Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of MarathiCode0
A Pre-trained Transformer and CNN Model with Joint Language ID and Part-of-Speech Tagging for Code-Mixed Social-Media Text0
Fiction in Russian Translation: A Translationese Study0
Corpus Creation and Language Identification in Low-Resource Code-Mixed Telugu-English Text0
Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labelingCode0
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and PostsCode0
A Dual-Decoder Conformer for Multilingual Speech Recognition0
Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification0
OLR 2021 Challenge: Datasets, Rules and Baselines0
Improved Language Identification Through Cross-Lingual Self-Supervised Learning0
Oriental Language Recognition (OLR) 2020: Summary and Analysis0
Language Identification of Hindi-English tweets using code-mixed BERT0
Language Lexicons for Hindi-English Multilingual Text Processing0
A Simple and Efficient Probabilistic Language model for Code-Mixed Text0
Show:102550
← PrevPage 9 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified