SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 101125 of 794 papers

TitleStatusHype
Offensive Language Identification in Transliterated and Code-Mixed Bangla0
The Obscure Limitation of Modular Multilingual Language Models0
Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability0
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language IdentificationCode0
Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition0
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond0
Wavelet Scattering Transform for Improving Generalization in Low-Resourced Spoken Language Identification0
Multimodal Modeling For Spoken Language Identification0
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages0
Native Language Identification with Big Bird EmbeddingsCode0
Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset0
Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts0
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss0
Turkish Native Language Identification0
MASR: Multi-label Aware Speech Representation0
Multilingual Speech-to-Speech Translation into Multiple Target Languages0
Towards spoken dialect identification of Irish0
Confidence-based Ensembles of End-to-End Speech Recognition Models0
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks0
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer0
RoBERTweet: A BERT Language Model for Romanian Tweets0
Leveraging Language Identification to Enhance Code-Mixed Text Classification0
Label Aware Speech Representation Learning For Language Identification0
Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed SpeechCode0
MERLIon CCS Challenge Evaluation PlanCode0
Show:102550
← PrevPage 5 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified