SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 76100 of 794 papers

TitleStatusHype
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language IdentificationCode0
GlotLID: Language Identification for Low-Resource LanguagesCode1
Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition0
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond0
Wavelet Scattering Transform for Improving Generalization in Low-Resourced Spoken Language Identification0
Multimodal Modeling For Spoken Language Identification0
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages0
Visual Speech Recognition for Languages with Limited Labeled Data using Automatic Labels from WhisperCode1
Native Language Identification with Big Bird EmbeddingsCode0
Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset0
Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts0
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss0
Turkish Native Language Identification0
MASR: Multi-label Aware Speech Representation0
Multilingual Speech-to-Speech Translation into Multiple Target Languages0
Towards spoken dialect identification of Irish0
Confidence-based Ensembles of End-to-End Speech Recognition Models0
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation BenchmarksCode0
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizerCode0
RoBERTweet: A BERT Language Model for Romanian Tweets0
Leveraging Language Identification to Enhance Code-Mixed Text Classification0
Label Aware Speech Representation Learning For Language Identification0
Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed SpeechCode0
Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning0
MERLIon CCS Challenge Evaluation PlanCode0
Show:102550
← PrevPage 4 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified