SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 5175 of 794 papers

TitleStatusHype
FastSpell: the LangId Magic SpellCode1
What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy ConditionsCode0
Geographically-Informed Language IdentificationCode0
More than words: Advancements and challenges in speech recognition for singing0
Validating and Exploring Large Geographic Corpora0
Aligning Speech to Languages to Enhance Code-switching Speech Recognition0
Language and Speech Technology for Central Kurdish VarietiesCode1
KInIT at SemEval-2024 Task 8: Fine-tuned LLMs for Multilingual Machine-Generated Text DetectionCode1
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language IdentificationCode0
Code-Switched Language Identification is Harder Than You ThinkCode0
Detecting Structured Language Alternations in Historical Documents by Combining Language Identification with Fourier Analysis0
Acoustic characterization of speech rhythm: going beyond metrics with recurrent neural networks0
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data FiltersCode1
Language Detection for Transliterated Content0
MathPile: A Billion-Token-Scale Pretraining Corpus for MathCode2
Generative linguistic representation for spoken language identification0
Cross-Linguistic Offensive Language Detection: BERT-Based Analysis of Bengali, Assamese, & Bodo Conversational Hateful Content from Social Media0
Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition0
Attention-Guided Adaptation for Code-Switching Speech Recognition0
Native Language Identification with Large Language Models0
Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification0
A Text-to-Text Model for Multilingual Offensive Language Identification0
Offensive Language Identification in Transliterated and Code-Mixed Bangla0
The Obscure Limitation of Modular Multilingual Language Models0
Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability0
Show:102550
← PrevPage 3 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified