SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 110 of 794 papers

TitleStatusHype
mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks0
Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world?0
Recursive Semantic Anchoring in ISO 639:2023: A Structural Extension to ISO/TC 37 Frameworks0
TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-trainingCode11
Token Masking Improves Transformer-Based Text Classification0
Advancing Uto-Aztecan Language Technologies: A Case Study on the Endangered Comanche LanguageCode0
Improving Informally Romanized Language Identification0
(Im)possibility of Automated Hallucination Detection in Large Language Models0
Show:102550
← PrevPage 1 of 80Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified