SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.
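
As a rough illustration of the task defined above, the following minimal sketch classifies a string by comparing its character trigram counts against per-language profiles using cosine similarity. It is a toy baseline, not the method of any paper listed on this page. The names `char_ngrams`, `cosine`, `identify`, `SAMPLES`, and `PROFILES` are hypothetical, and the three training snippets are made up for demonstration; a realistic system would need far more data and many more languages.

```python
from collections import Counter
import math


def char_ngrams(text, n=3):
    """Count overlapping character n-grams of the lowercased, space-padded text."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors (Counters)."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical toy training text, one short snippet per language (assumption).
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and this is the language of the text",
    "fr": "le renard brun saute par dessus le chien paresseux et ceci est la langue du texte",
    "de": "der schnelle braune fuchs und der faule hund das ist die sprache des textes",
}

# One trigram profile per language, built once from the toy samples.
PROFILES = {lang: char_ngrams(text) for lang, text in SAMPLES.items()}


def identify(text):
    """Return the language code whose profile is most similar to the input."""
    query = char_ngrams(text)
    return max(PROFILES, key=lambda lang: cosine(query, PROFILES[lang]))


if __name__ == "__main__":
    for sentence in ["this is the text", "le chien est paresseux", "der hund ist faul"]:
        print(sentence, "->", identify(sentence))
```

Character n-grams are used here because they need no tokenizer and cope reasonably with short inputs; the choice of trigrams and cosine similarity is an assumption made for brevity.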

Papers

Showing 151-175 of 794 papers

Title | Status | Hype
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers | - | 0
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification | - | 0
Italian Language and Dialect Identification and Regional French Variety Detection using Adaptive Naive Bayes | Code | 0
Neural Networks for Cross-domain Language Identification. Phlyers @Vardial 2022 | - | 0
OcWikiDisc: a Corpus of Wikipedia Talk Pages in Occitan | - | 0
The Curious Case of Logistic Regression for Italian Languages and Dialects Identification | Code | 0
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification | - | 0
Evaluation of Off-the-Shelf Language Identification Tools on Bulgarian Social Media Posts | - | 0
Unravelling Interlanguage Facts via Explainable Machine Learning | - | 0
Extending RNN-T-based speech recognition systems with emotion and language classification | - | 0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource Devices | Code | 0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition | - | 0
TechSSN at SemEval-2022 Task 6: Intended Sarcasm Detection using Transformer Models | - | 0
Language Identification for Austronesian Languages | Code | 0
HeLI-OTS, Off-the-shelf Language Identifier for Text | - | 0
Dialects Identification of Armenian Language | - | 0
MHE: Code-Mixed Corpora for Similar Language Identification | - | 0
Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario | - | 0
GeezSwitch: Language Identification in Typologically Related Low-resourced East African Languages | Code | 0
Universal Dependencies Treebank for Tatar: Incorporating Intra-Word Code-Switching Information | - | 0
CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages | - | 0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition | - | 0
Adversarial synthesis based data-augmentation for code-switched spoken language identification | - | 0
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech | Code | 0
Modernizing Open-Set Speech Language Identification | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | - | Unverified
2 | XLS-R | Error rate | 5.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | - | Unverified