SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 151200 of 794 papers

TitleStatusHype
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers0
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification0
Italian Language and Dialect Identification and Regional French Variety Detection using Adaptive Naive BayesCode0
Neural Networks for Cross-domain Language Identification. Phlyers @Vardial 20220
OcWikiDisc: a Corpus of Wikipedia Talk Pages in Occitan0
The Curious Case of Logistic Regression for Italian Languages and Dialects IdentificationCode0
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification0
Evaluation of Off-the-Shelf Language Identification Tools on Bulgarian Social Media Posts0
Unravelling Interlanguage Facts via Explainable Machine Learning0
Extending RNN-T-based speech recognition systems with emotion and language classification0
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesCode0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition0
TechSSN at SemEval-2022 Task 6: Intended Sarcasm Detection using Transformer Models0
Language Identification for Austronesian LanguagesCode0
HeLI-OTS, Off-the-shelf Language Identifier for Text0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition0
Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario0
CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages0
GeezSwitch: Language Identification in Typologically Related Low-resourced East African LanguagesCode0
MHE: Code-Mixed Corpora for Similar Language Identification0
Universal Dependencies Treebank for Tatar: Incorporating Intra-Word Code-Switching Information0
Dialects Identification of Armenian Language0
Adversarial synthesis based data-augmentation for code-switched spoken language identification0
FLEURS: Few-shot Learning Evaluation of Universal Representations of SpeechCode0
Modernizing Open-Set Speech Language Identification0
Automatic Spoken Language Identification using a Time-Delay Neural Network0
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge0
Building Machine Translation Systems for the Next Thousand Languages0
TuGeBiC: A Turkish German Bilingual Code-Switching Corpus0
Unsupervised Preference-Aware Language IdentificationCode0
Findings of the Shared Task on Multi-task Learning in Dravidian Languages0
Automated speech tools for helping communities process restricted-access corpora for language revival efforts0
Transducer-based language embedding for spoken language identification0
Partial Coupling of Optimal Transport for Spoken Language Identification0
Improving Language Identification of Accented Speech0
Code Switched and Code Mixed Speech Recognition for Indic languages0
Geographic Adaptation of Pretrained Language ModelsCode0
Automatic Language Identification for Celtic Texts0
Enhance Language Identification using Dual-mode Model with Knowledge DistillationCode0
Towards a Common Speech Analysis Engine0
Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form SpeechCode0
CALCS 2021 Shared Task: Machine Translation for Code-Switched Data0
HaT5: Hate Language Identification using Text-to-Text Transfer Transformer0
Translated Texts Under the Lens: From Machine Translation Detection to Source Language Identification0
Cognitive Computing to Optimize IT Services0
LUC at ComMA-2021 Shared Task: Multilingual Gender Biased and Communal Language Identification without using linguistic features0
Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching0
Robust Speech Representation Learning via Flow-based Embedding Regularization0
MUM at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using Supervised Learning Approaches0
BFCAI at ComMA@ICON 2021: Support Vector Machines for Multilingual Gender Biased and Communal Language Identification0
Show:102550
← PrevPage 4 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified