SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 151–175 of 794 papers

Title | Status | Hype
Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario | | 0
GeezSwitch: Language Identification in Typologically Related Low-resourced East African Languages | Code | 0
CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages | | 0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition | | 0
Universal Dependencies Treebank for Tatar: Incorporating Intra-Word Code-Switching Information | | 0
HeLI-OTS, Off-the-shelf Language Identifier for Text | | 0
MHE: Code-Mixed Corpora for Similar Language Identification | | 0
Adversarial synthesis based data-augmentation for code-switched spoken language identification | | 0
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech | Code | 0
Modernizing Open-Set Speech Language Identification | | 0
Automatic Spoken Language Identification using a Time-Delay Neural Network | | 0
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge | | 0
Building Machine Translation Systems for the Next Thousand Languages | | 0
TuGeBiC: A Turkish German Bilingual Code-Switching Corpus | | 0
Findings of the Shared Task on Multi-task Learning in Dravidian Languages | | 0
Unsupervised Preference-Aware Language Identification | Code | 0
L3Cube-HingCorpus and HingBERT: A Code Mixed Hindi-English Dataset and BERT Language Models | Code | 1
Automated speech tools for helping communities process restricted-access corpora for language revival efforts | | 0
Transducer-based language embedding for spoken language identification | | 0
Partial Coupling of Optimal Transport for Spoken Language Identification | | 0
Improving Language Identification of Accented Speech | | 0
Code Switched and Code Mixed Speech Recognition for Indic languages | | 0
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification | Code | 1
Geographic Adaptation of Pretrained Language Models | Code | 0
Automatic Language Identification for Celtic Texts | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified