SOTAVerified

Language Identification

Language identification is the task of determining the language of a given text or speech sample.

Papers

Showing 151-200 of 794 papers

| Title | Status | Hype |
| Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario | | 0 |
| Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition | | 0 |
| CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages | | 0 |
| Dialects Identification of Armenian Language | | 0 |
| Universal Dependencies Treebank for Tatar: Incorporating Intra-Word Code-Switching Information | | 0 |
| HeLI-OTS, Off-the-shelf Language Identifier for Text | | 0 |
| MHE: Code-Mixed Corpora for Similar Language Identification | | 0 |
| Adversarial synthesis based data-augmentation for code-switched spoken language identification | | 0 |
| FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech | Code | 0 |
| Modernizing Open-Set Speech Language Identification | | 0 |
| Automatic Spoken Language Identification using a Time-Delay Neural Network | | 0 |
| Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge | | 0 |
| Building Machine Translation Systems for the Next Thousand Languages | | 0 |
| TuGeBiC: A Turkish German Bilingual Code-Switching Corpus | | 0 |
| Findings of the Shared Task on Multi-task Learning in Dravidian Languages | | 0 |
| Unsupervised Preference-Aware Language Identification | Code | 0 |
| L3Cube-HingCorpus and HingBERT: A Code Mixed Hindi-English Dataset and BERT Language Models | Code | 1 |
| Automated speech tools for helping communities process restricted-access corpora for language revival efforts | | 0 |
| Transducer-based language embedding for spoken language identification | | 0 |
| Improving Language Identification of Accented Speech | | 0 |
| Partial Coupling of Optimal Transport for Spoken Language Identification | | 0 |
| Code Switched and Code Mixed Speech Recognition for Indic languages | | 0 |
| PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification | Code | 1 |
| Geographic Adaptation of Pretrained Language Models | Code | 0 |
| Automatic Language Identification for Celtic Texts | | 0 |
| Enhance Language Identification using Dual-mode Model with Knowledge Distillation | Code | 0 |
| BERT-LID: Leveraging BERT to Improve Spoken Language Identification | Code | 1 |
| Towards a Common Speech Analysis Engine | | 0 |
| Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech | Code | 0 |
| CALCS 2021 Shared Task: Machine Translation for Code-Switched Data | | 0 |
| HaT5: Hate Language Identification using Text-to-Text Transfer Transformer | | 0 |
| Translated Texts Under the Lens: From Machine Translation Detection to Source Language Identification | | 0 |
| Cognitive Computing to Optimize IT Services | | 0 |
| Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching | | 0 |
| LUC at ComMA-2021 Shared Task: Multilingual Gender Biased and Communal Language Identification without using linguistic features | | 0 |
| Robust Speech Representation Learning via Flow-based Embedding Regularization | | 0 |
| BFCAI at ComMA@ICON 2021: Support Vector Machines for Multilingual Gender Biased and Communal Language Identification | | 0 |
| Beware Haters at ComMA@ICON: Sequence and Ensemble Classifiers for Aggression, Gender Bias and Communal Bias Identification in Indian Languages | | 0 |
| MUCIC at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using N-grams and Multilingual Sentence Encoders | | 0 |
| MUM at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using Supervised Learning Approaches | | 0 |
| DELab@IIITSM at ICON-2021 Shared Task: Identification of Aggression and Biasness Using Decision Tree | | 0 |
| ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Task at ICON-2021 | | 0 |
| XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale | Code | 1 |
| Unsupervised Preference-Aware Language Identification | | 0 |
| Developing Successful Shared Tasks on Offensive Language Identification for Dravidian Languages | | 0 |
| Native Language Identification and Reconstruction of Native Language Relationship Using Japanese Learner Corpus | | 0 |
| An Investigation into the Contribution of Locally Aggregated Descriptors to Figurative Language Identification | Code | 0 |
| Language Clustering for Multilingual Named Entity Recognition | | 0 |
| Tackling the Score Shift in Cross-Lingual Speaker Verification by Exploiting Language Information | | 0 |
| Ceasing hate with MoH: Hate Speech Detection in Hindi-English Code-Switched Language | | 0 |

Benchmark Results

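| # | Model | Metric | Claimed | Verified | Status |
| 1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified |
| 2 | XLS-R | Error rate | 5.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | GlotLID | Macro F1 | 0.98 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | FastText | Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | ConformerG-P | Accuracy | 99.8 | | Unverified |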