SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.

Papers

Showing 401450 of 794 papers

TitleStatusHype
From Visualisation to Hypothesis Construction for Second Language Acquisition0
Fully Connected Neural Network with Advance Preprocessor to Identify Aggression over Facebook and Twitter0
Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability0
Fusion of Simple Models for Native Language Identification0
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models0
Garain at SemEval-2020 Task 12: Sequence based Deep Learning for Categorizing Offensive Language in Social Media0
Gender Prediction in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System0
Generation through the lens of learning theory0
Generative linguistic representation for spoken language identification0
GlobalPhone: Pronunciation Dictionaries in 20 Languages0
GLUECoS : An Evaluation Benchmark for Code-Switched NLP0
GLUECoS: An Evaluation Benchmark for Code-Switched NLP0
HAD-T\"ubingen at SemEval-2019 Task 6: Deep Learning Analysis of Offensive Language on Twitter: Identification and Categorization0
HaT5: Hate Language Identification using Text-to-Text Transfer Transformer0
HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles0
HeLI-based Experiments in Swiss German Dialect Identification0
HeLI-OTS, Off-the-shelf Language Identifier for Text0
HHU at SemEval-2019 Task 6: Context Does Matter - Tackling Offensive Language Identification and Categorization with ELMo0
Hindi-English Code-Switching Speech Corpus0
Hitachi at SemEval-2020 Task 12: Offensive Language Identification with Noisy Labels using Statistical Sampling and Post-Processing0
HUB@DravidianLangTech-EACL2021: Identify and Classify Offensive Text in Multilingual Code Mixing in Social Media0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition0
Hypers@DravidianLangTech-EACL2021: Offensive language identification in Dravidian code-mixed YouTube Comments and Posts0
iCompass at SemEval-2020 Task 12: From a Syntax-ignorant N-gram Embeddings Model to a Deep Bidirectional Language Model0
Identification of Indian Languages using Ghost-VLAD pooling0
Identification of Languages in Algerian Arabic Multilingual Documents0
Identification/Segmentation of Indian Regional Languages with Singular Value Decomposition based Feature Embedding0
Identifying Languages at the Word Level in Code-Mixed Indian Social Media Text0
IIITG-ADBU at SemEval-2020 Task 12: Comparison of BERT and BiLSTM in Detecting Offensive Language0
IIT (BHU) System for Indo-Aryan Language Identification (ILI) at VarDial 20180
IITP-AINLPML at SemEval-2020 Task 12: Offensive Tweet Identification and Target Categorization in a Multitask Environment0
(Im)possibility of Automated Hallucination Detection in Large Language Models0
Improved Language Identification Through Cross-Lingual Self-Supervised Learning0
Improving Cuneiform Language Identification with BERT0
Improving Informally Romanized Language Identification0
Improving Language Identification for Multilingual Speakers0
Improving Language Identification of Accented Speech0
Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
Improving Native Language Identification by Using Spelling Errors0
Improving Native Language Identification with TF-IDF Weighting0
Improving the accuracy of pronunciation lexicon using Naive Bayes classifier with character n-gram as feature: for language classified pronunciation lexicon generation0
Improving the Character Ngram Model for the DSL Task with BM25 Weighting and Less Frequently Used Feature Sets0
Improving the exploitation of linguistic annotations in ELAN0
Improving the results of string kernels in sentiment analysis and Arabic dialect identification by adapting them to your test set0
Incorporating Dialectal Variability for Socially Equitable Language Identification0
Incremental N-gram Approach for Language Identification in Code-Switched Text0
Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID0
Influence of Mother Tongue on English Accent0
Show:102550
← PrevPage 9 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1wav2vec 2.0 LV-60KError rate7.2Unverified
2XLS-RError rate5.7Unverified
#ModelMetricClaimedVerifiedStatus
1GlotLIDMacro F10.98Unverified
#ModelMetricClaimedVerifiedStatus
1FastTextAccuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy91.37Unverified
#ModelMetricClaimedVerifiedStatus
1Apple bi-LSTMAccuracy86.93Unverified
#ModelMetricClaimedVerifiedStatus
1ConformerG-PAccuracy99.8Unverified