SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.
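
As a rough illustration of the task, here is a minimal sketch of one classic approach, character n-gram profile matching. It is not taken from any paper listed here. The training sentences in `SAMPLES` and the function names `char_ngrams` and `identify_language` are hypothetical, chosen only for this example; practical systems train on far more data.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Count character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


# Hypothetical toy training data: one short sample per language.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and this is the language of a text",
    "de": "der schnelle braune fuchs springt über den faulen hund und das ist die sprache eines textes",
    "fr": "le renard brun rapide saute par-dessus le chien paresseux et ceci est la langue d'un texte",
}

# One n-gram frequency profile per language.
PROFILES = {lang: char_ngrams(sample) for lang, sample in SAMPLES.items()}


def identify_language(text):
    """Return the language whose profile shares the most n-grams with the text."""
    query = char_ngrams(text)

    def overlap(profile):
        # Counter returns 0 for n-grams it has never seen.
        return sum(min(count, profile[gram]) for gram, count in query.items())

    return max(PROFILES, key=lambda lang: overlap(PROFILES[lang]))


if __name__ == "__main__":
    print(identify_language("the dog is lazy"))
    print(identify_language("der hund ist faul"))
```

Overlap counting is used here only because it is easy to follow; scoring with smoothed log-probabilities is generally more robust on short or noisy inputs.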

Papers

Showing 301-350 of 794 papers

| Title | Status | Hype |
|---|---|---|
| Cognate and Misspelling Features for Natural Language Identification |  | 0 |
| Codewithzichao@DravidianLangTech-EACL2021: Exploring Multilingual Transformers for Offensive Language Identification on Code Mixing Text |  | 0 |
| A survey on phrase structure learning methods for text classification |  | 0 |
| Code-Switching Ubique Est - Language Identification and Part-of-Speech Tagging for Historical Mixed Text |  | 0 |
| Gender Prediction in English-Hindi Code-Mixed Social Media Content: Corpus and Baseline System |  | 0 |
| Codeswitching language identification using Subword Information Enriched Word Vectors |  | 0 |
| A Study on Spoken Language Identification using Deep Neural Networks |  | 0 |
| American Sign Language Identification Using Hand Trackpoint Analysis |  | 0 |
| Garain at SemEval-2020 Task 12: Sequence based Deep Learning for Categorizing Offensive Language in Social Media |  | 0 |
| Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models |  | 0 |
| Fusion of Simple Models for Native Language Identification |  | 0 |
| Generation through the lens of learning theory |  | 0 |
| Code-Switched Named Entity Recognition with Embedding Attention |  | 0 |
| Generative linguistic representation for spoken language identification |  | 0 |
| Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability |  | 0 |
| Fully Connected Neural Network with Advance Preprocessor to Identify Aggression over Facebook and Twitter |  | 0 |
| Code Switched and Code Mixed Speech Recognition for Indic languages |  | 0 |
| GlobalPhone: Pronunciation Dictionaries in 20 Languages |  | 0 |
| From Visualisation to Hypothesis Construction for Second Language Acquisition |  | 0 |
| Code Mixing: A Challenge for Language Identification in the Language of Social Media |  | 0 |
| GLUECoS : An Evaluation Benchmark for Code-Switched NLP |  | 0 |
| GLUECoS: An Evaluation Benchmark for Code-Switched NLP |  | 0 |
| HAD-Tübingen at SemEval-2019 Task 6: Deep Learning Analysis of Offensive Language on Twitter: Identification and Categorization |  | 0 |
| HaT5: Hate Language Identification using Text-to-Text Transfer Transformer |  | 0 |
| ASIREM Participation at the Discriminating Similar Languages Shared Task 2016 |  | 0 |
| A Compact End-to-End Model with Local and Global Context for Spoken Language Identification |  | 0 |
| HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles |  | 0 |
| HeLI-based Experiments in Swiss German Dialect Identification |  | 0 |
| Advancing Linguistic Features and Insights by Label-informed Feature Grouping: An Exploration in the Context of Native Language Identification |  | 0 |
| HHU at SemEval-2019 Task 6: Context Does Matter - Tackling Offensive Language Identification and Categorization with ELMo |  | 0 |
| From Language to Family and Back: Native Language and Language Family Identification from English Text |  | 0 |
| Hindi-English Code-Switching Speech Corpus |  | 0 |
| Hitachi at SemEval-2020 Task 12: Offensive Language Identification with Noisy Labels using Statistical Sampling and Post-Processing |  | 0 |
| HUB@DravidianLangTech-EACL2021: Identify and Classify Offensive Text in Multilingual Code Mixing in Social Media |  | 0 |
| Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition |  | 0 |
| Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition |  | 0 |
| Hypers@DravidianLangTech-EACL2021: Offensive language identification in Dravidian code-mixed YouTube Comments and Posts |  | 0 |
| CN-HIT-MI.T at SemEval-2019 Task 6: Offensive Language Identification Based on BiLSTM with Double Attention |  | 0 |
| From 'Solved Problems' to New Challenges: A Report on LDC Activities |  | 0 |
| Identification of Indian Languages using Ghost-VLAD pooling |  | 0 |
| Identification of Languages in Algerian Arabic Multilingual Documents |  | 0 |
| Identification/Segmentation of Indian Regional Languages with Singular Value Decomposition based Feature Embedding |  | 0 |
| Identifying Languages at the Word Level in Code-Mixed Indian Social Media Text |  | 0 |
| IIITG-ADBU at SemEval-2020 Task 12: Comparison of BERT and BiLSTM in Detecting Offensive Language |  | 0 |
| Fluency detection on communication networks |  | 0 |
| CLUZH at VarDial GDI 2017: Testing a Variety of Machine Learning Tools for the Classification of Swiss German Dialects |  | 0 |
| IIT (BHU) System for Indo-Aryan Language Identification (ILI) at VarDial 2018 |  | 0 |
| IITP-AINLPML at SemEval-2020 Task 12: Offensive Tweet Identification and Target Categorization in a Multitask Environment |  | 0 |
| (Im)possibility of Automated Hallucination Detection in Large Language Models |  | 0 |
| A Simple and Efficient Probabilistic Language model for Code-Mixed Text |  | 0 |

Benchmark Results

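| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | wav2vec 2.0 LV-60K | Error rate | 7.2 |  | Unverified |
| 2 | XLS-R | Error rate | 5.7 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GlotLID | Macro F1 | 0.98 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FastText | Accuracy | 0.97 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Apple bi-LSTM | Accuracy | 91.37 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Apple bi-LSTM | Accuracy | 86.93 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConformerG-P | Accuracy | 99.8 |  | Unverified |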