SOTAVerified

Language Identification

Language identification is the task of determining the language of a text.
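
As a rough illustration of the task, the sketch below guesses a language by comparing a text's character trigrams with per-language trigram profiles. It is a toy under stated assumptions, not the method of any paper listed here: the three sample sentences, the `trigrams` and `identify` helpers, and the overlap score are all hypothetical choices made for this example.

```python
from collections import Counter

# Tiny illustrative samples (assumed for this sketch, not taken from any benchmark).
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and the cat sat on the mat",
    "de": "der schnelle braune fuchs springt über den faulen hund und die katze sitzt auf der matte",
    "fr": "le renard brun rapide saute par-dessus le chien paresseux et le chat est assis sur le tapis",
}


def trigrams(text):
    """Count character trigrams of the lowercased text, padded with spaces."""
    padded = f"  {text.lower()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


# One trigram profile per language, built from the sample sentences.
PROFILES = {lang: trigrams(sample) for lang, sample in SAMPLES.items()}


def identify(text):
    """Return the language whose profile overlaps most with the text's trigrams."""
    grams = trigrams(text)

    def score(profile):
        total = sum(profile.values())
        # Counter returns 0 for trigrams missing from the profile.
        return sum(count * profile[g] / total for g, count in grams.items())

    return max(PROFILES, key=lambda lang: score(PROFILES[lang]))


if __name__ == "__main__":
    for sentence in ("the dog and the cat", "der hund und die katze", "le chat et le chien"):
        print(sentence, "->", identify(sentence))
```

Real systems train on far larger corpora and cover many more languages; the sketch only shows the input and output shape of the task.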

Papers

Showing 351–400 of 794 papers

Title | Status | Hype
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++ | Code | 0
Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification | Code | 0
Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts | | 0
A Study on Spoken Language Identification using Deep Neural Networks | | 0
WOLI at SemEval-2020 Task 12: Arabic Offensive Language Identification on Different Twitter Datasets | | 0
Garain at SemEval-2020 Task 12: Sequence based Deep Learning for Categorizing Offensive Language in Social Media | | 0
Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpus | | 0
SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets | | 0
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer | Code | 1
LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT? | | 0
Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages | Code | 0
SalamNET at SemEval-2020 Task12: Deep Learning Approach for Arabic Offensive Language Detection | | 0
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media | Code | 1
Duluth at SemEval-2020 Task 12: Offensive Tweet Identification in English with Logistic Regression | | 0
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches | Code | 0
XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language Identification in Social Media Using Transformer Encoders | | 0
Dialect Diversity in Text Summarization on Twitter | | 0
Fine-grained Language Identification with Multilingual CapsNet Model | | 0
The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results | | 0
Feature Selection on Noisy Twitter Short Text Messages for Language Identification | | 0
Streaming End-to-End Bilingual ASR Systems with Joint Language Identification | | 0
Cross-lingual Inductive Transfer to Detect Offensive Language | | 0
An Assessment of Language Identification Methods on Tweets and Wikipedia Articles | | 0
A Report on the 2020 VUA and TOEFL Metaphor Detection Shared Task | | 0
OpusFilter: A Configurable Parallel Corpus Filtering Toolbox | | 0
GLUECoS: An Evaluation Benchmark for Code-Switched NLP | | 0
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions | | 0
Adaptation de domaine non supervisée pour la reconnaissance de la langue par régularisation d'un réseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network) | | 0
Lexical Normalization for Code-switched Data and its Effect on POS-tagging | | 0
Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses | | 0
Identification/Segmentation of Indian Regional Languages with Singular Value Decomposition based Feature Embedding | | 0
LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation | | 0
LIIR at SemEval-2020 Task 12: A Cross-Lingual Augmentation Approach for Multilingual Offensive Language Identification | | 0
OpusTools and Parallel Corpus Diagnostics | | 0
Search Query Language Identification Using Weak Labeling | | 0
On The Performance of Time-Pooling Strategies for End-to-End Spoken Language Identification | | 0
Building Web Corpora for Minority Languages | | 0
Two LRL & Distractor Corpora from Web Information Retrieval and a Small Case Study in Language Identification without Training Corpora | | 0
Hitachi at SemEval-2020 Task 12: Offensive Language Identification with Noisy Labels using Statistical Sampling and Post-Processing | | 0
SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification | | 0
Detect Language of Transliterated Texts | | 0
GLUECoS : An Evaluation Benchmark for Code-Switched NLP | | 0
On the Language Neutrality of Pre-trained Multilingual Representations | Code | 0
Mapping Languages: The Corpus of Global Language Use | | 0
Towards Relevance and Sequence Modeling in Language Recognition | | 0
Offensive Language Identification in Greek | Code | 0
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition | | 0
Identification of Indian Languages using Ghost-VLAD pooling | | 0
Improving Language Identification for Multilingual Speakers | | 0
Common Voice: A Massively-Multilingual Speech Corpus | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | wav2vec 2.0 LV-60K | Error rate | 7.2 | | Unverified
2 | XLS-R | Error rate | 5.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GlotLID | Macro F1 | 0.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FastText | Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 91.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Apple bi-LSTM | Accuracy | 86.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConformerG-P | Accuracy | 99.8 | | Unverified
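
The benchmark entries above report three different metrics: error rate, accuracy, and macro F1. The sketch below shows one common way to compute them from gold and predicted language labels. It assumes that error rate means 1 minus accuracy, which the listing does not state, and the `accuracy` and `macro_f1` helpers and the example labels are hypothetical. The listing also appears to mix fractional values (0.97) with percentages (91.37), so scores from different tables should not be compared directly.

```python
def accuracy(gold, pred):
    """Fraction of items whose predicted label equals the gold label."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred):
    """Unweighted mean of per-label F1, so rare languages count as much as common ones."""
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        tp = sum(g == label and p == label for g, p in zip(gold, pred))
        fp = sum(g != label and p == label for g, p in zip(gold, pred))
        fn = sum(g == label and p != label for g, p in zip(gold, pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the label never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Made-up labels: the single German item is misclassified as English.
gold = ["en", "en", "en", "en", "de", "fr"]
pred = ["en", "en", "en", "en", "en", "fr"]

acc = accuracy(gold, pred)      # 5 of 6 correct
error_rate = 1 - acc            # assumption: error rate = 1 - accuracy
f1 = macro_f1(gold, pred)       # mean of per-label F1 for de, en, fr

if __name__ == "__main__":
    print(f"accuracy={acc:.3f} error_rate={error_rate:.3f} macro_f1={f1:.3f}")
```

In this example accuracy stays high while macro F1 drops sharply, because the one missed language contributes an F1 of zero to the unweighted mean.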