| Transliteration for Low-Resource Code-Switching Texts: Building an Automatic Cyrillic-to-Latin Converter for Tatar | Jun 1, 2021 | Language IdentificationTransliteration | —Unverified | 0 | 0 |
| TuGeBiC: A Turkish German Bilingual Code-Switching Corpus | May 2, 2022 | Language Identification | —Unverified | 0 | 0 |
| Turkish Native Language Identification | Jul 27, 2023 | Language IdentificationNative Language Identification | —Unverified | 0 | 0 |
| TwistBytes - Identification of Cuneiform Languages and German Dialects at VarDial 2019 | Jun 1, 2019 | Dialect IdentificationLanguage Identification | —Unverified | 0 | 0 |
| Twitter Language Identification Of Similar Languages And Dialects Without Ground Truth | Apr 1, 2017 | General ClassificationLanguage Identification | —Unverified | 0 | 0 |
| Twitter Universal Dependency Parsing for African-American and Mainstream American English | Jul 1, 2018 | Dependency ParsingInformation Retrieval | —Unverified | 0 | 0 |
| Two LRL \& Distractor Corpora from Web Information Retrieval and a Small Case Study in Language Identification without Training Corpora | May 1, 2020 | Information RetrievalLanguage Identification | —Unverified | 0 | 0 |
| Two-stage Training for Chinese Dialect Recognition | Aug 6, 2019 | Language IdentificationVocal Bursts Valence Prediction | —Unverified | 0 | 0 |
| Typological Features for Multilingual Delexicalised Dependency Parsing | Jun 1, 2019 | Dependency ParsingLanguage Identification | —Unverified | 0 | 0 |
| UJNLP at SemEval-2020 Task 12: Detecting Offensive Language Using Bidirectional Transformers | Dec 1, 2020 | Language IdentificationSentence | —Unverified | 0 | 0 |
| Universal and non-universal text statistics: Clustering coefficient for language identification | Nov 18, 2019 | ClusteringLanguage Identification | —Unverified | 0 | 0 |
| Universal Dependencies Treebank for Tatar: Incorporating Intra-Word Code-Switching Information | Jun 1, 2022 | Language IdentificationPOS | —Unverified | 0 | 0 |
| Unravelling Interlanguage Facts via Explainable Machine Learning | Aug 2, 2022 | BIG-bench Machine LearningLanguage Identification | —Unverified | 0 | 0 |
| Unsupervised Code-Switching for Multilingual Historical Document Transcription | May 1, 2015 | Language IdentificationLanguage Modeling | —Unverified | 0 | 0 |
| Unsupervised Deep Language and Dialect Identification for Short Texts | Dec 1, 2020 | Dialect IdentificationLanguage Identification | —Unverified | 0 | 0 |
| Unsupervised Feature Learning for Visual Sign Language Identification | Jun 1, 2014 | Information RetrievalLanguage Identification | —Unverified | 0 | 0 |
| Unsupervised neural adaptation model based on optimal transport for spoken language identification | Dec 24, 2020 | Language IdentificationSpoken language identification | —Unverified | 0 | 0 |
| Unsupervised Personality-Aware Language Identification | Sep 17, 2021 | Language Identification | —Unverified | 0 | 0 |
| Unsupervised Preference-Aware Language Identification | Nov 16, 2021 | Language Identification | —Unverified | 0 | 0 |
| UNT Linguistics at SemEval-2020 Task 12: Linear SVC with Pre-trained Word Embeddings as Document Vectors and Targeted Linguistic Features | Dec 1, 2020 | Language IdentificationWord Embeddings | —Unverified | 0 | 0 |
| Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpus | Aug 27, 2020 | Language Identification | —Unverified | 0 | 0 |
| Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora | Dec 1, 2020 | Language Identification | —Unverified | 0 | 0 |
| URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors | Apr 1, 2017 | Language IdentificationLanguage Modeling | —Unverified | 0 | 0 |
| Using Classifier Features to Determine Language Transfer on Morphemes | Jun 1, 2018 | Cross-corpusLanguage Acquisition | —Unverified | 0 | 0 |
| Using Maximum Entropy Models to Discriminate between Similar Languages and Varieties | Aug 1, 2014 | Language IdentificationSentiment Analysis | —Unverified | 0 | 0 |
| Using N-gram and Word Network Features for Native Language Identification | Jun 1, 2013 | Language IdentificationNative Language Identification | —Unverified | 0 | 0 |
| Using Other Learner Corpora in the 2013 NLI Shared Task | Jun 1, 2013 | Domain AdaptationLanguage Identification | —Unverified | 0 | 0 |
| Using Shallow Syntactic Features to Measure Influences of L1 and Proficiency Level in EFL Writings | May 1, 2015 | Language Identification | —Unverified | 0 | 0 |
| Using Social Networks to Improve Language Variety Identification with Neural Networks | Nov 1, 2017 | Language Identification | —Unverified | 0 | 0 |
| Utterance-level end-to-end language identification using attention-based CNN-BLSTM | Feb 20, 2019 | Language Identification | —Unverified | 0 | 0 |
| Validating and Exploring Large Geographic Corpora | Mar 13, 2024 | Language IdentificationOutlier Detection | —Unverified | 0 | 0 |
| Vanilla Classifiers for Distinguishing between Similar Languages | Dec 1, 2016 | Information RetrievalLanguage Identification | —Unverified | 0 | 0 |
| VarClass: An Open-source Language Identification Tool for Language Varieties | May 1, 2014 | Information RetrievalLanguage Identification | —Unverified | 0 | 0 |
| VAST: A Corpus of Video Annotation for Speech Technologies | May 1, 2018 | Action DetectionLanguage Identification | —Unverified | 0 | 0 |
| Vector Space Model as Cognitive Space for Text Classification | Aug 21, 2017 | Author ProfilingClassification | —Unverified | 0 | 0 |
| Vers la correction automatique de textes bruit\'es: Architecture g\'en\'erale et d\'etermination de la langue d'un mot inconnu (Towards Automatic Spell-Checking of Noisy Texts : General Architecture and Language Identification for Unknown Words) [in French] | Jun 1, 2012 | Language IdentificationSpelling Correction | —Unverified | 0 | 0 |
| Visual Script and Language Identification | Jan 8, 2016 | Language Identification | —Unverified | 0 | 0 |
| Vocabulary-Based Language Similarity using Web Corpora | May 1, 2014 | Language IdentificationTransliteration | —Unverified | 0 | 0 |
| VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION | Nov 25, 2020 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| VTEX System Description for the NLI 2013 Shared Task | Jun 1, 2013 | Language IdentificationText Classification | —Unverified | 0 | 0 |
| Wavelet Scattering Transform for Improving Generalization in Low-Resourced Spoken Language Identification | Oct 1, 2023 | Language IdentificationSpoken language identification | —Unverified | 0 | 0 |
| When Sparse Traditional Models Outperform Dense Neural Networks: the Curious Case of Discriminating between Similar Languages | Apr 1, 2017 | Language Identification | —Unverified | 0 | 0 |
| Whispy: Adapting STT Whisper Models to Real-Time Environments | May 6, 2024 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| WLV-RIT at HASOC-Dravidian-CodeMix-FIRE2020: Offensive Language Identification in Code-switched YouTube Comments | Nov 1, 2020 | Language IdentificationTransfer Learning | —Unverified | 0 | 0 |
| WOLI at SemEval-2020 Task 12: Arabic Offensive Language Identification on Different Twitter Datasets | Sep 11, 2020 | Language Identification | —Unverified | 0 | 0 |
| Word-Level Language Identification and Predicting Codeswitching Points in Swahili-English Language Data | Nov 1, 2016 | Language IdentificationSentiment Analysis | —Unverified | 0 | 0 |
| Word-level Language Identification in Bi-lingual Code-switched Texts | Dec 1, 2014 | Language IdentificationOpinion Mining | —Unverified | 0 | 0 |
| Word Level Language Identification in English Telugu Code Mixed Data | Oct 9, 2020 | Language IdentificationMachine Translation | —Unverified | 0 | 0 |
| Word Level Language Identification in Online Multilingual Communication | Oct 1, 2013 | Document ClassificationLanguage Identification | —Unverified | 0 | 0 |
| Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System | Oct 1, 2014 | Language Identification | —Unverified | 0 | 0 |