| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| GREEK-BERT: The Greeks visiting Sesame Street | Aug 27, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic | Dec 27, 2020 | Diversity, XLM-R | Code Available | 1 |
| Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages | Dec 11, 2022 | Natural Language Understanding, XLM-R | Code Available | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages | May 20, 2023 | Language Modelling, XLM-R | Code Available | 1 |
| COVID-19 Named Entity Recognition for Vietnamese | Apr 8, 2021 | named-entity-recognition, Named Entity Recognition | Code Available | 1 |
| DUMB: A Benchmark for Smart Evaluation of Dutch Models | May 22, 2023 | XLM-R | Code Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identification, Language Modeling | Code Available | 1 |
| BERTweet: A pre-trained language model for English Tweets | May 20, 2020 | Language Modeling, Language Modelling | Code Available | 1 |