| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages | Dec 11, 2022 | Natural Language UnderstandingXLM-R | CodeCode Available | 1 | 5 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages | May 20, 2023 | Language ModellingXLM-R | CodeCode Available | 1 | 5 |
| DUMB: A Benchmark for Smart Evaluation of Dutch Models | May 22, 2023 | XLM-R | CodeCode Available | 1 | 5 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language InferenceSentiment Analysis | CodeCode Available | 1 | 5 |
| ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic | Dec 27, 2020 | DiversityXLM-R | CodeCode Available | 1 | 5 |
| Emotion Classification in a Resource Constrained Language Using Transformer-based Approach | Apr 17, 2021 | ClassificationEmotion Classification | CodeCode Available | 1 | 5 |
| BERTweet: A pre-trained language model for English Tweets | May 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Applying Occam's Razor to Transformer-Based Dependency Parsing: What Works, What Doesn't, and What is Really Necessary | Oct 23, 2020 | Dependency ParsingPart-Of-Speech Tagging | CodeCode Available | 1 | 5 |