| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Bilingual Lexicon Induction with Cross-Encoder Reranking | Oct 30, 2022 | Bilingual Lexicon InductionCross Encoder Reranking | CodeCode Available | 1 |
| DUMB: A Benchmark for Smart Evaluation of Dutch Models | May 22, 2023 | XLM-R | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain | Jan 30, 2023 | XLM-R | CodeCode Available | 1 |
| Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media | Oct 27, 2023 | Cross-Lingual TransferFact Checking | CodeCode Available | 1 |
| ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic | Dec 27, 2020 | DiversityXLM-R | CodeCode Available | 1 |
| COVID-19 Named Entity Recognition for Vietnamese | Apr 8, 2021 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 1 |
| BERTweet: A pre-trained language model for English Tweets | May 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |