| AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities | Nov 12, 2022 | Contrastive LearningCross-Modal Retrieval | CodeCode Available | 4 |
| AdapterHub: A Framework for Adapting Transformers | Jul 15, 2020 | XLM-R | CodeCode Available | 2 |
| MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages | Apr 18, 2022 | intent-classificationIntent Classification | CodeCode Available | 2 |
| DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing | Nov 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer | Apr 30, 2020 | Cross-Lingual Transfernamed-entity-recognition | CodeCode Available | 2 |
| Zero-Shot Tokenizer Transfer | May 13, 2024 | XLM-R | CodeCode Available | 2 |
| X^2-VLM: All-In-One Pre-trained Model For Vision-Language Tasks | Nov 22, 2022 | AllCross-Modal Retrieval | CodeCode Available | 2 |
| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| X-METRA-ADA: Cross-lingual Meta-Transfer Learning Adaptation to Natural Language Understanding and Question Answering | Apr 20, 2021 | Cross-Lingual TransferMeta-Learning | CodeCode Available | 1 |
| Multilingual Sentence Transformer as A Multilingual Word Aligner | Jan 28, 2023 | SentenceWord Alignment | CodeCode Available | 1 |
| XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation | Apr 3, 2020 | Natural Language UnderstandingXLM-R | CodeCode Available | 1 |
| Emotion Classification in a Resource Constrained Language Using Transformer-based Approach | Apr 17, 2021 | ClassificationEmotion Classification | CodeCode Available | 1 |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | May 23, 2023 | BenchmarkingCross-Lingual Transfer | CodeCode Available | 1 |
| Towards Explainable Evaluation Metrics for Natural Language Generation | Mar 21, 2022 | Machine TranslationText Generation | CodeCode Available | 1 |
| ViHOS: Hate Speech Spans Detection for Vietnamese | Jan 24, 2023 | Hate Span IdentificationSequence-to-sequence Language Modeling | CodeCode Available | 1 |
| The Geometry of Multilingual Language Model Representations | May 22, 2022 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| VLUE: A Multi-Task Benchmark for Evaluating Vision-Language Models | May 30, 2022 | Vietnamese Natural Language UnderstandingXLM-R | CodeCode Available | 1 |
| LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain | Jan 30, 2023 | XLM-R | CodeCode Available | 1 |
| Inducing Language-Agnostic Multilingual Representations | Aug 20, 2020 | Cross-Lingual TransferSentence | CodeCode Available | 1 |
| PhoBERT: Pre-trained language models for Vietnamese | Mar 2, 2020 | Dependency Parsingnamed-entity-recognition | CodeCode Available | 1 |
| Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation | May 1, 2022 | Abstractive Text SummarizationCross-Lingual Abstractive Summarization | CodeCode Available | 1 |
| Unsupervised Cross-lingual Representation Learning at Scale | Nov 5, 2019 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media | Oct 27, 2023 | Cross-Lingual TransferFact Checking | CodeCode Available | 1 |
| XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond | Apr 25, 2021 | Language ModellingSentiment Analysis | CodeCode Available | 1 |
| Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages | May 20, 2023 | Language ModellingXLM-R | CodeCode Available | 1 |
| COVID-19 Named Entity Recognition for Vietnamese | Apr 8, 2021 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 1 |
| ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic | Dec 27, 2020 | DiversityXLM-R | CodeCode Available | 1 |
| DUMB: A Benchmark for Smart Evaluation of Dutch Models | May 22, 2023 | XLM-R | CodeCode Available | 1 |
| BERTweet: A pre-trained language model for English Tweets | May 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation | Oct 16, 2021 | Abstractive Text SummarizationCross-Lingual Abstractive Summarization | CodeCode Available | 1 |
| GREEK-BERT: The Greeks visiting Sesame Street | Aug 27, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Applying Occam's Razor to Transformer-Based Dependency Parsing: What Works, What Doesn't, and What is Really Necessary | Oct 23, 2020 | Dependency ParsingPart-Of-Speech Tagging | CodeCode Available | 1 |
| Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages | Dec 11, 2022 | Natural Language UnderstandingXLM-R | CodeCode Available | 1 |
| IndoNLI: A Natural Language Inference Dataset for Indonesian | Oct 27, 2021 | Natural Language InferenceSentence | CodeCode Available | 1 |
| Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference | Jun 7, 2021 | Cross-Lingual TransferNatural Language Inference | CodeCode Available | 1 |
| Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots | Mar 17, 2021 | Cross-Lingual TransferXLM-R | CodeCode Available | 1 |
| Improving Bilingual Lexicon Induction with Cross-Encoder Reranking | Oct 30, 2022 | Bilingual Lexicon InductionCross Encoder Reranking | CodeCode Available | 1 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language InferenceSentiment Analysis | CodeCode Available | 1 |
| Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning | Apr 13, 2022 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 1 |
| AmaSQuAD: A Benchmark for Amharic Extractive Question Answering | Feb 4, 2025 | Extractive Question-AnsweringQuestion Answering | —Unverified | 0 |
| Do Not Fire the Linguist: Grammatical Profiles Help Language Models Detect Semantic Change | Apr 12, 2022 | Change DetectionLanguage Modeling | —Unverified | 0 |
| BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification | Aug 16, 2022 | Classificationtext-classification | —Unverified | 0 |
| Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems | Jun 15, 2022 | Cross-Lingual Natural Language Inferenceintent-classification | —Unverified | 0 |
| Applying Occam’s Razor to Transformer-Based Dependency Parsing: What Works, What Doesn’t, and What is Really Necessary | Aug 1, 2021 | Dependency ParsingWord Embeddings | —Unverified | 0 |
| A Primer on Pretrained Multilingual Language Models | Jul 1, 2021 | Joint Multilingual Sentence RepresentationsMultilingual text classification | —Unverified | 0 |
| DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning | May 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BabyLMs for isiXhosa: Data-Efficient Language Modelling in a Low-Resource Context | Jan 7, 2025 | Language ModellingNER | —Unverified | 0 |
| Massively Multilingual Lexical Specialization of Multilingual Transformers | Aug 1, 2022 | Bilingual Lexicon InductionRetrieval | —Unverified | 0 |