| Multilingual Large Language Models and Curse of Multilinguality | Jun 15, 2024 | DecoderXLM-R | —Unverified | 0 | 0 |
| Multilingual Pre-training with Universal Dependency Learning | Dec 1, 2021 | Dependency ParsingLanguage Modeling | —Unverified | 0 | 0 |
| Massively Multilingual Lexical Specialization of Multilingual Transformers | Aug 1, 2022 | Bilingual Lexicon InductionRetrieval | —Unverified | 0 | 0 |
| Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching | Nov 16, 2021 | Contrastive LearningKnowledge Distillation | —Unverified | 0 | 0 |
| VTCC-NLP at NL4Opt competition subtask 1: An Ensemble Pre-trained language models for Named Entity Recognition | Dec 14, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 | 0 |
| Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval | Jan 17, 2025 | Data AugmentationDomain Adaptation | —Unverified | 0 | 0 |
| Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages | Mar 1, 2021 | Depth EstimationDepth Prediction | —Unverified | 0 | 0 |
| What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties | Sep 27, 2020 | SentenceXLM-R | —Unverified | 0 | 0 |
| NICT Kyoto Submission for the WMT’21 Quality Estimation Task: Multimetric Multilingual Pretraining for Critical Error Detection | Nov 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| A Multilingual Reading Comprehension System for more than 100 Languages | Dec 1, 2020 | Machine Reading ComprehensionMachine Translation | —Unverified | 0 | 0 |
| On Learning Universal Representations Across Languages | Jul 31, 2020 | Contrastive LearningCross-Lingual Natural Language Inference | —Unverified | 0 | 0 |
| Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings | Apr 30, 2020 | Cross-Lingual TransferModel Selection | —Unverified | 0 | 0 |
| On the Universality of Deep Contextual Language Models | Sep 15, 2021 | Cross-Lingual TransferXLM-R | —Unverified | 0 | 0 |
| ZeroBERTo: Leveraging Zero-Shot Text Classification by Topic Modeling | Jan 4, 2022 | Classificationtext-classification | —Unverified | 0 | 0 |
| Automatic Sexism Detection with Multilingual Transformer Models | Jun 9, 2021 | Binary ClassificationClassification | —Unverified | 0 | 0 |
| Priberam Labs at the 3rd Shared Task on SlavNER | Apr 1, 2021 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 | 0 |
| AmaSQuAD: A Benchmark for Amharic Extractive Question Answering | Feb 4, 2025 | Extractive Question-AnsweringQuestion Answering | —Unverified | 0 | 0 |
| Prix-LM: Pretraining for Multilingual Knowledge Base Construction | Nov 16, 2021 | Bilingual Lexicon InductionCausal Language Modeling | —Unverified | 0 | 0 |
| XGLUE: A New Benchmark Datasetfor Cross-lingual Pre-training, Understanding and Generation | Nov 1, 2020 | Natural Language UnderstandingXLM-R | —Unverified | 0 | 0 |
| ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification | Sep 19, 2022 | ArticlesLexical Simplification | —Unverified | 0 | 0 |
| Retrofitting Large Language Models with Dynamic Tokenization | Nov 27, 2024 | DecoderFairness | —Unverified | 0 | 0 |
| RobertNLP at the IWPT 2021 Shared Task: Simple Enhanced UD Parsing for 17 Languages | Aug 1, 2021 | Dependency ParsingXLM-R | —Unverified | 0 | 0 |
| Saliency-based Multi-View Mixed Language Training for Zero-shot Cross-lingual Classification | Nov 1, 2021 | Cross-Lingual Sentiment ClassificationDialogue State Tracking | —Unverified | 0 | 0 |
| Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models | Jul 12, 2023 | QuantizationXLM-R | —Unverified | 0 | 0 |
| SenseCluster at SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection | Dec 1, 2020 | Change DetectionClustering | —Unverified | 0 | 0 |
| Siamese Networks for Inference in Malayalam Language Texts | Sep 1, 2021 | Binary ClassificationClassification | —Unverified | 0 | 0 |
| SkoltechNLP at SemEval-2021 Task 2: Generating Cross-Lingual Training Data for the Word-in-Context Task | Aug 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget | Jul 1, 2022 | GPUNER | —Unverified | 0 | 0 |
| SMTCE: A Social Media Text Classification Evaluation Benchmark and BERTology Models for Vietnamese | Sep 21, 2022 | Classificationtext-classification | —Unverified | 0 | 0 |
| Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024 | Apr 23, 2024 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 | 0 |
| Solution for Emotion Prediction Competition of Workshop on Emotionally and Culturally Intelligent AI | Mar 26, 2024 | DiversityXLM-R | —Unverified | 0 | 0 |
| Subasa -- Adapting Language Models for Low-resourced Offensive Language Detection in Sinhala | Apr 2, 2025 | XLM-R | —Unverified | 0 | 0 |
| Diagnosing Transformers in Task-Oriented Semantic Parsing | May 27, 2021 | Semantic Parsingvalid | —Unverified | 0 | 0 |
| Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models | Aug 17, 2023 | FairnessXLM-R | —Unverified | 0 | 0 |
| Targeted Multilingual Adaptation for Low-resource Language Families | May 20, 2024 | XLM-R | —Unverified | 0 | 0 |
| Team “DaDeFrNi” at CASE 2021 Task 1: Document and Sentence Classification for Protest Event Detection | Aug 1, 2021 | ArticlesBinary Classification | —Unverified | 0 | 0 |
| Distilling Large Language Models into Tiny and Effective Students using pQRNN | Jan 21, 2021 | Data AugmentationSemantic Parsing | —Unverified | 0 | 0 |
| DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning | May 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Team Rouges at SemEval-2020 Task 12: Cross-lingual Inductive Transfer to Detect Offensive Language | Dec 1, 2020 | Language IdentificationPosition | —Unverified | 0 | 0 |
| Do Multilingual Language Models Capture Differing Moral Norms? | Mar 18, 2022 | SentenceXLM-R | —Unverified | 0 | 0 |
| Do Not Fire the Linguist: Grammatical Profiles Help Language Models Detect Semantic Change | Apr 12, 2022 | Change DetectionLanguage Modeling | —Unverified | 0 | 0 |
| Debating Europe: A Multilingual Multi-Target Stance Classification Dataset of Online Debates | Jun 1, 2022 | Stance ClassificationXLM-R | —Unverified | 0 | 0 |
| CUET-NLP@TamilNLP-ACL2022: Multi-Class Textual Emotion Detection from Social Media using Transformer | May 1, 2022 | Emotion RecognitionOpinion Mining | —Unverified | 0 | 0 |
| Emotion Stimulus Detection in German News Headlines | Jul 27, 2021 | Emotion RecognitionSentence | —Unverified | 0 | 0 |
| English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too | May 26, 2020 | Cross-Lingual TransferHellaSwag | —Unverified | 0 | 0 |
| English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too | Jun 3, 2020 | Cross-Lingual TransferQuestion Answering | —Unverified | 0 | 0 |
| Cross-neutralising: Probing for joint encoding of linguistic information in multilingual models | Oct 24, 2020 | SentenceXLM-R | —Unverified | 0 | 0 |
| TEET! Tunisian Dataset for Toxic Speech Detection | Oct 11, 2021 | Feature EngineeringXLM-R | —Unverified | 0 | 0 |
| Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models | Jan 26, 2025 | XLM-R | —Unverified | 0 | 0 |
| TeluguNER: Leveraging Multi-Domain Named Entity Recognition with Deep Transformers | May 1, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 | 0 |