| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Nov 9, 2022 | Decoder, Language Modeling | Code Available | 4 |
| On Bilingual Lexicon Induction with Large Language Models | Oct 21, 2023 | Bilingual Lexicon Induction, Cross-Lingual Word Embeddings | Code Available | 1 |
| XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages | May 19, 2023 | In-Context Learning, Multilingual NLP | Code Available | 1 |
| Improving Bilingual Lexicon Induction with Cross-Encoder Reranking | Oct 30, 2022 | Bilingual Lexicon Induction, Cross Encoder Reranking | Code Available | 1 |
| DetIE: Multilingual Open Information Extraction Inspired by Object Detection | Jun 24, 2022 | Multilingual NLP, Object | Code Available | 1 |
| Improving Word Translation via Two-Stage Contrastive Learning | Mar 15, 2022 | Bilingual Lexicon Induction, Contrastive Learning | Code Available | 1 |
| Improving Word Translation via Two-Stage Contrastive Learning | Nov 16, 2021 | Bilingual Lexicon Induction, Contrastive Learning | Code Available | 1 |
| WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER | Nov 1, 2021 | Domain Adaptation, Multilingual Named Entity Recognition | Code Available | 1 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Jun 1, 2021 | Hate Speech Detection, Hurtful Sentence Completion | Code Available | 1 |
| Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages | Apr 12, 2021 | Machine Translation, Multilingual NLP | Code Available | 1 |
| Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Jan 9, 2021 | Dependency Parsing, Language Modeling | Code Available | 1 |
| fugashi, a Tool for Tokenizing Japanese in Python | Oct 14, 2020 | Multilingual NLP | Code Available | 1 |
| Language-agnostic BERT Sentence Embedding | Jul 3, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Simultaneous Translation and Paraphrase for Language Education | Jul 1, 2020 | Machine Translation, Multilingual NLP | Code Available | 1 |
| PMIndia -- A Collection of Parallel Corpora of Languages of India | Jan 27, 2020 | Machine Translation, Multilingual NLP | Code Available | 1 |
| Unsupervised Cross-lingual Representation Learning at Scale | Nov 5, 2019 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks | May 30, 2025 | Cross-Lingual Transfer, Multilingual NLP | Unverified | 0 |
| SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods | May 29, 2025 | Cross-Lingual Transfer, Multilingual NLP | Unverified | 0 |
| Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities | May 21, 2025 | Memorization, Multilingual NLP | Unverified | 0 |
| LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization | May 21, 2025 | Distributed Optimization, Multilingual NLP | Unverified | 0 |
| HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing | May 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Cross-Linguistic Transfer in Multilingual NLP: The Role of Language Families and Morphology | May 20, 2025 | Cross-Lingual Transfer, Multilingual NLP | Unverified | 0 |
| Multilingual Prompt Engineering in Large Language Models: A Survey Across NLP Tasks | May 16, 2025 | Multilingual NLP, Prompt Engineering | Unverified | 0 |
| Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting | Apr 15, 2025 | Fairness, Multilingual NLP | Unverified | 0 |
| Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models | Mar 19, 2025 | Fact Checking, Fact Verification | Unverified | 0 |