| Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Jan 9, 2021 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| fugashi, a Tool for Tokenizing Japanese in Python | Oct 14, 2020 | Multilingual NLP | CodeCode Available | 1 |
| Language-agnostic BERT Sentence Embedding | Jul 3, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Simultaneous Translation and Paraphrase for Language Education | Jul 1, 2020 | Machine TranslationMultilingual NLP | CodeCode Available | 1 |
| PMIndia -- A Collection of Parallel Corpora of Languages of India | Jan 27, 2020 | Machine TranslationMultilingual NLP | CodeCode Available | 1 |
| Unsupervised Cross-lingual Representation Learning at Scale | Nov 5, 2019 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks | May 30, 2025 | Cross-Lingual TransferMultilingual NLP | —Unverified | 0 |
| SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods | May 29, 2025 | Cross-Lingual TransferMultilingual NLP | —Unverified | 0 |
| LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization | May 21, 2025 | Distributed OptimizationMultilingual NLP | —Unverified | 0 |
| Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities | May 21, 2025 | MemorizationMultilingual NLP | —Unverified | 0 |