| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Nov 9, 2022 | DecoderLanguage Modeling | CodeCode Available | 4 | 5 |
| On Bilingual Lexicon Induction with Large Language Models | Oct 21, 2023 | Bilingual Lexicon InductionCross-Lingual Word Embeddings | CodeCode Available | 1 | 5 |
| fugashi, a Tool for Tokenizing Japanese in Python | Oct 14, 2020 | Multilingual NLP | CodeCode Available | 1 | 5 |
| Improving Word Translation via Two-Stage Contrastive Learning | Nov 16, 2021 | Bilingual Lexicon InductionContrastive Learning | CodeCode Available | 1 | 5 |
| DetIE: Multilingual Open Information Extraction Inspired by Object Detection | Jun 24, 2022 | Multilingual NLPObject | CodeCode Available | 1 | 5 |
| Language-agnostic BERT Sentence Embedding | Jul 3, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Jun 1, 2021 | Hate Speech DetectionHurtful Sentence Completion | CodeCode Available | 1 | 5 |
| Improving Word Translation via Two-Stage Contrastive Learning | Mar 15, 2022 | Bilingual Lexicon InductionContrastive Learning | CodeCode Available | 1 | 5 |
| Improving Bilingual Lexicon Induction with Cross-Encoder Reranking | Oct 30, 2022 | Bilingual Lexicon InductionCross Encoder Reranking | CodeCode Available | 1 | 5 |
| PMIndia -- A Collection of Parallel Corpora of Languages of India | Jan 27, 2020 | Machine TranslationMultilingual NLP | CodeCode Available | 1 | 5 |