| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Nov 9, 2022 | Decoder, Language Modeling | Code Available | 4 | 5 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Jun 1, 2021 | Hate Speech Detection, Hurtful Sentence Completion | Code Available | 1 | 5 |
| Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages | Apr 12, 2021 | Machine Translation, Multilingual NLP | Code Available | 1 | 5 |
| DetIE: Multilingual Open Information Extraction Inspired by Object Detection | Jun 24, 2022 | Multilingual NLP, Object | Code Available | 1 | 5 |
| Improving Word Translation via Two-Stage Contrastive Learning | Nov 16, 2021 | Bilingual Lexicon Induction, Contrastive Learning | Code Available | 1 | 5 |
| Improving Bilingual Lexicon Induction with Cross-Encoder Reranking | Oct 30, 2022 | Bilingual Lexicon Induction, Cross Encoder Reranking | Code Available | 1 | 5 |
| Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Jan 9, 2021 | Dependency Parsing, Language Modeling | Code Available | 1 | 5 |
| Simultaneous Translation and Paraphrase for Language Education | Jul 1, 2020 | Machine Translation, Multilingual NLP | Code Available | 1 | 5 |
| Unsupervised Cross-lingual Representation Learning at Scale | Nov 5, 2019 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 | 5 |
| XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages | May 19, 2023 | In-Context Learning, Multilingual NLP | Code Available | 1 | 5 |
| Improving Word Translation via Two-Stage Contrastive Learning | Mar 15, 2022 | Bilingual Lexicon Induction, Contrastive Learning | Code Available | 1 | 5 |
| Language-agnostic BERT Sentence Embedding | Jul 3, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER | Nov 1, 2021 | Domain Adaptation, Multilingual Named Entity Recognition | Code Available | 1 | 5 |
| On Bilingual Lexicon Induction with Large Language Models | Oct 21, 2023 | Bilingual Lexicon Induction, Cross-Lingual Word Embeddings | Code Available | 1 | 5 |
| fugashi, a Tool for Tokenizing Japanese in Python | Oct 14, 2020 | Multilingual NLP | Code Available | 1 | 5 |
| PMIndia -- A Collection of Parallel Corpora of Languages of India | Jan 27, 2020 | Machine Translation, Multilingual NLP | Code Available | 1 | 5 |
| A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets | Mar 6, 2024 | Diversity, Multilingual NLP | Code Available | 0 | 5 |
| Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca | Sep 16, 2023 | Instruction Following, Large Language Model | Code Available | 0 | 5 |
| MMCR4NLP: Multilingual Multiway Corpora Repository for Natural Language Processing | Oct 3, 2017 | Machine Translation, Multilingual NLP | Code Available | 0 | 5 |
| News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation | Jun 18, 2024 | Cross-Lingual Transfer, Domain Adaptation | Code Available | 0 | 5 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | Denoising, Language Modeling | Code Available | 0 | 5 |
| Analysing The Impact Of Linguistic Features On Cross-Lingual Transfer | May 12, 2021 | Cross-Lingual Transfer, Multilingual NLP | Code Available | 0 | 5 |
| Cultural and Geographical Influences on Image Translatability of Words across Languages | Jun 1, 2021 | Cultural Vocal Bursts Intensity Prediction, Low Resource Neural Machine Translation | Code Available | 0 | 5 |
| Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis Models | May 7, 2024 | Multilingual NLP, Sentiment Analysis | Code Available | 0 | 5 |
| BaitBuster-Bangla: A Comprehensive Dataset for Clickbait Detection in Bangla with Multi-Feature and Multi-Modal Analysis | Oct 13, 2023 | Classification, Clickbait Detection | Code Available | 0 | 5 |