| On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations? | Jun 20, 2024 | Machine TranslationMultilingual NLP | —Unverified | 0 |
| ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus | Jul 14, 2021 | Multilingual NLPTransfer Learning | —Unverified | 0 |
| Patterns of Persistence and Diffusibility across the World's Languages | Jan 3, 2024 | Multilingual NLP | —Unverified | 0 |
| Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models | Mar 19, 2025 | Fact CheckingFact Verification | —Unverified | 0 |
| Polyglot: Distributed Word Representations for Multilingual NLP | Jul 5, 2013 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pragmatic information in translation: a corpus-based study of tense and mood in English and German | Jul 10, 2020 | Machine TranslationMultilingual NLP | —Unverified | 0 |
| Predicting the Performance of Multilingual NLP Models | Oct 17, 2021 | Multilingual NLP | —Unverified | 0 |
| Representation and Bias in Multilingual NLP: Insights from Controlled Experiments on Conditional Language Modeling | Jan 1, 2021 | FairnessLanguage Modeling | —Unverified | 0 |
| SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval | Oct 28, 2024 | Information RetrievalMultilingual Named Entity Recognition | —Unverified | 0 |
| Semantic Clustering of Pivot Paraphrases | May 1, 2014 | ClusteringMachine Translation | —Unverified | 0 |
| SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods | May 29, 2025 | Cross-Lingual TransferMultilingual NLP | —Unverified | 0 |
| Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities | May 21, 2025 | MemorizationMultilingual NLP | —Unverified | 0 |
| Survey on the Use of Typological Information in Natural Language Processing | Oct 11, 2016 | Multilingual NLPSurvey | —Unverified | 0 |
| The MultiTal NLP tool infrastructure | Dec 1, 2016 | Multilingual NLP | —Unverified | 0 |
| URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors | Apr 1, 2017 | Language IdentificationLanguage Modeling | —Unverified | 0 |
| XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment | Apr 17, 2021 | Machine TranslationMultilingual NLP | —Unverified | 0 |
| Zero-shot Cross-lingual Transfer without Parallel Corpus | Oct 7, 2023 | Cross-Lingual TransferMultilingual NLP | —Unverified | 0 |
| Introducing Syllable Tokenization for Low-resource Languages: A Case Study with Swahili | Mar 26, 2024 | Multilingual NLPText Generation | —Unverified | 0 |
| Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models | Mar 15, 2024 | AllMultilingual NLP | —Unverified | 0 |
| Improving Cross-Lingual Word Embeddings by Meeting in the Middle | Aug 27, 2018 | Cross-Lingual Word EmbeddingsMultilingual NLP | CodeCode Available | 0 |
| XeroAlign: Zero-Shot Cross-lingual Transformer Alignment | May 6, 2021 | Multilingual NLPNatural Language Understanding | CodeCode Available | 0 |
| Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis Models | May 7, 2024 | Multilingual NLPSentiment Analysis | CodeCode Available | 0 |
| ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models | Jun 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Analysing The Impact Of Linguistic Features On Cross-Lingual Transfer | May 12, 2021 | Cross-Lingual TransferMultilingual NLP | CodeCode Available | 0 |
| Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs | May 22, 2023 | Multilingual NLPRetrieval | CodeCode Available | 0 |
| HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response | Oct 10, 2022 | HumanitarianMultilabel Text Classification | CodeCode Available | 0 |
| UQA: Corpus for Urdu Question Answering | May 2, 2024 | Multilingual NLPQuestion Answering | CodeCode Available | 0 |
| Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis | Dec 1, 2020 | ClusteringMultilingual NLP | CodeCode Available | 0 |
| MMCR4NLP: Multilingual Multiway Corpora Repository for Natural Language Processing | Oct 3, 2017 | Machine TranslationMultilingual NLP | CodeCode Available | 0 |
| Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca | Sep 16, 2023 | Instruction FollowingLarge Language Model | CodeCode Available | 0 |
| Self-Augmentation Improves Zero-Shot Cross-Lingual Transfer | Sep 19, 2023 | Cross-Lingual TransferMultilingual NLP | CodeCode Available | 0 |
| Self-Augmented In-Context Learning for Unsupervised Word Translation | Feb 15, 2024 | Bilingual Lexicon InductionCross-Lingual Word Embeddings | CodeCode Available | 0 |
| A General-Purpose Multilingual Document Encoder | May 11, 2023 | Cross-Lingual TransferDocument Classification | CodeCode Available | 0 |
| What Drives Performance in Multilingual Language Models? | Apr 29, 2024 | Cross-Lingual TransferMultilingual NLP | CodeCode Available | 0 |
| News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation | Jun 18, 2024 | Cross-Lingual TransferDomain Adaptation | CodeCode Available | 0 |
| Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation | Jun 4, 2019 | Multilingual Named Entity RecognitionMultilingual NLP | CodeCode Available | 0 |
| What is "Typological Diversity" in NLP? | Feb 6, 2024 | DiversityMultilingual NLP | CodeCode Available | 0 |
| Evaluating The Effectiveness of Capsule Neural Network in Toxic Comment Classification using Pre-trained BERT Embeddings | Oct 12, 2023 | Multilingual NLPNatural Language Understanding | CodeCode Available | 0 |
| SICK-NL: A Dataset for Dutch Natural Language Inference | Apr 1, 2021 | Multilingual NLPNatural Language Inference | CodeCode Available | 0 |
| SICKNL: A Dataset for Dutch Natural Language Inference | Jan 14, 2021 | Multilingual NLPNatural Language Inference | CodeCode Available | 0 |
| BaitBuster-Bangla: A Comprehensive Dataset for Clickbait Detection in Bangla with Multi-Feature and Multi-Modal Analysis | Oct 13, 2023 | ClassificationClickbait Detection | CodeCode Available | 0 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| Cultural and Geographical Influences on Image Translatability of Words across Languages | Jun 1, 2021 | Cultural Vocal Bursts Intensity PredictionLow Resource Neural Machine Translation | CodeCode Available | 0 |
| PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India | May 15, 2023 | Cross-Lingual Abstractive SummarizationMultilingual NLP | CodeCode Available | 0 |
| A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets | Mar 6, 2024 | DiversityMultilingual NLP | CodeCode Available | 0 |
| TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLP | Jun 1, 2022 | DiversityMultilingual NLP | CodeCode Available | 0 |