| Cross-Linguistic Transfer in Multilingual NLP: The Role of Language Families and Morphology | May 20, 2025 | Cross-Lingual Transfer, Multilingual NLP | Unverified | 0 |
| Subasa -- Adapting Language Models for Low-resourced Offensive Language Detection in Sinhala | Apr 2, 2025 | XLM-R | Unverified | 0 |
| Multilingual Encoder Knows more than You Realize: Shared Weights Pretraining for Extremely Low-Resource Languages | Feb 15, 2025 | Decoder, Text Generation | Code Available | 0 |
| AmaSQuAD: A Benchmark for Amharic Extractive Question Answering | Feb 4, 2025 | Extractive Question-Answering, Question Answering | Unverified | 0 |
| Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models | Jan 26, 2025 | XLM-R | Unverified | 0 |
| Comparative Approaches to Sentiment Analysis Using Datasets in Major European and Arabic Languages | Jan 21, 2025 | Sentiment Analysis, Sentiment Classification | Unverified | 0 |
| FuocChuVIP123 at CoMeDi Shared Task: Disagreement Ranking with XLM-Roberta Sentence Embeddings and Deep Neural Regression | Jan 21, 2025 | Sentence, Sentence Embeddings | Unverified | 0 |
| Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval | Jan 17, 2025 | Data Augmentation, Domain Adaptation | Unverified | 0 |
| BabyLMs for isiXhosa: Data-Efficient Language Modelling in a Low-Resource Context | Jan 7, 2025 | Language Modelling, NER | Unverified | 0 |
| USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task | Nov 28, 2024 | Information Retrieval, Machine Translation | Unverified | 0 |
| Retrofitting Large Language Models with Dynamic Tokenization | Nov 27, 2024 | Decoder, Fairness | Unverified | 0 |
| Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese | Nov 20, 2024 | Natural Language Inference, XLM-R | Unverified | 0 |
| From N-grams to Pre-trained Multilingual Models For Language Identification | Oct 11, 2024 | Language Identification, XLM-R | Code Available | 0 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language Inference, Sentiment Analysis | Code Available | 1 |
| LangSAMP: Language-Script Aware Multilingual Pretraining | Sep 26, 2024 | Continual Pretraining, Language Modeling | Code Available | 0 |
| mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval | Jul 29, 2024 | Contrastive Learning, Reranking | Unverified | 0 |
| The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models | Jun 27, 2024 | Cross-Lingual Transfer, Sentiment Analysis | Unverified | 0 |
| Medical Spoken Named Entity Recognition | Jun 19, 2024 | Named Entity Recognition | Code Available | 0 |
| Multilingual Large Language Models and Curse of Multilinguality | Jun 15, 2024 | Decoder, XLM-R | Unverified | 0 |
| Exploring Alignment in Shared Cross-lingual Spaces | May 23, 2024 | Machine Translation, Named Entity Recognition | Code Available | 0 |
| Targeted Multilingual Adaptation for Low-resource Language Families | May 20, 2024 | XLM-R | Unverified | 0 |
| Zero-Shot Tokenizer Transfer | May 13, 2024 | XLM-R | Code Available | 2 |
| Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024 | Apr 23, 2024 | Named Entity Recognition | Unverified | 0 |
| Adapting Mental Health Prediction Tasks for Cross-lingual Learning via Meta-Training and In-context Learning with Large Language Model | Apr 13, 2024 | Cross-Lingual Transfer, In-Context Learning | Unverified | 0 |
| MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness | Apr 3, 2024 | Cross-Lingual Transfer, Data Augmentation | Unverified | 0 |
| Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets | Mar 29, 2024 | Cross-Lingual Transfer, Named Entity Recognition | Code Available | 0 |
| Solution for Emotion Prediction Competition of Workshop on Emotionally and Culturally Intelligent AI | Mar 26, 2024 | Diversity, XLM-R | Unverified | 0 |
| CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification | Mar 18, 2024 | Conformal Prediction, In-Context Learning | Code Available | 0 |
| Machines Do See Color: A Guideline to Classify Different Forms of Racist Discourse in Large Corpora | Jan 17, 2024 | Text Classification | Unverified | 0 |
| LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization | Jan 11, 2024 | Intent Classification | Unverified | 0 |
| Hate Speech and Offensive Content Detection in Indo-Aryan Languages: A Battle of LSTM and Transformers | Dec 9, 2023 | Hate Speech Detection, Model Selection | Unverified | 0 |
| A Text-to-Text Model for Multilingual Offensive Language Identification | Dec 6, 2023 | Decoder, Language Identification | Unverified | 0 |
| KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model | Nov 20, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| MELA: Multilingual Evaluation of Linguistic Acceptability | Nov 15, 2023 | Code Generation, Cross-Lingual Transfer | Code Available | 0 |
| Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study | Nov 11, 2023 | Cross-Lingual Sentiment Classification, Cross-Lingual Transfer | Unverified | 0 |
| Counterfactually Probing Language Identity in Multilingual Models | Oct 29, 2023 | counterfactual, Language Modeling | Code Available | 0 |
| Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media | Oct 27, 2023 | Cross-Lingual Transfer, Fact Checking | Code Available | 1 |
| Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering | Oct 20, 2023 | Cross-Lingual Transfer, POS | Code Available | 0 |
| MedAI Dialog Corpus (MEDIC): Zero-Shot Classification of Doctor and AI Responses in Health Consultations | Oct 19, 2023 | Classification, Text Classification | Unverified | 0 |
| ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing | Oct 17, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| QASiNa: Religious Domain Question Answering using Sirah Nabawiyah | Oct 12, 2023 | Language Modelling, Large Language Model | Code Available | 0 |
| Exploring the Maze of Multilingual Modeling | Oct 9, 2023 | Language Modelling, Model Selection | Unverified | 0 |
| Mixed-Distil-BERT: Code-mixed Language Modeling for Bangla, English, and Hindi | Sep 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models | Aug 17, 2023 | Fairness, XLM-R | Unverified | 0 |
| Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models | Jul 12, 2023 | Quantization, XLM-R | Unverified | 0 |
| Does mBERT understand Romansh? Evaluating word embeddings using word alignment | Jun 14, 2023 | Sentence, Word Alignment | Code Available | 0 |
| XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations | Jun 7, 2023 | Cross-Lingual Transfer, Decoder | Code Available | 0 |
| Exploring the Relationship between Alignment and Cross-lingual Transfer in Multilingual Transformers | Jun 5, 2023 | Cross-Lingual Transfer, POS | Code Available | 0 |
| Distilling Efficient Language-Specific Models for Cross-Lingual Transfer | Jun 2, 2023 | Cross-Lingual Transfer, Transfer Learning | Code Available | 0 |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | May 23, 2023 | Benchmarking, Cross-Lingual Transfer | Code Available | 1 |