| Retrofitting Large Language Models with Dynamic Tokenization | Nov 27, 2024 | DecoderFairness | —Unverified | 0 |
| Transformer-Based Contextualized Language Models Joint with Neural Networks for Natural Language Inference in Vietnamese | Nov 20, 2024 | Natural Language InferenceXLM-R | —Unverified | 0 |
| From N-grams to Pre-trained Multilingual Models For Language Identification | Oct 11, 2024 | Language IdentificationXLM-R | CodeCode Available | 0 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language InferenceSentiment Analysis | CodeCode Available | 1 |
| LangSAMP: Language-Script Aware Multilingual Pretraining | Sep 26, 2024 | Continual PretrainingLanguage Modeling | CodeCode Available | 0 |
| mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval | Jul 29, 2024 | Contrastive LearningReranking | —Unverified | 0 |
| The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models | Jun 27, 2024 | Cross-Lingual TransferSentiment Analysis | —Unverified | 0 |
| Medical Spoken Named Entity Recognition | Jun 19, 2024 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |
| Multilingual Large Language Models and Curse of Multilinguality | Jun 15, 2024 | DecoderXLM-R | —Unverified | 0 |
| Exploring Alignment in Shared Cross-lingual Spaces | May 23, 2024 | Machine Translationnamed-entity-recognition | CodeCode Available | 0 |