| FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Extrapolating Multilingual Understanding Models as Multilingual Generators | May 22, 2023 | DenoisingLanguage Modeling | —Unverified | 0 |
| DUMB: A Benchmark for Smart Evaluation of Dutch Models | May 22, 2023 | XLM-R | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages | May 20, 2023 | Language ModellingXLM-R | CodeCode Available | 1 |
| USTC-NELSLIP at SemEval-2023 Task 2: Statistical Construction and Dual Adaptation of Gazetteer for Multilingual Complex NER | May 4, 2023 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning | May 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese | Apr 18, 2023 | Cross-Lingual Transfernamed-entity-recognition | —Unverified | 0 |
| GreekBART: The First Pretrained Greek Sequence-to-Sequence Model | Apr 3, 2023 | Natural Language InferenceText Generation | CodeCode Available | 0 |
| Tollywood Emotions: Annotation of Valence-Arousal in Telugu Song Lyrics | Mar 16, 2023 | Emotion RecognitionMusic Emotion Recognition | —Unverified | 0 |
| Evaluating the Effectiveness of Pre-trained Language Models in Predicting the Helpfulness of Online Product Reviews | Feb 19, 2023 | Feature EngineeringXLM-R | CodeCode Available | 0 |
| Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval | Feb 3, 2023 | RelationRepresentation Learning | CodeCode Available | 0 |
| LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain | Jan 30, 2023 | XLM-R | CodeCode Available | 1 |
| Multilingual Sentence Transformer as A Multilingual Word Aligner | Jan 28, 2023 | SentenceWord Alignment | CodeCode Available | 1 |
| XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ViHOS: Hate Speech Spans Detection for Vietnamese | Jan 24, 2023 | Hate Span IdentificationSequence-to-sequence Language Modeling | CodeCode Available | 1 |
| Integrating Semantic Information into Sketchy Reading Module of Retro-Reader for Vietnamese Machine Reading Comprehension | Jan 1, 2023 | Machine Reading ComprehensionReading Comprehension | —Unverified | 0 |
| DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue | Dec 15, 2022 | Semantic ParsingXLM-R | CodeCode Available | 0 |
| VTCC-NLP at NL4Opt competition subtask 1: An Ensemble Pre-trained language models for Named Entity Recognition | Dec 14, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages | Dec 11, 2022 | Natural Language UnderstandingXLM-R | CodeCode Available | 1 |
| Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin | Dec 10, 2022 | Language ModellingPunctuation Restoration | CodeCode Available | 0 |
| Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer | Dec 4, 2022 | Cross-Lingual TransferXLM-R | —Unverified | 0 |
| Compressing Cross-Lingual Multi-Task Models at Qualtrics | Nov 29, 2022 | ManagementModel Compression | —Unverified | 0 |
| X^2-VLM: All-In-One Pre-trained Model For Vision-Language Tasks | Nov 22, 2022 | AllCross-Modal Retrieval | CodeCode Available | 2 |
| L3Cube-HindBERT and DevBERT: Pre-Trained BERT Transformer models for Devanagari based Hindi and Marathi Languages | Nov 21, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |