| Prototypical Verbalizer for Prompt-based Few-shot Tuning | Mar 18, 2022 | Contrastive LearningEntity Typing | CodeCode Available | 4 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language InferenceSentiment Analysis | CodeCode Available | 1 |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | ArticlesEntity Disambiguation | CodeCode Available | 1 |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | May 16, 2024 | Bias DetectionDiversity | CodeCode Available | 1 |
| LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons | Feb 21, 2024 | Sentiment AnalysisTopic Classification | CodeCode Available | 1 |
| L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages | Jan 4, 2024 | ArticlesClassification | CodeCode Available | 1 |
| In-Context Learning with Iterative Demonstration Selection | Oct 15, 2023 | Few-Shot LearningIn-Context Learning | CodeCode Available | 1 |
| SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects | Sep 14, 2023 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 1 |
| Zero-Shot Text Classification via Self-Supervised Tuning | May 19, 2023 | ClassificationSelf-Supervised Learning | CodeCode Available | 1 |
| MasakhaNEWS: News Topic Classification for African languages | Apr 19, 2023 | ClassificationFew-Shot Learning | CodeCode Available | 1 |