| SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects | Sep 14, 2023 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 1 |
| Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens | Sep 8, 2023 | Citation PredictionTopic Classification | CodeCode Available | 0 |
| Benchmarking Multilabel Topic Classification in the Kyrgyz Language | Aug 30, 2023 | BenchmarkingClassification | CodeCode Available | 0 |
| American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers | Aug 24, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model | Aug 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| vONTSS: vMF based semi-supervised neural topic modeling with optimal transport | Jul 3, 2023 | ClassificationDiversity | —Unverified | 0 |
| Towards Open-Domain Topic Classification | Jun 29, 2023 | ClassificationLanguage Modeling | —Unverified | 0 |
| Multilingual Few-Shot Learning via Language Model Retrieval | Jun 19, 2023 | Few-Shot LearningIn-Context Learning | —Unverified | 0 |
| Monolingual and Cross-Lingual Knowledge Transfer for Topic Classification | Jun 13, 2023 | ClassificationTopic Classification | —Unverified | 0 |
| Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati | Jun 12, 2023 | Classificationregression | CodeCode Available | 0 |