| SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects | Sep 14, 2023 | Cross-Lingual Transfer, Language Modelling | Code Available | 1 |
| Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens | Sep 8, 2023 | Citation Prediction, Topic Classification | Code Available | 0 |
| Benchmarking Multilabel Topic Classification in the Kyrgyz Language | Aug 30, 2023 | Benchmarking, Classification | Code Available | 0 |
| American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers | Aug 24, 2023 | Articles, Language Modeling | Unverified | 0 |
| MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model | Aug 9, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| vONTSS: vMF based semi-supervised neural topic modeling with optimal transport | Jul 3, 2023 | Classification, Diversity | Unverified | 0 |
| Towards Open-Domain Topic Classification | Jun 29, 2023 | Classification, Language Modeling | Unverified | 0 |
| Multilingual Few-Shot Learning via Language Model Retrieval | Jun 19, 2023 | Few-Shot Learning, In-Context Learning | Unverified | 0 |
| Monolingual and Cross-Lingual Knowledge Transfer for Topic Classification | Jun 13, 2023 | Classification, Topic Classification | Unverified | 0 |
| Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati | Jun 12, 2023 | Classification, regression | Code Available | 0 |
| Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs | Jun 5, 2023 | Decision Making, Topic Classification | Unverified | 0 |
| Efficient Document Embeddings via Self-Contrastive Bregman Divergence Learning | May 25, 2023 | Contrastive Learning, Information Retrieval | Unverified | 0 |
| Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Zero-Shot Text Classification via Self-Supervised Tuning | May 19, 2023 | Classification, Self-Supervised Learning | Code Available | 1 |
| A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot | May 16, 2023 | Emotion Classification, Question Answering | Code Available | 0 |
| Quantifying the Dissimilarity of Texts | May 3, 2023 | Clustering, Information Retrieval | Code Available | 0 |
| MasakhaNEWS: News Topic Classification for African languages | Apr 19, 2023 | Classification, Few-Shot Learning | Code Available | 1 |
| Deep Learning for Opinion Mining and Topic Classification of Course Reviews | Apr 6, 2023 | Opinion Mining, Topic Classification | Unverified | 0 |
| When Crowd Meets Persona: Creating a Large-Scale Open-Domain Persona Dialogue Corpus | Apr 1, 2023 | Dialogue Generation, Question Answering | Unverified | 0 |
| Topic Segmentation Model Focusing on Local Context | Jan 5, 2023 | Information Retrieval, model | Unverified | 0 |
| QBERT: Generalist Model for Processing Questions | Dec 5, 2022 | model, Question Answering | Unverified | 0 |
| TEMPERA: Test-Time Prompting via Reinforcement Learning | Nov 21, 2022 | Few-Shot Learning, Natural Language Inference | Code Available | 1 |
| TCBERT: A Technical Report for Chinese Topic Classification BERT | Nov 21, 2022 | Classification, Contrastive Learning | Unverified | 0 |
| CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification | Nov 11, 2022 | Classification, counterfactual | Unverified | 0 |
| Hierarchical Multi-Label Classification of Scientific Documents | Nov 5, 2022 | Classification, Hierarchical Multi-label Classification | Code Available | 1 |
| Conformal Predictor for Improving Zero-shot Text Classification Efficiency | Oct 23, 2022 | Classification, Natural Language Inference | Unverified | 0 |
| Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods | Oct 13, 2022 | Abstractive Text Summarization, Feature Importance | Code Available | 0 |
| HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea | Oct 11, 2022 | named-entity-recognition, Named Entity Recognition | Code Available | 1 |
| Twitter Topic Classification | Sep 20, 2022 | Classification, Topic Classification | Unverified | 0 |
| Non-Parametric Temporal Adaptation for Social Media Topic Classification | Sep 13, 2022 | Classification, Retrieval | Unverified | 0 |
| CTM - A Model for Large-Scale Multi-View Tweet Topic Classification | Jul 1, 2022 | Classification, Topic Classification | Unverified | 0 |
| Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification | Jun 8, 2022 | Cross-Lingual Transfer, Topic Classification | Unverified | 0 |
| Near-Term Advances in Quantum Natural Language Processing | Jun 5, 2022 | Topic Classification | Unverified | 0 |
| Estimating Confidence of Predictions of Individual Classifiers and Their Ensembles for the Genre Classification Task | Jun 1, 2022 | Genre classification, text-classification | Unverified | 0 |
| FrameASt: A Framework for Second-level Agenda Setting in Parliamentary Debates through the Lense of Comparative Agenda Topics | Jun 1, 2022 | Topic Classification | Code Available | 0 |
| Leveraging QA Datasets to Improve Generative Data Augmentation | May 25, 2022 | Common Sense Reasoning, Data Augmentation | Code Available | 0 |
| Improving Short Text Classification With Augmented Data Using GPT-3 | May 23, 2022 | Classification, Language Modeling | Unverified | 0 |
| Sentence-level Privacy for Document Embeddings | May 10, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| CTM -- A Model for Large-Scale Multi-View Tweet Topic Classification | May 3, 2022 | Classification, Topic Classification | Unverified | 0 |
| Clause Topic Classification in German and English Standard Form Contracts | May 1, 2022 | Classification, Form | Unverified | 0 |
| Polyglot Prompt: Multilingual Multitask PrompTraining | Apr 29, 2022 | named-entity-recognition, Named Entity Recognition | Code Available | 1 |
| Multi-label topic classification for COVID-19 literature with Bioformer | Apr 14, 2022 | Articles, Classification | Unverified | 0 |
| Label Semantic Aware Pre-training for Few-shot Text Classification | Apr 14, 2022 | Classification, Few-Shot Text Classification | Code Available | 1 |
| Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning | Apr 13, 2022 | Cross-Lingual Transfer, Language Modelling | Code Available | 1 |
| Prototypical Verbalizer for Prompt-based Few-shot Tuning | Mar 18, 2022 | Contrastive Learning, Entity Typing | Code Available | 4 |
| Unsupervised Keyphrase Extraction via Interpretable Neural Networks | Mar 15, 2022 | Articles, Keyphrase Extraction | Code Available | 0 |
| Unlearnable Text for Neural Classifiers | Jan 16, 2022 | Classification, Gender Classification | Unverified | 0 |
| Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification | Jan 16, 2022 | Classification, Cross-Lingual Transfer | Unverified | 0 |
| Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models | Nov 28, 2021 | Classification, Cross-Lingual Transfer | Code Available | 0 |
| Sentence-level Privacy for Document Embeddings | Nov 16, 2021 | Language Modeling, Language Modelling | Unverified | 0 |