| SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects | Sep 14, 2023 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 1 | 5 |
| Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering | Jul 6, 2021 | Active LearningObject Recognition | CodeCode Available | 1 | 5 |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | ArticlesEntity Disambiguation | CodeCode Available | 1 | 5 |
| Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | May 1, 2024 | HallucinationTopic Classification | CodeCode Available | 0 | 5 |
| BagBERT: BERT-based bagging-stacking for multi-topic classification | Nov 10, 2021 | ClassificationEnsemble Learning | CodeCode Available | 0 | 5 |
| A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings | May 17, 2025 | Abusive LanguageTopic Classification | CodeCode Available | 0 | 5 |
| A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot | May 16, 2023 | Emotion ClassificationQuestion Answering | CodeCode Available | 0 | 5 |
| Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligence | Sep 24, 2023 | BenchmarkingChange Detection | CodeCode Available | 0 | 5 |
| Automatic Classification of News Subjects in Broadcast News: Application to a Gender Bias Representation Analysis | Jul 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 | 5 |
| LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Nov 29, 2024 | ArticlesClassification | CodeCode Available | 0 | 5 |
| A thorough benchmark of automatic text classification: From traditional approaches to large language models | Apr 2, 2025 | Sentiment Analysistext-classification | CodeCode Available | 0 | 5 |
| Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization | May 28, 2019 | General ClassificationMachine Translation | CodeCode Available | 0 | 5 |
| Active learning in annotating micro-blogs dealing with e-reputation | Jun 16, 2017 | Active LearningInformation Retrieval | CodeCode Available | 0 | 5 |
| Inference and Verbalization Functions During In-Context Learning | Oct 12, 2024 | In-Context LearningNatural Language Inference | CodeCode Available | 0 | 5 |
| Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati | Jun 12, 2023 | Classificationregression | CodeCode Available | 0 | 5 |
| Controlling the Interaction Between Generation and Inference in Semi-Supervised Variational Autoencoders Using Importance Weighting | Oct 13, 2020 | Sentiment AnalysisTopic Classification | CodeCode Available | 0 | 5 |
| Give your Text Representation Models some Love: the Case for Basque | Mar 31, 2020 | General ClassificationNER | CodeCode Available | 0 | 5 |
| Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods | Oct 13, 2022 | Abstractive Text SummarizationFeature Importance | CodeCode Available | 0 | 5 |
| Optimal and efficient text counterfactuals using Graph Neural Networks | Aug 4, 2024 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Leveraging QA Datasets to Improve Generative Data Augmentation | May 25, 2022 | Common Sense ReasoningData Augmentation | CodeCode Available | 0 | 5 |
| From Random to Supervised: A Novel Dropout Mechanism Integrated with Global Information | Aug 24, 2018 | ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| ConCET: Entity-Aware Topic Classification for Open-Domain Conversational Agents | May 28, 2020 | ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| An Overview of the Active Gene Annotation Corpus and the BioNLP OST 2019 AGAC Track Tasks | Nov 1, 2019 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 | 5 |
| Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic Modeling | Apr 11, 2021 | ClassificationCross-Lingual Transfer | CodeCode Available | 0 | 5 |