| Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model | Jun 14, 2022 | Decision MakingNews Classification | CodeCode Available | 2 |
| Tri-Learn Graph Fusion Network for Attributed Graph Clustering | Jul 18, 2025 | ClusteringDeep Clustering | CodeCode Available | 1 |
| MasakhaNEWS: News Topic Classification for African languages | Apr 19, 2023 | ClassificationFew-Shot Learning | CodeCode Available | 1 |
| Multiverse: Multilingual Evidence for Fake News Detection | Nov 25, 2022 | Fake News DetectionNews Classification | CodeCode Available | 1 |
| Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time | Nov 25, 2022 | Continual LearningDomain Generalization | CodeCode Available | 1 |
| N24News: A New Dataset for Multimodal News Classification | Aug 30, 2021 | Classification | CodeCode Available | 1 |
| Cross-lingual Evidence Improves Monolingual Fake News Detection | Aug 1, 2021 | Fake News DetectionNews Classification | CodeCode Available | 1 |
| Evaluating Various Tokenizers for Arabic Text Classification | Jun 14, 2021 | ClassificationNews Classification | CodeCode Available | 1 |
| AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset | May 7, 2021 | ArticlesDialect Identification | CodeCode Available | 1 |
| IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages | Nov 8, 2020 | Genre classificationMultiple-choice | CodeCode Available | 1 |
| KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi | Oct 23, 2020 | ArticlesBenchmarking | CodeCode Available | 1 |
| Climate-Eval: A Comprehensive Benchmark for NLP Tasks Related to Climate Change | May 24, 2025 | News ClassificationQuestion Answering | —Unverified | 0 |
| Synthetic News Generation for Fake News Classification | Mar 31, 2025 | ArticlesClassification | —Unverified | 0 |
| Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data | Feb 24, 2025 | Active LearningClassification | —Unverified | 0 |
| A Hybrid Transformer Model for Fake News Detection: Leveraging Bayesian Optimization and Bidirectional Recurrent Unit | Feb 13, 2025 | Bayesian OptimizationFake News Detection | —Unverified | 0 |
| A Self-Learning Multimodal Approach for Fake News Detection | Dec 8, 2024 | Contrastive LearningFake News Detection | —Unverified | 0 |
| LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification | Nov 29, 2024 | ArticlesClassification | CodeCode Available | 0 |
| BERT or FastText? A Comparative Analysis of Contextual as well as Non-Contextual Embeddings | Nov 26, 2024 | Hate Speech DetectionNews Classification | CodeCode Available | 0 |
| On Limitations of LLM as Annotator for Low Resource Languages | Nov 26, 2024 | Hate Speech DetectionNews Classification | —Unverified | 0 |
| Comprehensive dataset of user-submitted articles with ideological and extreme bias from Reddit | Aug 12, 2024 | ArticlesHoldout Set | CodeCode Available | 0 |
| RICo: Reddit ideological communities | Jun 5, 2024 | ArticlesNews Classification | CodeCode Available | 0 |
| CANAL -- Cyber Activity News Alerting Language Model: Empirical Approach vs. Expensive LLM | May 10, 2024 | ArticlesFew-Shot Learning | —Unverified | 0 |
| EthioMT: Parallel Corpus for Low-resource Ethiopian Languages | Mar 28, 2024 | Machine TranslationNews Classification | —Unverified | 0 |
| Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models | Mar 17, 2024 | Computational EfficiencyHate Speech Detection | CodeCode Available | 0 |
| Improving Black-box Robustness with In-Context Rewriting | Feb 13, 2024 | News Classificationtext-classification | CodeCode Available | 0 |