| L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Apr 28, 2024 | Articles, Document Classification | Code Available | 0 |
| GuideWalk: A Novel Graph-Based Word Embedding for Enhanced Text Classification | Apr 25, 2024 | Classification, Document Classification | Unverified | 0 |
| BuDDIE: A Business Document Dataset for Multi-task Information Extraction | Apr 5, 2024 | Document Classification, document understanding | Unverified | 0 |
| Developing Healthcare Language Model Embedding Spaces | Mar 28, 2024 | Contrastive Learning, Document Classification | Unverified | 0 |
| Clustering Document Parts: Detecting and Characterizing Influence Campaigns from Documents | Feb 27, 2024 | Clustering, Document Classification | Code Available | 0 |
| NLP for Knowledge Discovery and Information Extraction from Energetics Corpora | Feb 10, 2024 | Articles, Document Classification | Unverified | 0 |
| Efficient Models for the Detection of Hate, Abuse and Profanity | Feb 8, 2024 | Document Classification, named-entity-recognition | Unverified | 0 |
| Generalized Sobolev Transport for Probability Measures on a Graph | Feb 7, 2024 | Document Classification, Topological Data Analysis | Code Available | 0 |
| A Learning oriented DLP System based on Classification Model | Dec 21, 2023 | Classification, Document Classification | Unverified | 0 |
| Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs | Dec 21, 2023 | Document Classification, Knowledge Graphs | Unverified | 0 |
| MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA | Dec 19, 2023 | Document Classification, Hallucination | Code Available | 0 |
| Large language models in healthcare and medical domain: A review | Dec 12, 2023 | Document Classification, named-entity-recognition | Unverified | 0 |
| Summarization-based Data Augmentation for Document Classification | Dec 1, 2023 | Classification, Data Augmentation | Code Available | 0 |
| SUT: a new multi-purpose synthetic dataset for Farsi document image analysis | Nov 27, 2023 | Document Classification, document-image-classification | Code Available | 0 |
| Learning Section Weights for Multi-Label Document Classification | Nov 26, 2023 | Articles, Classification | Unverified | 0 |
| Causality is all you need | Nov 21, 2023 | All, Document Classification | Unverified | 0 |
| ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science | Nov 21, 2023 | Document Classification, Graph Neural Network | Unverified | 0 |
| Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets | Nov 15, 2023 | Document Classification, text-classification | Unverified | 0 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | Oct 25, 2023 | Classification, Cross-Lingual Transfer | Unverified | 0 |
| Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents | Oct 25, 2023 | All, Document Classification | Unverified | 0 |
| Optimal Transport for Measures with Noisy Tree Metric | Oct 20, 2023 | Document Classification, Topological Data Analysis | Code Available | 0 |
| BibRank: Automatic Keyphrase Extraction Platform Using Metadata | Oct 13, 2023 | Clustering, Document Classification | Code Available | 0 |
| An Analysis on Large Language Models in Healthcare: A Case Study of BioBERT | Oct 11, 2023 | Document Classification, Information Retrieval | Unverified | 0 |
| KoBigBird-large: Transformation of Transformer for Korean Language Understanding | Sep 19, 2023 | Document Classification, Question Answering | Unverified | 0 |
| Beyond Document Page Classification: Design, Datasets, and Challenges | Aug 24, 2023 | Benchmarking, Classification | Code Available | 0 |