| Medical-GAT: Cancer Document Classification Leveraging Graph-Based Residual Network for Scenarios with Limited Data | Oct 19, 2024 | Document Classification, Graph Attention | Unverified | 0 |
| Weakly-supervised diagnosis identification from Italian discharge letters | Oct 19, 2024 | Document Classification, text-classification | Unverified | 0 |
| ChuLo: Chunk-Level Key Information Representation for Long Document Processing | Oct 14, 2024 | Chunking, Classification | Code Available | 0 |
| Text Classification using Graph Convolutional Networks: A Comprehensive Survey | Oct 12, 2024 | Classification, Document Classification | Unverified | 0 |
| Orthogonal Nonnegative Matrix Factorization with the Kullback-Leibler divergence | Oct 10, 2024 | Document Classification | Code Available | 0 |
| Manual Verbalizer Enrichment for Few-Shot Text Classification | Oct 8, 2024 | Benchmarking, Classification | Unverified | 0 |
| Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification | Oct 3, 2024 | Document Classification, Graph Attention | Unverified | 0 |
| FLAG: Financial Long Document Classification via AMR-based GNN | Oct 2, 2024 | Abstract Meaning Representation, Document Classification | Code Available | 0 |
| Document Type Classification using File Names | Oct 2, 2024 | Classification, Document Classification | Unverified | 0 |
| On Importance of Pruning and Distillation for Efficient Low Resource NLP | Sep 21, 2024 | Document Classification, GPU | Unverified | 0 |
| SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization | Sep 10, 2024 | Document Classification, named-entity-recognition | Code Available | 0 |
| Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification | Aug 20, 2024 | Document AI, Document Classification | Code Available | 0 |
| AutoML-guided Fusion of Entity and LLM-based Representations for Document Classification | Aug 19, 2024 | AutoML, Classification | Code Available | 0 |
| Diagnosis extraction from unstructured Dutch echocardiogram reports using span- and document-level characteristic classification | Aug 13, 2024 | Classification, Document Classification | Code Available | 0 |
| Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian | Jul 30, 2024 | Document Classification, Entity Typing | Unverified | 0 |
| An Improved Method for Class-specific Keyword Extraction: A Case Study in the German Business Registry | Jul 19, 2024 | Document Classification, Keyword Extraction | Code Available | 0 |
| Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification | Jul 14, 2024 | Document Classification, Sentence | Unverified | 0 |
| Rapid Biomedical Research Classification: The Pandemic PACT Advanced Categorisation Engine | Jul 14, 2024 | Decision Making, Document Classification | Unverified | 0 |
| Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification | Jun 3, 2024 | Document Classification | Unverified | 0 |
| Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification | May 29, 2024 | Document Classification | Unverified | 0 |
| Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark | May 17, 2024 | Document Classification, Language Modeling | Unverified | 0 |
| Length-Aware Multi-Kernel Transformer for Long Document Classification | May 11, 2024 | Document Classification, Sentence | Code Available | 0 |
| Improving Long Text Understanding with Knowledge Distilled from Summarization Model | May 8, 2024 | Abstractive Text Summarization, Document Classification | Unverified | 0 |
| CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification | May 6, 2024 | Document Classification, document-image-classification | Unverified | 0 |
| Machine Unlearning for Document Classification | Apr 29, 2024 | Classification, Document Classification | Code Available | 0 |
| L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi | Apr 28, 2024 | Articles, Document Classification | Code Available | 0 |
| GuideWalk: A Novel Graph-Based Word Embedding for Enhanced Text Classification | Apr 25, 2024 | Classification, Document Classification | Unverified | 0 |
| BuDDIE: A Business Document Dataset for Multi-task Information Extraction | Apr 5, 2024 | Document Classification, document understanding | Unverified | 0 |
| Developing Healthcare Language Model Embedding Spaces | Mar 28, 2024 | Contrastive Learning, Document Classification | Unverified | 0 |
| Clustering Document Parts: Detecting and Characterizing Influence Campaigns from Documents | Feb 27, 2024 | Clustering, Document Classification | Code Available | 0 |
| NLP for Knowledge Discovery and Information Extraction from Energetics Corpora | Feb 10, 2024 | Articles, Document Classification | Unverified | 0 |
| Efficient Models for the Detection of Hate, Abuse and Profanity | Feb 8, 2024 | Document Classification, named-entity-recognition | Unverified | 0 |
| Generalized Sobolev Transport for Probability Measures on a Graph | Feb 7, 2024 | Document Classification, Topological Data Analysis | Code Available | 0 |
| A Learning oriented DLP System based on Classification Model | Dec 21, 2023 | Classification, Document Classification | Unverified | 0 |
| Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs | Dec 21, 2023 | Document Classification, Knowledge Graphs | Unverified | 0 |
| MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA | Dec 19, 2023 | Document Classification, Hallucination | Code Available | 0 |
| Large language models in healthcare and medical domain: A review | Dec 12, 2023 | Document Classification, named-entity-recognition | Unverified | 0 |
| Summarization-based Data Augmentation for Document Classification | Dec 1, 2023 | Classification, Data Augmentation | Code Available | 0 |
| SUT: a new multi-purpose synthetic dataset for Farsi document image analysis | Nov 27, 2023 | Document Classification, document-image-classification | Code Available | 0 |
| Learning Section Weights for Multi-Label Document Classification | Nov 26, 2023 | Articles, Classification | Unverified | 0 |
| Causality is all you need | Nov 21, 2023 | All, Document Classification | Unverified | 0 |
| ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science | Nov 21, 2023 | Document Classification, Graph Neural Network | Unverified | 0 |
| Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets | Nov 15, 2023 | Document Classification, text-classification | Unverified | 0 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | Oct 25, 2023 | Classification, Cross-Lingual Transfer | Unverified | 0 |
| Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents | Oct 25, 2023 | All, Document Classification | Unverified | 0 |
| Optimal Transport for Measures with Noisy Tree Metric | Oct 20, 2023 | Document Classification, Topological Data Analysis | Code Available | 0 |
| BibRank: Automatic Keyphrase Extraction Platform Using Metadata | Oct 13, 2023 | Clustering, Document Classification | Code Available | 0 |
| An Analysis on Large Language Models in Healthcare: A Case Study of BioBERT | Oct 11, 2023 | Document Classification, Information Retrieval | Unverified | 0 |
| KoBigBird-large: Transformation of Transformer for Korean Language Understanding | Sep 19, 2023 | Document Classification, Question Answering | Unverified | 0 |
| Beyond Document Page Classification: Design, Datasets, and Challenges | Aug 24, 2023 | Benchmarking, Classification | Code Available | 0 |