| Specialized Document Embeddings for Aspect-based Similarity of Research Papers | Mar 28, 2022 | Document Classification, Recommendation Systems | Code Available | 1 |
| Efficient Classification of Long Documents Using Transformers | Mar 21, 2022 | Classification, Document Classification | Unverified | 0 |
| DocXClassifier: High Performance Explainable Deep Network for Document Image Classification | Mar 17, 2022 | Classification, Data Augmentation | Code Available | 1 |
| A Survey of Historical Document Image Datasets | Mar 16, 2022 | Document Classification, Survey | Unverified | 0 |
| Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise Setting | Mar 15, 2022 | Document Classification, Multi-Label Classification | Code Available | 0 |
| Semi-supervised Nonnegative Matrix Factorization for Document Classification | Feb 28, 2022 | Classification, Document Classification | Unverified | 0 |
| Sobolev Transport: A Scalable Metric for Probability Measures with Graph Metrics | Feb 22, 2022 | Document Classification, Topological Data Analysis | Code Available | 0 |
| One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian Optimization | Feb 15, 2022 | All, Bayesian Optimization | Code Available | 2 |
| Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings | Feb 14, 2022 | Citation Prediction, Contrastive Learning | Code Available | 1 |
| Moving Other Way: Exploring Word Mover Distance Extensions | Feb 7, 2022 | Classification, Document Classification | Unverified | 0 |
| Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences | Jan 27, 2022 | Clinical Knowledge, Document Classification | Code Available | 1 |
| Importance of Textlines in Historical Document Classification | Jan 24, 2022 | Classification, Document Classification | Unverified | 0 |
| Recurrent Neural Networks with Mixed Hierarchical Structures and EM Algorithm for Natural Language Processing | Jan 21, 2022 | Document Classification | Unverified | 0 |
| Automation of Citation Screening for Systematic Literature Reviews using Neural Networks: A Replicability Study | Jan 19, 2022 | Document Classification, Systematic Literature Review | Code Available | 0 |
| Hierarchical Neural Network Approaches for Long Document Classification | Jan 18, 2022 | Classification, Document Classification | Unverified | 0 |
| ERNIE-Layout: Layout-Knowledge Enhanced Multi-modal Pre-training for Document Understanding | Jan 16, 2022 | cross-modal alignment, Document Classification | Code Available | 0 |
| Document Classification with Word Sense Knowledge | Jan 16, 2022 | Classification, Document Classification | Unverified | 0 |
| Intelligent Document Processing -- Methods and Tools in the real world | Dec 28, 2021 | Document Classification, Optical Character Recognition (OCR) | Unverified | 0 |
| Sublinear Time Approximation of Text Similarity Matrices | Dec 17, 2021 | Document Classification, Sentence | Code Available | 0 |
| An Empirical Study on Transfer Learning for Privilege Review | Dec 16, 2021 | BIG-bench Machine Learning, Document Classification | Unverified | 0 |
| Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification | Dec 13, 2021 | Classification, Document Classification | Code Available | 1 |
| Revisiting Transformer-based Models for Long Document Classification | Nov 16, 2021 | Classification, Document Classification | Unverified | 0 |
| Temporal Language Modeling for Short Text Document Classification with Transformers | Nov 16, 2021 | Classification, Document Classification | Unverified | 0 |
| Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise Setting | Nov 16, 2021 | Document Classification, Multi-Label Classification | Unverified | 0 |
| MNet-Sim: A Multi-layered Semantic Similarity Network to Evaluate Sentence Similarity | Nov 9, 2021 | Chatbot, Document Classification | Unverified | 0 |
| Feature Selective Likelihood Ratio Estimator for Low- and Zero-frequency N-grams | Nov 5, 2021 | Document Classification, feature selection | Unverified | 0 |
| Augmentations in Graph Contrastive Learning: Current Methodological Flaws & Towards Better Practices | Nov 5, 2021 | Classification, Contrastive Learning | Unverified | 0 |
| Softmax Tree: An Accurate, Fast Classifier When the Number of Classes Is Large | Nov 1, 2021 | Document Classification | Unverified | 0 |
| MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer | Nov 1, 2021 | Cross-Lingual Transfer, Document Classification | Code Available | 1 |
| Effective Convolutional Attention Network for Multi-label Clinical Document Classification | Nov 1, 2021 | Classification, Document Classification | Unverified | 0 |
| Beyond Text: Incorporating Metadata and Label Structure for Multi-Label Document Classification using Heterogeneous Graphs | Nov 1, 2021 | Classification, Document Classification | Unverified | 0 |
| Legal Terminology Extraction with the Termolator | Nov 1, 2021 | Articles, Document Classification | Unverified | 0 |
| Effectively Leveraging BERT for Legal Document Classification | Nov 1, 2021 | Binary Classification, Classification | Unverified | 0 |
| Domain-adaptation of spherical embeddings | Nov 1, 2021 | Document Classification, Domain Adaptation | Unverified | 0 |
| Comparative Study of Long Document Classification | Nov 1, 2021 | BIG-bench Machine Learning, Classification | Unverified | 0 |
| Domain Agnostic Few-Shot Learning For Document Intelligence | Oct 29, 2021 | Classification, Cross-Domain Few-Shot | Unverified | 0 |
| Contrastive Document Representation Learning with Graph Attention Networks | Oct 20, 2021 | Contrastive Learning, Document Classification | Unverified | 0 |
| Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization | Oct 10, 2021 | Abstractive Text Summarization, Document Classification | Unverified | 0 |
| Weakly Supervised Concept Map Generation through Task-Guided Graph Translation | Oct 8, 2021 | Document Classification, Translation | Code Available | 0 |
| JOINTLY LEARNING TOPIC SPECIFIC WORD AND DOCUMENT EMBEDDING | Sep 29, 2021 | Document Classification, Document Embedding | Unverified | 0 |
| Towards Comprehensive Patent Approval Predictions: Beyond Traditional Document Classification | Sep 17, 2021 | Classification, Document Classification | Unverified | 0 |
| Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution | Sep 10, 2021 | Document Classification, Multi-Label Text Classification | Code Available | 1 |
| MultiEURLEX -- A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer | Sep 2, 2021 | Cross-Lingual Transfer, Document Classification | Code Available | 1 |
| Towards Explaining STEM Document Classification using Mathematical Entity Linking | Sep 2, 2021 | Classification, Document Classification | Code Available | 0 |
| Application of Mix-Up Method in Document Classification Task Using BERT | Sep 1, 2021 | Classification, Data Augmentation | Unverified | 0 |
| Domain-Specific Japanese ELECTRA Model Using a Small Corpus | Sep 1, 2021 | Articles, Computational Efficiency | Unverified | 0 |
| Improving Neural Language Processing with Named Entities | Sep 1, 2021 | Document Classification, Headline Generation | Unverified | 0 |
| Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP | Aug 5, 2021 | Document Classification, Out-of-Distribution Generalization | Unverified | 0 |
| PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors | Aug 2, 2021 | Document Classification, Specificity | Code Available | 1 |
| Multilingual Protest News Detection - Shared Task 1, CASE 2021 | Aug 1, 2021 | Benchmarking, Decision Making | Unverified | 0 |