SOTAVerified

Document Classification

Document classification is the task of assigning a document one or more labels from a predetermined set of labels.

Source: Long-length Legal Document Classification
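
As a rough illustration of the definition above, the following minimal Python sketch assigns zero, one, or several labels from a fixed label set to a document. The label set, the keyword lists, and the `classify` function are hypothetical and exist only for this example. Keyword matching stands in for a learned model and is not the method of any paper listed on this page.

```python
# Toy illustration only: the label set and keyword lists below are made up.
LABEL_KEYWORDS = {
    "legal": {"court", "contract", "statute"},
    "medical": {"patient", "clinical", "diagnosis"},
    "finance": {"invoice", "payment", "loan"},
}


def classify(document, min_hits=1):
    """Return every label whose keywords occur at least `min_hits` times."""
    # Lowercase, split on whitespace, and strip trailing punctuation.
    tokens = [token.strip(".,;:") for token in document.lower().split()]
    labels = []
    for label, keywords in LABEL_KEYWORDS.items():
        hits = sum(1 for token in tokens if token in keywords)
        if hits >= min_hits:
            labels.append(label)
    return labels


if __name__ == "__main__":
    # A document can receive more than one label (multi-label setting).
    print(classify("The court reviewed the loan contract."))
```

Raising `min_hits` makes the sketch more conservative. Returning only the single best-scoring label instead would turn it into a single-label classifier.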

Papers

Showing 151-200 of 641 papers

Title | Status | Hype
Specialized Document Embeddings for Aspect-based Similarity of Research Papers | Code | 1
Efficient Classification of Long Documents Using Transformers | | 0
DocXClassifier: High Performance Explainable Deep Network for Document Image Classification | Code | 1
A Survey of Historical Document Image Datasets | | 0
Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise Setting | Code | 0
Semi-supervised Nonnegative Matrix Factorization for Document Classification | | 0
Sobolev Transport: A Scalable Metric for Probability Measures with Graph Metrics | Code | 0
One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian Optimization | Code | 2
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings | Code | 1
Moving Other Way: Exploring Word Mover Distance Extensions | | 0
Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences | Code | 1
Importance of Textlines in Historical Document Classification | | 0
Recurrent Neural Networks with Mixed Hierarchical Structures and EM Algorithm for Natural Language Processing | | 0
Automation of Citation Screening for Systematic Literature Reviews using Neural Networks: A Replicability Study | Code | 0
Hierarchical Neural Network Approaches for Long Document Classification | | 0
ERNIE-Layout: Layout-Knowledge Enhanced Multi-modal Pre-training for Document Understanding | Code | 0
Document Classification with Word Sense Knowledge | | 0
Intelligent Document Processing -- Methods and Tools in the real world | | 0
Sublinear Time Approximation of Text Similarity Matrices | Code | 0
An Empirical Study on Transfer Learning for Privilege Review | | 0
Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification | Code | 1
Revisiting Transformer-based Models for Long Document Classification | | 0
Temporal Language Modeling for Short Text Document Classification with Transformers | | 0
Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise Setting | | 0
MNet-Sim: A Multi-layered Semantic Similarity Network to Evaluate Sentence Similarity | | 0
Feature Selective Likelihood Ratio Estimator for Low- and Zero-frequency N-grams | | 0
Augmentations in Graph Contrastive Learning: Current Methodological Flaws & Towards Better Practices | | 0
Softmax Tree: An Accurate, Fast Classifier When the Number of Classes Is Large | | 0
MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer | Code | 1
Effective Convolutional Attention Network for Multi-label Clinical Document Classification | | 0
Beyond Text: Incorporating Metadata and Label Structure for Multi-Label Document Classification using Heterogeneous Graphs | | 0
Legal Terminology Extraction with the Termolator | | 0
Effectively Leveraging BERT for Legal Document Classification | | 0
Domain-adaptation of spherical embeddings | | 0
Comparative Study of Long Document Classification | | 0
Domain Agnostic Few-Shot Learning For Document Intelligence | | 0
Contrastive Document Representation Learning with Graph Attention Networks | | 0
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization | | 0
Weakly Supervised Concept Map Generation through Task-Guided Graph Translation | Code | 0
JOINTLY LEARNING TOPIC SPECIFIC WORD AND DOCUMENT EMBEDDING | | 0
Towards Comprehensive Patent Approval Predictions: Beyond Traditional Document Classification | | 0
Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution | Code | 1
MultiEURLEX -- A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer | Code | 1
Towards Explaining STEM Document Classification using Mathematical Entity Linking | Code | 0
Application of Mix-Up Method in Document Classification Task Using BERT | | 0
Domain-Specific Japanese ELECTRA Model Using a Small Corpus | | 0
Improving Neural Language Processing with Named Entities | | 0
Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP | | 0
PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors | Code | 1
Multilingual Protest News Detection - Shared Task 1, CASE 2021 | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ApproxRepSet | Accuracy | 97.17 | | Unverified
2 | REL-RWMD k-NN | Accuracy | 95.61 | | Unverified
3 | Orthogonalized Soft VSM | Accuracy | 92.65 | | Unverified
4 | MAGNET | F1 | 89.9 | | Unverified
5 | VLAWE | F1 | 89.3 | | Unverified
6 | KD-LSTMreg | F1 | 88.9 | | Unverified
7 | LSTM-reg (single model) | F1 | 87 | | Unverified
8 | SCDV-MS | F1 | 82.71 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACNet | Accuracy | 83.5 | | Unverified
2 | LGCN | Accuracy | 83.3 | | Unverified
3 | GAT | Accuracy | 83 | | Unverified
4 | MoNet | Accuracy | 81.7 | | Unverified
5 | DeepWalk | Accuracy | 67.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BioLinkBERT (large) | F1 | 88.1 | | Unverified
2 | NCBI_BERT(large) (P) | F1 | 87.3 | | Unverified
3 | SciFive-large | F1 | 86.08 | | Unverified
4 | BioGPT | Micro F1 | 85.12 | | Unverified
5 | PubMedBERT uncased | Micro F1 | 82.32 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MPAD-path | Accuracy | 99.59 | | Unverified
2 | Orthogonalized Soft VSM | Accuracy | 97.73 | | Unverified
3 | ApproxRepSet | Accuracy | 95.73 | | Unverified
4 | REL-RWMD k-NN | Accuracy | 95.18 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ApproxRepSet | Accuracy | 94.31 | | Unverified
2 | Orthogonalized Soft VSM | Accuracy | 93.42 | | Unverified
3 | REL-RWMD k-NN | Accuracy | 93.03 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ApproxRepSet | Accuracy | 72.6 | | Unverified
2 | REL-RWMD k-NN | Accuracy | 71.05 | | Unverified
3 | Orthogonalized Soft VSM | Accuracy | 69.21 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | KD-LSTMreg | F1 | 72.9 | | Unverified
2 | MAGNET | F1 | 69.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | REL-RWMD k-NN | Accuracy | 96.85 | | Unverified
2 | ApproxRepSet | Accuracy | 96.24 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Document Classification Using Importance of Sentences | Accuracy | 54.8 | | Unverified
2 | LSTM-reg (single model) | Accuracy | 52.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ApproxRepSet | Accuracy | 59.06 | | Unverified
2 | REL-RWMD k-NN | Accuracy | 56.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SPECTER | F1 (micro) | 82 | | Unverified
2 | SciNCL | F1 (micro) | 81.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SciNCL | F1 (micro) | 88.7 | | Unverified
2 | SPECTER | F1 (micro) | 86.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ConvTextTM | Accuracy | 91.28 | | Unverified
2 | HDLTex | Accuracy | 90.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ChuLo | Accuracy | 95.38 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ChuLo | Accuracy | 64.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MPAD-path | Accuracy | 89.81 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BilBOWA | Accuracy | 75 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BilBOWA | Accuracy | 86.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HDLTex | Accuracy | 86.07 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HDLTex | Accuracy | 76.58 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | KD-LSTMreg | Accuracy | 69.4 | | Unverified
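
The benchmark rows above report Accuracy and F1 (including micro-averaged F1), but this page does not say how each paper computed them. The sketch below uses the standard textbook definitions as an assumption. The function names and the toy labels are hypothetical, and the functions return fractions in [0, 1], whereas the tables appear to list percentages.

```python
def accuracy(y_true, y_pred):
    """Fraction of documents whose single predicted label equals the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def micro_f1(true_sets, pred_sets):
    """Micro-averaged F1 for multi-label predictions.

    Each element of `true_sets` and `pred_sets` is the set of labels for one
    document. True positives, false positives, and false negatives are pooled
    over all documents and labels before F1 is computed.
    """
    tp = sum(len(t & p) for t, p in zip(true_sets, pred_sets))
    fp = sum(len(p - t) for t, p in zip(true_sets, pred_sets))
    fn = sum(len(t - p) for t, p in zip(true_sets, pred_sets))
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


if __name__ == "__main__":
    # Toy single-label example: 3 of 4 predictions are correct.
    print(accuracy(["x", "y", "x", "z"], ["x", "y", "z", "z"]))

    # Toy multi-label example: tp = 3, fp = 1, fn = 1.
    true_sets = [{"a", "b"}, {"a"}, {"c"}]
    pred_sets = [{"a"}, {"a", "c"}, {"c"}]
    print(micro_f1(true_sets, pred_sets))
```

Micro-averaging pools counts across labels, so frequent labels dominate the score. A macro-averaged F1 would instead compute F1 per label and average the results, which can give a noticeably different number on long-tailed label distributions.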