
Document Classification

Document classification is the task of assigning one or more labels to a document from a predetermined set of labels.

Source: Long-length Legal Document Classification
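
To make the definition above concrete, here is a minimal sketch of multi-label assignment from a fixed label set. The labels, the keyword lists, and the `classify` helper are all hypothetical and chosen only for illustration; none of the papers listed on this page uses this keyword-matching rule, which stands in for a trained model.

```python
import re

# Hypothetical predetermined label set, each label with illustrative keywords.
LABEL_KEYWORDS = {
    "legal": {"court", "contract", "plaintiff", "statute"},
    "medical": {"patient", "clinical", "diagnosis", "treatment"},
    "finance": {"stock", "market", "dividend", "portfolio"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def classify(text, min_hits=1):
    """Return every label with at least `min_hits` distinct keywords in the text.

    The result can hold zero, one, or several labels, which is the
    multi-label case of the definition above.
    """
    tokens = set(tokenize(text))
    return sorted(
        label
        for label, keywords in LABEL_KEYWORDS.items()
        if len(tokens & keywords) >= min_hits
    )


print(classify("The court reviewed the patient's clinical records."))
```

Raising `min_hits` makes the assignment stricter; a document that matches no label simply receives an empty list.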

Papers

Showing 101-150 of 641 papers

| Title | Status | Hype |
|---|---|---|
| Leveraging BERT Language Model for Arabic Long Document Classification | | 0 |
| A New Information Theory of Certainty for Machine Learning | | 0 |
| HeRo: RoBERTa and Longformer Hebrew Language Models | | 0 |
| Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding | Code | 1 |
| Disentangling Structure and Style: Political Bias Detection in News by Inducing Document Hierarchy | Code | 0 |
| A semi-automatic method for document classification in the shipping industry | | 0 |
| Finding the Needle in a Haystack: Unsupervised Rationale Extraction from Long Text Classifiers | | 0 |
| Neural Nonnegative Matrix Factorization for Hierarchical Multilayer Topic Modeling | | 0 |
| Elementwise Language Representation | | 0 |
| MatKB: Semantic Search for Polycrystalline Materials Synthesis Procedures | Code | 0 |
| Bioformer: an efficient transformer language model for biomedical text mining | Code | 1 |
| A Comparative Study of Pretrained Language Models for Long Clinical Text | Code | 1 |
| FewShotTextGCN: K-hop neighborhood regularization for few-shot learning on graphs | | 0 |
| ClassBases at CASE-2022 Multilingual Protest Event Detection Tasks: Multilingual Protest News Detection and Automatically Replicating Manually Created Event Datasets | Code | 0 |
| Multimodal Side-Tuning for Document Classification | Code | 1 |
| Hawk: An Industrial-strength Multi-label Document Classifier | | 0 |
| Tsetlin Machine Embedding: Representing Words Using Logical Expressions | Code | 1 |
| Human in the loop: How to effectively create coherent topics by manually labeling only a few documents per class | | 0 |
| Generalised Spherical Text Embedding | | 0 |
| Text Representation Enrichment Utilizing Graph based Approaches: Stock Market Technical Analysis Case Study | | 0 |
| Extended Multilingual Protest News Detection -- Shared Task 1, CASE 2021 and 2022 | | 0 |
| Processing Long Legal Documents with Pre-trained Transformers: Modding LegalBERT and Longformer | | 0 |
| BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining | Code | 4 |
| Evaluating Out-of-Distribution Performance on Document Image Classifiers | Code | 0 |
| Lbl2Vec: An Embedding-Based Approach for Unsupervised Document Retrieval on Predefined Topics | Code | 1 |
| An Exploration of Hierarchical Attention Transformers for Efficient Long Document Classification | Code | 0 |
| Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents | | 0 |
| CNN-Trans-Enc: A CNN-Enhanced Transformer-Encoder On Top Of Static BERT representations for Document Classification | | 0 |
| Flexible Job Classification with Zero-Shot Learning | | 0 |
| D2GCLF: Document-to-Graph Classifier for Legal Document Classification | | 0 |
| BL.Research at SemEval-2022 Task 8: Using various Semantic Information to evaluate document-level Semantic Textual Similarity | Code | 0 |
| Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding | | 0 |
| Supervised Dictionary Learning with Auxiliary Covariates | Code | 0 |
| ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths | Code | 1 |
| Knowledge-based Document Classification with Shannon Entropy | | 0 |
| LDRNet: Enabling Real-time Document Localization on Mobile Devices | Code | 1 |
| UMUTextStats: A linguistic feature extraction tool for Spanish | | 0 |
| Enriching Epidemiological Thematic Features For Disease Surveillance Corpora Classification | | 0 |
| ConvTextTM: An Explainable Convolutional Tsetlin Machine Framework for Text Classification | | 0 |
| Approximate Conditional Coverage & Calibration via Neural Model Approximations | | 0 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | Code | 6 |
| BabyBear: Cheap inference triage for expensive language models | Code | 0 |
| VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification | | 0 |
| Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem | Code | 1 |
| Towards Comprehensive Patent Approval Predictions: Beyond Traditional Document Classification | | 0 |
| Revisiting Transformer-based Models for Long Document Classification | Code | 1 |
| Analysis of Sparse Subspace Clustering: Experiments and Random Projection | | 0 |
| LinkBERT: Pretraining Language Models with Document Links | Code | 2 |
| An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex | Code | 0 |
| Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking | | 0 |

Benchmark Results

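| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 97.17 | | Unverified |
| 2 | REL-RWMD k-NN | Accuracy | 95.61 | | Unverified |
| 3 | Orthogonalized Soft VSM | Accuracy | 92.65 | | Unverified |
| 4 | MAGNET | F1 | 89.9 | | Unverified |
| 5 | VLAWE | F1 | 89.3 | | Unverified |
| 6 | KD-LSTMreg | F1 | 88.9 | | Unverified |
| 7 | LSTM-reg (single model) | F1 | 87 | | Unverified |
| 8 | SCDV-MS | F1 | 82.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ACNet | Accuracy | 83.5 | | Unverified |
| 2 | LGCN | Accuracy | 83.3 | | Unverified |
| 3 | GAT | Accuracy | 83 | | Unverified |
| 4 | MoNet | Accuracy | 81.7 | | Unverified |
| 5 | DeepWalk | Accuracy | 67.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioLinkBERT (large) | F1 | 88.1 | | Unverified |
| 2 | NCBI_BERT(large) (P) | F1 | 87.3 | | Unverified |
| 3 | SciFive-large | F1 | 86.08 | | Unverified |
| 4 | BioGPT | Micro F1 | 85.12 | | Unverified |
| 5 | PubMedBERT uncased | Micro F1 | 82.32 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MPAD-path | Accuracy | 99.59 | | Unverified |
| 2 | Orthogonalized Soft VSM | Accuracy | 97.73 | | Unverified |
| 3 | ApproxRepSet | Accuracy | 95.73 | | Unverified |
| 4 | REL-RWMD k-NN | Accuracy | 95.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 94.31 | | Unverified |
| 2 | Orthogonalized Soft VSM | Accuracy | 93.42 | | Unverified |
| 3 | REL-RWMD k-NN | Accuracy | 93.03 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 72.6 | | Unverified |
| 2 | REL-RWMD k-NN | Accuracy | 71.05 | | Unverified |
| 3 | Orthogonalized Soft VSM | Accuracy | 69.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KD-LSTMreg | F1 | 72.9 | | Unverified |
| 2 | MAGNET | F1 | 69.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | REL-RWMD k-NN | Accuracy | 96.85 | | Unverified |
| 2 | ApproxRepSet | Accuracy | 96.24 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Document Classification Using Importance of Sentences | Accuracy | 54.8 | | Unverified |
| 2 | LSTM-reg (single model) | Accuracy | 52.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 59.06 | | Unverified |
| 2 | REL-RWMD k-NN | Accuracy | 56.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SPECTER | F1 (micro) | 82 | | Unverified |
| 2 | SciNCL | F1 (micro) | 81.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SciNCL | F1 (micro) | 88.7 | | Unverified |
| 2 | SPECTER | F1 (micro) | 86.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConvTextTM | Accuracy | 91.28 | | Unverified |
| 2 | HDLTex | Accuracy | 90.93 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ChuLo | Accuracy | 95.38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ChuLo | Accuracy | 64.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MPAD-path | Accuracy | 89.81 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BilBOWA | Accuracy | 75 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BilBOWA | Accuracy | 86.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HDLTex | Accuracy | 86.07 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HDLTex | Accuracy | 76.58 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KD-LSTMreg | Accuracy | 69.4 | | Unverified |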
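
The results above are reported as Accuracy, F1, or micro-averaged F1. This page does not say how each benchmark computes its metric, so the following is only a sketch under the standard textbook definitions: accuracy for single-label predictions, and micro F1 for multi-label predictions given as sets. The function names and the toy data are illustrative assumptions, not taken from any listed paper.

```python
def accuracy(y_true, y_pred):
    """Fraction of documents whose predicted label equals the gold label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def micro_f1(y_true, y_pred):
    """Micro-averaged F1 for multi-label predictions.

    Each element of y_true and y_pred is a set of labels for one document.
    True positives, false positives, and false negatives are pooled over
    all documents and labels before F1 is computed.
    """
    tp = sum(len(t & p) for t, p in zip(y_true, y_pred))
    fp = sum(len(p - t) for t, p in zip(y_true, y_pred))
    fn = sum(len(t - p) for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy single-label example: 3 of 4 predictions are correct.
print(accuracy(["x", "y", "x", "z"], ["x", "y", "z", "z"]))

# Toy multi-label example: tp = 2, fp = 1, fn = 1.
print(micro_f1([{"a", "b"}, {"c"}], [{"a"}, {"c", "d"}]))
```

Because the scores in different tables may rest on different metrics and datasets, they are not directly comparable across tables.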