SOTAVerified

Document Classification

Document Classification is a procedure of assigning one or more labels to a document from a predetermined set of labels.

Source: Long-length Legal Document Classification

Papers

Showing 51100 of 641 papers

TitleStatusHype
NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long DocumentsCode1
Prompted Contextual Vectors for Spear-Phishing DetectionCode1
NLP for Knowledge Discovery and Information Extraction from Energetics Corpora0
Efficient Models for the Detection of Hate, Abuse and Profanity0
Generalized Sobolev Transport for Probability Measures on a GraphCode0
ANLS* -- A Universal Document Processing Metric for Generative Large Language ModelsCode1
L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic LanguagesCode1
GeoGalactica: A Scientific Large Language Model in GeoscienceCode1
Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs0
A Learning oriented DLP System based on Classification Model0
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRACode0
Large language models in healthcare and medical domain: A review0
Summarization-based Data Augmentation for Document ClassificationCode0
SUT: a new multi-purpose synthetic dataset for Farsi document image analysisCode0
Learning Section Weights for Multi-Label Document Classification0
Causality is all you need0
ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science0
ContraDoc: Understanding Self-Contradictions in Documents with Large Language ModelsCode1
Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets0
A Multi-Modal Multilingual Benchmark for Document Image Classification0
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents0
Optimal Transport for Measures with Noisy Tree MetricCode0
BibRank: Automatic Keyphrase Extraction Platform Using~MetadataCode0
An Analysis on Large Language Models in Healthcare: A Case Study of BioBERT0
KoBigBird-large: Transformation of Transformer for Korean Language Understanding0
Beyond Document Page Classification: Design, Datasets, and ChallengesCode0
Feature Extraction Using Deep Generative Models for Bangla Text Classification on a New Comprehensive Dataset0
Taken by Surprise: Contrast effect for Similarity ScoresCode1
Accelerated materials language processing enabled by GPT0
Large Language Model Prompt Chaining for Long Legal Document Classification0
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning0
Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs0
UMLS-KGI-BERT: Data-Centric Knowledge Integration in Transformers for Biomedical Entity Recognition0
Can Model Fusing Help Transformers in Long Document Classification? An Empirical StudyCode0
Attention over pre-trained Sentence Embeddings for Long Document Classification0
MDACE: MIMIC Documents Annotated with Code EvidenceCode0
Unsupervised Sentiment Analysis of Plastic Surgery Social Media Posts0
On Evaluation of Document Classification using RVL-CDIP0
Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage TrainingCode1
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers0
Transformer-Based UNet with Multi-Headed Cross-Attention Skip Connections to Eliminate Artifacts in Scanned Documents0
End-to-End Document Classification and Key Information Extraction using Assignment Optimization0
GVdoc: Graph-based Visual Document ClassificationCode0
Neural Natural Language Processing for Long Texts: A Survey on Classification and Summarization0
DUBLIN -- Document Understanding By Language-Image Network0
DLUE: Benchmarking Document Language Understanding0
CWTM: Leveraging Contextualized Word Embeddings from BERT for Neural Topic ModelingCode0
A General-Purpose Multilingual Document EncoderCode0
Benchmarking large language models for biomedical natural language processing applications and recommendationsCode1
HiPool: Modeling Long Documents Using Graph Neural NetworksCode1
Show:102550
← PrevPage 2 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy97.17Unverified
2REL-RWMD k-NNAccuracy95.61Unverified
3Orthogonalized Soft VSMAccuracy92.65Unverified
4MAGNETF189.9Unverified
5VLAWEF189.3Unverified
6KD-LSTMregF188.9Unverified
7LSTM-reg (single model)F187Unverified
8SCDV-MSF182.71Unverified
#ModelMetricClaimedVerifiedStatus
1ACNetAccuracy83.5Unverified
2LGCNAccuracy83.3Unverified
3GATAccuracy83Unverified
4MoNetAccuracy81.7Unverified
5DeepWalkAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1BioLinkBERT (large)F188.1Unverified
2NCBI_BERT(large) (P)F187.3Unverified
3SciFive-largeF186.08Unverified
4BioGPTMicro F185.12Unverified
5PubMedBERT uncasedMicro F182.32Unverified
#ModelMetricClaimedVerifiedStatus
1MPAD-pathAccuracy99.59Unverified
2Orthogonalized Soft VSMAccuracy97.73Unverified
3ApproxRepSetAccuracy95.73Unverified
4REL-RWMD k-NNAccuracy95.18Unverified
#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy94.31Unverified
2Orthogonalized Soft VSMAccuracy93.42Unverified
3REL-RWMD k-NNAccuracy93.03Unverified
#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy72.6Unverified
2REL-RWMD k-NNAccuracy71.05Unverified
3Orthogonalized Soft VSMAccuracy69.21Unverified
#ModelMetricClaimedVerifiedStatus
1KD-LSTMregF172.9Unverified
2MAGNETF169.6Unverified
#ModelMetricClaimedVerifiedStatus
1REL-RWMD k-NNAccuracy96.85Unverified
2ApproxRepSetAccuracy96.24Unverified
#ModelMetricClaimedVerifiedStatus
1Document Classification Using Importance of SentencesAccuracy54.8Unverified
2LSTM-reg (single model)Accuracy52.8Unverified
#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy59.06Unverified
2REL-RWMD k-NNAccuracy56.8Unverified
#ModelMetricClaimedVerifiedStatus
1SPECTERF1 (micro)82Unverified
2SciNCLF1 (micro)81.4Unverified
#ModelMetricClaimedVerifiedStatus
1SciNCLF1 (micro)88.7Unverified
2SPECTERF1 (micro)86.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvTextTMAccuracy91.28Unverified
2HDLTexAccuracy90.93Unverified
#ModelMetricClaimedVerifiedStatus
1ChuLoAccuracy95.38Unverified
#ModelMetricClaimedVerifiedStatus
1ChuLoAccuracy64.4Unverified
#ModelMetricClaimedVerifiedStatus
1MPAD-pathAccuracy89.81Unverified
#ModelMetricClaimedVerifiedStatus
1BilBOWAAccuracy75Unverified
#ModelMetricClaimedVerifiedStatus
1BilBOWAAccuracy86.5Unverified
#ModelMetricClaimedVerifiedStatus
1HDLTexAccuracy86.07Unverified
#ModelMetricClaimedVerifiedStatus
1HDLTexAccuracy76.58Unverified
#ModelMetricClaimedVerifiedStatus
1KD-LSTMregAccuracy69.4Unverified