SOTAVerified

Document Classification

Document Classification is a procedure of assigning one or more labels to a document from a predetermined set of labels.

Source: Long-length Legal Document Classification

Papers

Showing 125 of 641 papers

TitleStatusHype
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-AwarenessCode6
BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and MiningCode4
Pre-Training with Whole Word Masking for Chinese BERTCode3
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language ModelsCode3
Visually Guided Generative Text-Layout Pre-training for Document IntelligenceCode2
LinkBERT: Pretraining Language Models with Document LinksCode2
One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian OptimizationCode2
Document Classification for COVID-19 LiteratureCode1
Data Programming by Demonstration: A Framework for Interactively Learning Labeling FunctionsCode1
SPECTER: Document-level Representation Learning using Citation-informed TransformersCode1
Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequencesCode1
DocBERT: BERT for Document ClassificationCode1
A Comparative Study of Pretrained Language Models for Long Clinical TextCode1
A Corpus for Multilingual Document Classification in Eight LanguagesCode1
Can a Fruit Fly Learn Word Embeddings?Code1
Bioformer: an efficient transformer language model for biomedical text miningCode1
ChordMixer: A Scalable Neural Attention Model for Sequences with Different LengthsCode1
Aspect-based Document Similarity for Research PapersCode1
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERTCode1
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERTCode1
A Sentence-level Hierarchical BERT Model for Document Classification with Limited Labelled DataCode1
Bridge Correlational Neural Networks for Multilingual Multimodal Representation LearningCode1
Balancing Methods for Multi-label Text Classification with Long-Tailed Class DistributionCode1
ContraDoc: Understanding Self-Contradictions in Documents with Large Language ModelsCode1
Pre-training technique to localize medical BERT and enhance biomedical BERTCode1
Show:102550
← PrevPage 1 of 26Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy97.17Unverified
2REL-RWMD k-NNAccuracy95.61Unverified
3Orthogonalized Soft VSMAccuracy92.65Unverified
4MAGNETF189.9Unverified
5VLAWEF189.3Unverified
6KD-LSTMregF188.9Unverified
7LSTM-reg (single model)F187Unverified
8SCDV-MSF182.71Unverified
#ModelMetricClaimedVerifiedStatus
1ACNetAccuracy83.5Unverified
2LGCNAccuracy83.3Unverified
3GATAccuracy83Unverified
4MoNetAccuracy81.7Unverified
5DeepWalkAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1BioLinkBERT (large)F188.1Unverified
2NCBI_BERT(large) (P)F187.3Unverified
3SciFive-largeF186.08Unverified
4BioGPTMicro F185.12Unverified
5PubMedBERT uncasedMicro F182.32Unverified
#ModelMetricClaimedVerifiedStatus
1MPAD-pathAccuracy99.59Unverified
2Orthogonalized Soft VSMAccuracy97.73Unverified
3ApproxRepSetAccuracy95.73Unverified
4REL-RWMD k-NNAccuracy95.18Unverified
#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy94.31Unverified
2Orthogonalized Soft VSMAccuracy93.42Unverified
3REL-RWMD k-NNAccuracy93.03Unverified
#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy72.6Unverified
2REL-RWMD k-NNAccuracy71.05Unverified
3Orthogonalized Soft VSMAccuracy69.21Unverified
#ModelMetricClaimedVerifiedStatus
1KD-LSTMregF172.9Unverified
2MAGNETF169.6Unverified
#ModelMetricClaimedVerifiedStatus
1REL-RWMD k-NNAccuracy96.85Unverified
2ApproxRepSetAccuracy96.24Unverified
#ModelMetricClaimedVerifiedStatus
1Document Classification Using Importance of SentencesAccuracy54.8Unverified
2LSTM-reg (single model)Accuracy52.8Unverified
#ModelMetricClaimedVerifiedStatus
1ApproxRepSetAccuracy59.06Unverified
2REL-RWMD k-NNAccuracy56.8Unverified
#ModelMetricClaimedVerifiedStatus
1SPECTERF1 (micro)82Unverified
2SciNCLF1 (micro)81.4Unverified
#ModelMetricClaimedVerifiedStatus
1SciNCLF1 (micro)88.7Unverified
2SPECTERF1 (micro)86.4Unverified
#ModelMetricClaimedVerifiedStatus
1ConvTextTMAccuracy91.28Unverified
2HDLTexAccuracy90.93Unverified
#ModelMetricClaimedVerifiedStatus
1ChuLoAccuracy95.38Unverified
#ModelMetricClaimedVerifiedStatus
1ChuLoAccuracy64.4Unverified
#ModelMetricClaimedVerifiedStatus
1MPAD-pathAccuracy89.81Unverified
#ModelMetricClaimedVerifiedStatus
1BilBOWAAccuracy75Unverified
#ModelMetricClaimedVerifiedStatus
1BilBOWAAccuracy86.5Unverified
#ModelMetricClaimedVerifiedStatus
1HDLTexAccuracy86.07Unverified
#ModelMetricClaimedVerifiedStatus
1HDLTexAccuracy76.58Unverified
#ModelMetricClaimedVerifiedStatus
1KD-LSTMregAccuracy69.4Unverified