
Document Classification

Document classification is the task of assigning one or more labels to a document from a predetermined set of labels.

Source: Long-length Legal Document Classification
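To make the definition concrete, here is a minimal sketch of multi-label assignment from a fixed label set. The label names, keyword lists, and the `classify` helper are all hypothetical and chosen only for illustration; the papers listed on this page use learned models rather than keyword matching.

```python
from collections import Counter
from typing import List

# Hypothetical label set with illustrative keywords (not from any listed paper).
LABEL_KEYWORDS = {
    "legal": {"court", "contract", "statute"},
    "medical": {"patient", "clinical", "diagnosis"},
    "finance": {"bank", "loan", "interest"},
}


def classify(document: str, min_hits: int = 1) -> List[str]:
    """Return every label whose keywords occur at least `min_hits` times."""
    # Naive whitespace tokenization; punctuation is not stripped.
    tokens = Counter(document.lower().split())
    labels = []
    for label, keywords in LABEL_KEYWORDS.items():
        hits = sum(tokens[word] for word in keywords)
        if hits >= min_hits:
            labels.append(label)
    return labels
```

A document can receive several labels, exactly one, or none, which is what distinguishes the multi-label setting from single-label classification.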

Papers

Showing 51-100 of 641 papers

| Title | Status | Hype |
|---|---|---|
| Tsetlin Machine Embedding: Representing Words Using Logical Expressions | Code | 1 |
| HDLTex: Hierarchical Deep Learning for Text Classification | Code | 1 |
| Pre-training technique to localize medical BERT and enhance biomedical BERT | Code | 1 |
| Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem | Code | 1 |
| DocBERT: BERT for Document Classification | Code | 1 |
| A Comparative Study of Pretrained Language Models for Long Clinical Text | Code | 1 |
| Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences | Code | 1 |
| Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT | Code | 1 |
| Document Classification for COVID-19 Literature | Code | 1 |
| Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles | Code | 1 |
| A Sentence-level Hierarchical BERT Model for Document Classification with Limited Labelled Data | Code | 1 |
| NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents | Code | 1 |
| ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models | Code | 1 |
| Aspect-based Document Similarity for Research Papers | Code | 1 |
| Data Programming by Demonstration: A Framework for Interactively Learning Labeling Functions | Code | 1 |
| SPECTER: Document-level Representation Learning using Citation-informed Transformers | Code | 1 |
| HEAL: Hierarchical Embedding Alignment Loss for Improved Retrieval and Representation Learning | Code | 1 |
| Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes | Code | 1 |
| GeoGalactica: A Scientific Large Language Model in Geoscience | Code | 1 |
| German's Next Language Model | Code | 1 |
| Semi-Supervised Classification with Graph Convolutional Networks | Code | 1 |
| Hierarchical Metadata-Aware Document Categorization under Weak Supervision | Code | 1 |
| Hierarchical Transformers for Long Document Classification | Code | 1 |
| HiPool: Modeling Long Documents Using Graph Neural Networks | Code | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Code | 1 |
| Improving Document Classification with Multi-Sense Embeddings | Code | 1 |
| Keyword Assisted Topic Models | Code | 1 |
| L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages | Code | 1 |
| LSD-C: Linearly Separable Deep Clusters | Code | 1 |
| MAGNET: Multi-Label Text Classification using Attention-based Graph Neural Network | Code | 1 |
| Massively Multilingual Sparse Word Representations | Code | 1 |
| Minimally Supervised Categorization of Text with Metadata | Code | 1 |
| Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning | Code | 1 |
| Can a Fruit Fly Learn Word Embeddings? | Code | 1 |
| Multi-label Few/Zero-shot Learning with Knowledge Aggregated from Multiple Label Graphs | Code | 1 |
| A Simple Approach to Learning Unsupervised Multilingual Embeddings | | 0 |
| A Multi-task Approach to Learning Multilingual Representations | | 0 |
| A Semi-supervised Approach for Natural Language Call Routing | | 0 |
| A semi-automatic method for document classification in the shipping industry | | 0 |
| A Multiplicative Model for Learning Distributed Text-Based Attribute Representations | | 0 |
| A Semantic Cover Approach for Topic Modeling | | 0 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | | 0 |
| A Methodology for Evaluating Timeline Generation Algorithms based on Deep Semantic Units | | 0 |
| Argument Component Classification by Relation Identification by Neural Network and TextRank | | 0 |
| Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs | | 0 |
| Clustering of Multi-Word Named Entity variants: Multilingual Evaluation | | 0 |
| A Leveled Reading Corpus of Modern Standard Arabic | | 0 |
| A Comparative Study of Conversion Aided Methods for WordNet Sentence Textual Similarity | | 0 |
| Approximate Conditional Coverage & Calibration via Neural Model Approximations | | 0 |
| Applying Naive Bayes Classification to Google Play Apps Categorization | | 0 |

Benchmark Results

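| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 97.17 | | Unverified |
| 2 | REL-RWMD k-NN | Accuracy | 95.61 | | Unverified |
| 3 | Orthogonalized Soft VSM | Accuracy | 92.65 | | Unverified |
| 4 | MAGNET | F1 | 89.9 | | Unverified |
| 5 | VLAWE | F1 | 89.3 | | Unverified |
| 6 | KD-LSTMreg | F1 | 88.9 | | Unverified |
| 7 | LSTM-reg (single model) | F1 | 87 | | Unverified |
| 8 | SCDV-MS | F1 | 82.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ACNet | Accuracy | 83.5 | | Unverified |
| 2 | LGCN | Accuracy | 83.3 | | Unverified |
| 3 | GAT | Accuracy | 83 | | Unverified |
| 4 | MoNet | Accuracy | 81.7 | | Unverified |
| 5 | DeepWalk | Accuracy | 67.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BioLinkBERT (large) | F1 | 88.1 | | Unverified |
| 2 | NCBI_BERT(large) (P) | F1 | 87.3 | | Unverified |
| 3 | SciFive-large | F1 | 86.08 | | Unverified |
| 4 | BioGPT | Micro F1 | 85.12 | | Unverified |
| 5 | PubMedBERT uncased | Micro F1 | 82.32 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MPAD-path | Accuracy | 99.59 | | Unverified |
| 2 | Orthogonalized Soft VSM | Accuracy | 97.73 | | Unverified |
| 3 | ApproxRepSet | Accuracy | 95.73 | | Unverified |
| 4 | REL-RWMD k-NN | Accuracy | 95.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 94.31 | | Unverified |
| 2 | Orthogonalized Soft VSM | Accuracy | 93.42 | | Unverified |
| 3 | REL-RWMD k-NN | Accuracy | 93.03 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 72.6 | | Unverified |
| 2 | REL-RWMD k-NN | Accuracy | 71.05 | | Unverified |
| 3 | Orthogonalized Soft VSM | Accuracy | 69.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KD-LSTMreg | F1 | 72.9 | | Unverified |
| 2 | MAGNET | F1 | 69.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | REL-RWMD k-NN | Accuracy | 96.85 | | Unverified |
| 2 | ApproxRepSet | Accuracy | 96.24 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Document Classification Using Importance of Sentences | Accuracy | 54.8 | | Unverified |
| 2 | LSTM-reg (single model) | Accuracy | 52.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ApproxRepSet | Accuracy | 59.06 | | Unverified |
| 2 | REL-RWMD k-NN | Accuracy | 56.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SPECTER | F1 (micro) | 82 | | Unverified |
| 2 | SciNCL | F1 (micro) | 81.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SciNCL | F1 (micro) | 88.7 | | Unverified |
| 2 | SPECTER | F1 (micro) | 86.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ConvTextTM | Accuracy | 91.28 | | Unverified |
| 2 | HDLTex | Accuracy | 90.93 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ChuLo | Accuracy | 95.38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ChuLo | Accuracy | 64.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MPAD-path | Accuracy | 89.81 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BilBOWA | Accuracy | 75 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BilBOWA | Accuracy | 86.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HDLTex | Accuracy | 86.07 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HDLTex | Accuracy | 76.58 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KD-LSTMreg | Accuracy | 69.4 | | Unverified |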
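
The benchmark tables above report Accuracy, F1, and micro-averaged F1. This page does not say how each benchmark computes its metric, so the following is only a sketch of the standard textbook definitions. It assumes labels are represented as Python sets and that accuracy means exact match of the full label set; the function names are illustrative.

```python
from typing import List, Set


def accuracy(y_true: List[Set[str]], y_pred: List[Set[str]]) -> float:
    """Fraction of documents whose predicted label set matches exactly."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def micro_f1(y_true: List[Set[str]], y_pred: List[Set[str]]) -> float:
    """Micro-averaged F1: pool TP, FP, and FN over all documents and labels."""
    tp = fp = fn = 0
    for true, pred in zip(y_true, y_pred):
        tp += len(true & pred)   # labels predicted and correct
        fp += len(pred - true)   # labels predicted but wrong
        fn += len(true - pred)   # labels missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


y_true = [{"a"}, {"a", "b"}, {"c"}]
y_pred = [{"a"}, {"a"}, {"b"}]
print(accuracy(y_true, y_pred))  # 1 of 3 documents matches exactly
print(micro_f1(y_true, y_pred))  # TP=2, FP=1, FN=2, so F1 = 4/7
```

These functions return values in the range 0 to 1, whereas the claimed scores in the tables appear to be on a 0 to 100 scale.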