SOTAVerified

Text Classification

Text Classification is the task of assigning a sentence or document an appropriate category. The categories depend on the chosen dataset; topics are one example.
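As a rough illustration of the task, the sketch below trains a tiny multinomial Naive Bayes classifier with Laplace smoothing on four toy sentences and assigns a category to unseen text. The data, the labels ("sports", "business"), and the function names `train_naive_bayes` and `predict` are all made up for this example; it is not one of the methods from the benchmarks on this page.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(examples):
    """Count documents per label and words per label from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return {"label_counts": label_counts, "word_counts": word_counts, "vocab": vocab}


def predict(model, text):
    """Return the label with the highest log-probability for the given text."""
    total_docs = sum(model["label_counts"].values())
    vocab_size = len(model["vocab"])
    best_label, best_score = None, -math.inf
    for label, doc_count in model["label_counts"].items():
        counts = model["word_counts"][label]
        total_words = sum(counts.values())
        # Log prior of the label.
        score = math.log(doc_count / total_docs)
        for word in text.lower().split():
            # Laplace (add-one) smoothing so unseen words do not zero out the score.
            score += math.log((counts[word] + 1) / (total_words + vocab_size))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training set: (text, category) pairs.
TOY_DATA = [
    ("the team won the match", "sports"),
    ("the striker scored a late goal", "sports"),
    ("stocks fell as markets closed", "business"),
    ("the company reported record profits", "business"),
]

model = train_naive_bayes(TOY_DATA)
print(predict(model, "the team scored a goal"))  # sports
print(predict(model, "markets and profits"))     # business
```

A real system would replace the toy data with a labeled dataset and usually a stronger model, but the interface is the same: text in, one category out.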

Text Classification problems include emotion classification, news classification, and citation intent classification, among others. Benchmark datasets for evaluating text classification capabilities include GLUE and AGNews, among others.

In recent years, pretrained deep learning models such as XLNet and RoBERTa have produced some of the largest performance gains on text classification problems.


Papers

Showing 1051-1100 of 3635 papers

| Title | Status | Hype |
|---|---|---|
| Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification | Code | 0 |
| Think from Words(TFW): Initiating Human-Like Cognition in Large Language Models Through Think from Words for Japanese Text-level Classification |  | 0 |
| Comparative Analysis of Multilingual Text Classification & Identification through Deep Learning and Embedding Visualization |  | 0 |
| An Evaluation Framework for Mapping News Headlines to Event Classes in a Knowledge Graph | Code | 0 |
| REDUCR: Robust Data Downsampling Using Class Priority Reweighting | Code | 0 |
| BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification | Code | 0 |
| Dialogue Quality and Emotion Annotations for Customer Support Conversations | Code | 0 |
| Efficient Trigger Word Insertion |  | 0 |
| Fair Text Classification with Wasserstein Independence | Code | 0 |
| Evolving Domain Adaptation of Pretrained Language Models for Text Classification |  | 0 |
| Downstream Trade-offs of a Family of Text Watermarks | Code | 0 |
| Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization |  | 0 |
| Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation | Code | 0 |
| Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets |  | 0 |
| Explore Spurious Correlations at the Concept Level in Language Models for Text Classification | Code | 0 |
| Learning Mutually Informed Representations for Characters and Subwords | Code | 0 |
| A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts | Code | 0 |
| Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion | Code | 0 |
| Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions |  | 0 |
| GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect |  | 0 |
| Robust Text Classification: Analyzing Prototype-Based Networks | Code | 0 |
| Making LLMs Worth Every Penny: Resource-Limited Text Classification in Banking |  | 0 |
| Leveraging Artificial Intelligence Technology for Mapping Research to Sustainable Development Goals: A Case Study |  | 0 |
| RankAug: Augmented data ranking for text classification |  | 0 |
| KPI Extraction from Maintenance Work Orders -- A Comparison of Expert Labeling, Text Classification and AI-Assisted Tagging for Computing Failure Rates of Wind Turbines |  | 0 |
| Learning to Learn for Few-shot Continual Active Learning |  | 0 |
| A Simple yet Efficient Ensemble Approach for AI-generated Text Detection |  | 0 |
| Tackling Concept Shift in Text Classification using Entailment-style Modeling |  | 0 |
| An energy-based comparative analysis of common approaches to text classification in the Legal domain |  | 0 |
| Keyword-optimized Template Insertion for Clinical Information Extraction via Prompt-based Learning |  | 0 |
| XAI-CLASS: Explanation-Enhanced Text Classification with Extremely Weak Supervision |  | 0 |
| Breaking the Token Barrier: Chunking and Convolution for Efficient Long Text Classification with BERT |  | 0 |
| Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck | Code | 0 |
| Sample based Explanations via Generalized Representers |  | 0 |
| Do Not Harm Protected Groups in Debiasing Language Representation Models |  | 0 |
| torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP |  | 0 |
| Sliceformer: Make Multi-head Attention as Simple as Sorting in Discriminative Tasks | Code | 0 |
| On the Interplay between Fairness and Explainability |  | 0 |
| Multi-label Text Classification using GloVe and Neural Network Models |  | 0 |
| Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings | Code | 0 |
| Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities | Code | 0 |
| MedAI Dialog Corpus (MEDIC): Zero-Shot Classification of Doctor and AI Responses in Health Consultations |  | 0 |
| Label-Aware Automatic Verbalizer for Few-Shot Text Classification |  | 0 |
| Rather a Nurse than a Physician -- Contrastive Explanations under Investigation |  | 0 |
| Learning under Label Proportions for Text Classification |  | 0 |
| The effect of stemming and lemmatization on Portuguese fake news text classification |  | 0 |
| United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit | Code | 0 |
| Generative Calibration for In-context Learning | Code | 0 |
| A Comprehensive Study of Privacy Risks in Curriculum Learning |  | 0 |
| VIBE: Topic-Driven Temporal Adaptation for Twitter Classification |  | 0 |

Benchmark Results

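| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ST5-XXL | Accuracy | 73.42 |  | Unverified |
| 2 | ST5-XL | Accuracy | 72.84 |  | Unverified |
| 3 | ST5-Large | Accuracy | 72.31 |  | Unverified |
| 4 | Ada Similarity | Accuracy | 70.44 |  | Unverified |
| 5 | SGPT-5.8B-nli | Accuracy | 70.14 |  | Unverified |
| 6 | ST5-Base | Accuracy | 69.81 |  | Unverified |
| 7 | SGPT-5.8B-msmarco | Accuracy | 68.13 |  | Unverified |
| 8 | MPNet-multilingual | Accuracy | 67.91 |  | Unverified |
| 9 | GTR-XXL | Accuracy | 67.41 |  | Unverified |
| 10 | SimCSE-BERT-sup | Accuracy | 67.32 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Mistral-Small-24B + CAPO | Error | 15.7 |  | Unverified |
| 2 | ToWE-SG | Error | 14 |  | Unverified |
| 3 | Qwen2.5-32B + CAPO | Error | 12.93 |  | Unverified |
| 4 | Llama-3.3-70B + CAPO | Error | 11.2 |  | Unverified |
| 5 | Seq2CNN with GWS(50) | Error | 9.64 |  | Unverified |
| 6 | Char-level CNN | Error | 9.51 |  | Unverified |
| 7 | SVDCNN | Error | 9.45 |  | Unverified |
| 8 | VDCN | Error | 8.67 |  | Unverified |
| 9 | Balanced+bi-leaf-RNN | Error | 7.9 |  | Unverified |
| 10 | CCCapsNet | Error | 7.61 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Seq2CNN(50) | Error | 2.77 |  | Unverified |
| 2 | Char-level CNN | Error | 1.55 |  | Unverified |
| 3 | SWEM-concat | Error | 1.43 |  | Unverified |
| 4 | FastText | Error | 1.4 |  | Unverified |
| 5 | VDCN | Error | 1.29 |  | Unverified |
| 6 | CCCapsNet | Error | 1.28 |  | Unverified |
| 7 | Balanced+bi-leaf-RNN | Error | 1.2 |  | Unverified |
| 8 | BERT large UDA | Error | 1.09 |  | Unverified |
| 9 | M-ACNN | Error | 1.07 |  | Unverified |
| 10 | EXAM | Error | 1 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DeBERTa | Accuracy | 98.45 |  | Unverified |
| 2 | C-BERT (ESGNN + BERT) | Accuracy | 98.28 |  | Unverified |
| 3 | ESGNN | Accuracy | 98.23 |  | Unverified |
| 4 | RoBERTaGCN | Accuracy | 98.2 |  | Unverified |
| 5 | BERT | Accuracy | 98.17 |  | Unverified |
| 6 | SGNN | Accuracy | 98.09 |  | Unverified |
| 7 | ERNIE 2.0 | Accuracy | 98.04 |  | Unverified |
| 8 | DistilBERT | Accuracy | 97.98 |  | Unverified |
| 9 | Our Model* | Accuracy | 97.8 |  | Unverified |
| 10 | ALBERTv2 | Accuracy | 97.62 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TM-Glove | Error | 9.96 |  | Unverified |
| 2 | byte mLSTM7 | Error | 9.6 |  | Unverified |
| 3 | SWEM-aver | Error | 7.8 |  | Unverified |
| 4 | DELTA (CNN) | Error | 7.8 |  | Unverified |
| 5 | Capsule-B | Error | 7.2 |  | Unverified |
| 6 | STM+TSED+PT+2L | Error | 7.04 |  | Unverified |
| 7 | GRU-RNN-GLOVE | Error | 7 |  | Unverified |
| 8 | MPAD-path | Error | 6.2 |  | Unverified |
| 9 | VLAWE | Error | 5.8 |  | Unverified |
| 10 | C-LSTM | Error | 5.4 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LinearSVM+TFIDF | Accuracy | 93 |  | Unverified |
| 2 | RoBERTaGCN | Accuracy | 89.5 |  | Unverified |
| 3 | SSGC | Accuracy | 88.6 |  | Unverified |
| 4 | SGC | Accuracy | 88.5 |  | Unverified |
| 5 | SGCN | Accuracy | 88.5 |  | Unverified |
| 6 | RMDL (15 RDLs) | Accuracy | 87.91 |  | Unverified |
| 7 | Sparse Tensor Classifier | Accuracy | 87.3 |  | Unverified |
| 8 | GraphStar | Accuracy | 86.9 |  | Unverified |
| 9 | NABoE-full | Accuracy | 86.8 |  | Unverified |
| 10 | Text GCN | Accuracy | 86.34 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ELECTRA + ANN | F1 | 99.6 |  | Unverified |
| 2 | ERNIE + ANN | F1 | 99.4 |  | Unverified |
| 3 | XLNet + ANN | F1 | 99.2 |  | Unverified |
| 4 | RoBERTa + ANN | F1 | 98.7 |  | Unverified |
| 5 | Longformer + ANN | F1 | 93.9 |  | Unverified |
| 6 | BERT + ANN | F1 | 90.5 |  | Unverified |
| 7 | ALBERT + ANN | F1 | 79.7 |  | Unverified |
| 8 | BERT | F1 | 75 |  | Unverified |
| 9 | DistilBERT | F1 | 74.4 |  | Unverified |
| 10 | XLNet | F1 | 74 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RoBERTaGCN | Accuracy | 72.8 |  | Unverified |
| 2 | Our Model* | Accuracy | 69.4 |  | Unverified |
| 3 | SSGC | Accuracy | 68.5 |  | Unverified |
| 4 | SGC | Accuracy | 68.5 |  | Unverified |
| 5 | SGCN | Accuracy | 68.5 |  | Unverified |
| 6 | Text GCN | Accuracy | 68.36 |  | Unverified |
| 7 | GraphStar | Accuracy | 64.2 |  | Unverified |
| 8 | ApproxRepSet | Accuracy | 64.06 |  | Unverified |
| 9 | REL-RWMD k-NN | Accuracy | 58.74 |  | Unverified |
| 10 | CNN+Lowercased | Accuracy | 36.2 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BERT-ITPT-FiT | Accuracy | 77.62 |  | Unverified |
| 2 | DRNN | Accuracy | 76.26 |  | Unverified |
| 3 | DELTA (HAN) | Accuracy | 75.1 |  | Unverified |
| 4 | EXAM | Accuracy | 74.8 |  | Unverified |
| 5 | DNC+CUW | Accuracy | 74.3 |  | Unverified |
| 6 | ULMFiT (Small data) | Accuracy | 74.3 |  | Unverified |
| 7 | CCCapsNet | Accuracy | 73.85 |  | Unverified |
| 8 | SWEM-concat | Accuracy | 73.53 |  | Unverified |
| 9 | FastText | Accuracy | 72.3 |  | Unverified |
| 10 | Seq2CNN(50) | Accuracy | 55.39 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DeBERTa | Accuracy | 90.21 |  | Unverified |
| 2 | RoBERTaGCN | Accuracy | 89.7 |  | Unverified |
| 3 | ERNIE 2.0 (optimized) | Accuracy | 89.53 |  | Unverified |
| 4 | RoBERTa | Accuracy | 89.42 |  | Unverified |
| 5 | ERNIE 2.0 | Accuracy | 88.97 |  | Unverified |
| 6 | BERT | Accuracy | 86.94 |  | Unverified |
| 7 | ALBERTv2 | Accuracy | 86.02 |  | Unverified |
| 8 | DistilBERT | Accuracy | 85.31 |  | Unverified |
| 9 | SSGC | Accuracy | 76.7 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CliReBERT (P0L3/clirebert_clirevocab_uncased) | Evaluation Macro F1 | 0.65 |  | Unverified |
| 2 | ClimateBERT (climatebert/distilroberta-base-climate-f) | Evaluation Macro F1 | 0.64 |  | Unverified |
| 3 | BERT (google-bert/bert-base-uncased) | Evaluation Macro F1 | 0.61 |  | Unverified |
| 4 | CliSciBERT (P0L3/cliscibert_scivocab_uncased) | Evaluation Macro F1 | 0.61 |  | Unverified |
| 5 | SciBERT (allenai/scibert_scivocab_cased) | Evaluation Macro F1 | 0.59 |  | Unverified |
| 6 | DistilRoBERTa (distilbert/distilroberta-base) | Evaluation Macro F1 | 0.58 |  | Unverified |
| 7 | SciClimateBERT (P0L3/sciclimatebert) | Evaluation Macro F1 | 0.58 |  | Unverified |
| 8 | RoBERTa (FacebookAI/roberta-base) | Evaluation Macro F1 | 0.57 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Human (Post-Rec.) (Spangher et al., 2021) | macro F1 | 73.69 |  | Unverified |
| 2 | MT-Mac (Spangher et al., 2021) | macro F1 | 63.46 |  | Unverified |
| 3 | MT-Mic (Spangher et al., 2021) | macro F1 | 61.89 |  | Unverified |
| 4 | RL-IP/TT (Choubey et al., 2021) | macro F1 | 57 |  | Unverified |
| 5 | Document LSTM + Document encoding (Choubey et al., 2020) | macro F1 | 54.4 |  | Unverified |
| 6 | CRF Fine-grained (Choubey et al., 2020) | macro F1 | 52.9 |  | Unverified |
| 7 | Human (Blind) (Spangher et al., 2021) | macro F1 | 46.18 |  | Unverified |
| 8 | Feature-based (SVM) (Choubey et al., 2020) | macro F1 | 38.3 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | 1-6 BertGCN | Accuracy | 96.6 |  | Unverified |
| 2 | GraphStar | Accuracy | 95 |  | Unverified |
| 3 | Our Model* | Accuracy | 94.6 |  | Unverified |
| 4 | SSGC | Accuracy | 94.5 |  | Unverified |
| 5 | SGC | Accuracy | 94 |  | Unverified |
| 6 | SGCN | Accuracy | 94 |  | Unverified |
| 7 | Text GCN | Accuracy | 93.56 |  | Unverified |
| 8 | TM-Glove | Accuracy | 89.14 |  | Unverified |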
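
The benchmark tables report Accuracy, Error, F1, and macro F1. The page does not state how each benchmark computes them, so the sketch below assumes the conventional definitions: accuracy as the percentage of correct predictions, error as 100 minus accuracy, and macro F1 as the unweighted mean of per-class F1 scores. The function names and the toy labels are illustrative only. Note also that the listings mix scales: some macro F1 values are on a 0-1 scale and others on a 0-100 scale.

```python
def accuracy(y_true, y_pred):
    """Percentage of predictions that match the reference labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return 100.0 * correct / len(y_true)


def error_rate(y_true, y_pred):
    """Percentage of wrong predictions (assumed here to be 100 - accuracy)."""
    return 100.0 - accuracy(y_true, y_pred)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores, on a 0-1 scale."""
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Hypothetical labels for four documents.
y_true = ["a", "a", "a", "b"]
y_pred = ["a", "a", "b", "b"]

print(accuracy(y_true, y_pred))    # 75.0
print(error_rate(y_true, y_pred))  # 25.0
print(round(macro_f1(y_true, y_pred), 4))  # 0.7333
```

Because macro F1 weights every class equally, it can diverge sharply from accuracy on imbalanced datasets, which is one reason the two should not be compared across tables.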