SOTAVerified

Sentence Classification

Papers

Showing 125 of 303 papers

TitleStatusHype
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingCode3
Cyber-Attack Technique Classification Using Two-Stage Trained Large Language ModelsCode3
PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical AbstractsCode3
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric LearningCode2
ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERTCode2
CLUE: A Chinese Language Understanding Evaluation BenchmarkCode2
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language UnderstandingCode1
Finding Dataset Shortcuts with Grammar InductionCode1
Jointly Learning to Label Sentences and TokensCode1
Discrete Latent Variable Representations for Low-Resource Text ClassificationCode1
CBLUE: A Chinese Biomedical Language Understanding Evaluation BenchmarkCode1
GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled ExamplesCode1
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot ClassificationCode1
Improving BERT with Syntax-aware Local AttentionCode1
FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input RepresentationsCode1
Evidence Selection as a Token-Level Prediction TaskCode1
Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced DataCode1
DataMUX: Data Multiplexing for Neural NetworksCode1
Convolutional Neural Networks for Sentence ClassificationCode1
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in BanglaCode1
BioBERT: a pre-trained biomedical language representation model for biomedical text miningCode1
A Personalized Conversational Benchmark: Towards Simulating Personalized ConversationsCode1
AttentionSiteDTI: an interpretable graph-based model for drug-target interaction prediction using NLP sentence-level relation classificationCode1
Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat ViolenceCode1
DISCO: Distilling Counterfactuals with Large Language ModelsCode1
Show:102550
← PrevPage 1 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciBERTF184.9Unverified
2Structural-scaffoldsF184Unverified
3BiLSTM-Attention + ELMoF182.6Unverified
4Feature-Rich Random ForestF179.6Unverified
5BiLSTM-AttentionF177.2Unverified
#ModelMetricClaimedVerifiedStatus
1FE-MLM + SpanF178.1Unverified
2SciBERTF170.98Unverified
3Structural-scaffoldsF167.9Unverified
4Random ForestF153Unverified
#ModelMetricClaimedVerifiedStatus
1SciBERT (SciVocab)F165.71Unverified
2SciBERT (Base Vocab)F164.02Unverified
#ModelMetricClaimedVerifiedStatus
1Hierarchical Neural NetworksF192.6Unverified
2SciBERT (Base Vocab)F186.81Unverified
#ModelMetricClaimedVerifiedStatus
1SciBERT (SciVocab)F184.99Unverified
2SciBERT (Base Vocab)F184.43Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa-largeMacro F170.9Unverified