SOTAVerified

Sentence Classification

Papers

Showing 1-50 of 303 papers

Title | Status | Hype
Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models | Code | 3
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | Code | 3
PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts | Code | 3
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Code | 2
ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT | Code | 2
CLUE: A Chinese Language Understanding Evaluation Benchmark | Code | 2
A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | Code | 1
Multi-label Sequential Sentence Classification via Large Language Model | Code | 1
Multi-Granularity Guided Fusion-in-Decoder | Code | 1
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification | Code | 1
DISCO: Distilling Counterfactuals with Large Language Models | Code | 1
Quantum Natural Language Generation on Near-Term Devices | Code | 1
Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual Understanding With Multilingual Language Models | Code | 1
Finding Dataset Shortcuts with Grammar Induction | Code | 1
AttentionSiteDTI: an interpretable graph-based model for drug-target interaction prediction using NLP sentence-level relation classification | Code | 1
SynWMD: Syntax-aware Word Mover's Distance for Sentence Similarity Evaluation | Code | 1
DataMUX: Data Multiplexing for Neural Networks | Code | 1
Evidence Selection as a Token-Level Prediction Task | Code | 1
Revisiting Self-Training for Few-Shot Learning of Language Model | Code | 1
Sentence Bottleneck Autoencoders from Transformer Language Models | Code | 1
UIUC_BioNLP at SemEval-2021 Task 11: A Cascade of Neural Models for Structuring Scholarly NLP Contributions | Code | 1
Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers | Code | 1
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark | Code | 1
Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence | Code | 1
UIUC_BioNLP at SemEval-2021 Task 11: A Cascade of Neural Models for Structuring Scholarly NLP Contributions | Code | 1
MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs | Code | 1
QNLP in Practice: Running Compositional Models of Meaning on a Quantum Computer | Code | 1
Cross-Domain Multi-Task Learning for Sequential Sentence Classification in Research Papers | Code | 1
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Code | 1
Improving BERT with Syntax-aware Local Attention | Code | 1
FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations | Code | 1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding | Code | 1
GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples | Code | 1
Discrete Latent Variable Representations for Low-Resource Text Classification | Code | 1
The Russian Drug Reaction Corpus and Neural Models for Drug Reactions and Effectiveness Detection in User Reviews | Code | 1
Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data | Code | 1
SciBERT: A Pretrained Language Model for Scientific Text | Code | 1
BioBERT: a pre-trained biomedical language representation model for biomedical text mining | Code | 1
Jointly Learning to Label Sentences and Tokens | Code | 1
ListOps: A Diagnostic Dataset for Latent Tree Learning | Code | 1
Convolutional Neural Networks for Sentence Classification | Code | 1
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks | Code | 0
Consolidating and Developing Benchmarking Datasets for the Nepali Natural Language Understanding Tasks | | 0
Analyzing the Evolution of Graphs and Texts | | 0
Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers | | 0
Constructing the CORD-19 Vaccine Dataset | | 0
Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning | Code | 0
MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | | 0
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models | | 0
Estimating the Level of Dialectness Predicts Interannotator Agreement in Multi-dialect Arabic Datasets | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SciBERT | F1 | 84.9 | | Unverified
2 | Structural-scaffolds | F1 | 84 | | Unverified
3 | BiLSTM-Attention + ELMo | F1 | 82.6 | | Unverified
4 | Feature-Rich Random Forest | F1 | 79.6 | | Unverified
5 | BiLSTM-Attention | F1 | 77.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FE-MLM + Span | F1 | 78.1 | | Unverified
2 | SciBERT | F1 | 70.98 | | Unverified
3 | Structural-scaffolds | F1 | 67.9 | | Unverified
4 | Random Forest | F1 | 53 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SciBERT (SciVocab) | F1 | 65.71 | | Unverified
2 | SciBERT (Base Vocab) | F1 | 64.02 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hierarchical Neural Networks | F1 | 92.6 | | Unverified
2 | SciBERT (Base Vocab) | F1 | 86.81 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SciBERT (SciVocab) | F1 | 84.99 | | Unverified
2 | SciBERT (Base Vocab) | F1 | 84.43 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa-large | Macro F1 | 70.9 | | Unverified