| Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models | Nov 27, 2024 | ClassificationSentence | CodeCode Available | 3 |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | Oct 11, 2018 | Citation Intent ClassificationCommon Sense Reasoning | CodeCode Available | 3 |
| PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts | Oct 17, 2017 | General ClassificationSentence | CodeCode Available | 3 |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Feb 5, 2024 | In-Context LearningMetric Learning | CodeCode Available | 2 |
| ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT | Nov 30, 2022 | Molecular System PredictionSentence Classification | CodeCode Available | 2 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Apr 13, 2020 | General ClassificationMachine Reading Comprehension | CodeCode Available | 2 |
| A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | May 20, 2025 | SentenceSentence Classification | CodeCode Available | 1 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive LearningExtractive Summarization | CodeCode Available | 1 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification | Nov 1, 2023 | ClassificationLanguage Modelling | CodeCode Available | 1 |
| DISCO: Distilling Counterfactuals with Large Language Models | Dec 20, 2022 | counterfactualData Augmentation | CodeCode Available | 1 |
| Quantum Natural Language Generation on Near-Term Devices | Nov 1, 2022 | Image ManipulationMusic Generation | CodeCode Available | 1 |
| Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual Understanding With Multilingual Language Models | Oct 22, 2022 | Cross-Lingual TransferNatural Language Understanding | CodeCode Available | 1 |
| Finding Dataset Shortcuts with Grammar Induction | Oct 20, 2022 | DiagnosticSentence | CodeCode Available | 1 |
| AttentionSiteDTI: an interpretable graph-based model for drug-target interaction prediction using NLP sentence-level relation classification | Jul 12, 2022 | Drug DiscoveryRelation Classification | CodeCode Available | 1 |
| SynWMD: Syntax-aware Word Mover's Distance for Sentence Similarity Evaluation | Jun 20, 2022 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| DataMUX: Data Multiplexing for Neural Networks | Feb 18, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| Evidence Selection as a Token-Level Prediction Task | Nov 1, 2021 | Claim VerificationEvidence Selection | CodeCode Available | 1 |
| Revisiting Self-Training for Few-Shot Learning of Language Model | Oct 4, 2021 | BenchmarkingFew-Shot Learning | CodeCode Available | 1 |
| Sentence Bottleneck Autoencoders from Transformer Language Models | Aug 31, 2021 | DecoderDenoising | CodeCode Available | 1 |
| UIUC\_BioNLP at SemEval-2021 Task 11: A Cascade of Neural Models for Structuring Scholarly NLP Contributions | Aug 1, 2021 | SentenceSentence Classification | CodeCode Available | 1 |
| Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers | Jul 12, 2021 | Active LearningClassification | CodeCode Available | 1 |
| CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark | Jun 15, 2021 | Intent ClassificationMedical Concept Normalization | CodeCode Available | 1 |
| Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence | May 27, 2021 | ArticlesDocument Ranking | CodeCode Available | 1 |
| UIUC_BioNLP at SemEval-2021 Task 11: A Cascade of Neural Models for Structuring Scholarly NLP Contributions | May 12, 2021 | Keyphrase ExtractionRelation Extraction | CodeCode Available | 1 |
| MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs | Apr 18, 2021 | Abstractive Text SummarizationMachine Translation | CodeCode Available | 1 |
| QNLP in Practice: Running Compositional Models of Meaning on a Quantum Computer | Feb 25, 2021 | SentenceSentence Classification | CodeCode Available | 1 |
| Cross-Domain Multi-Task Learning for Sequential Sentence Classification in Research Papers | Feb 11, 2021 | Multi-Task LearningSentence | CodeCode Available | 1 |
| BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Jan 1, 2021 | Document ClassificationLanguage Modeling | CodeCode Available | 1 |
| Improving BERT with Syntax-aware Local Attention | Dec 30, 2020 | Machine TranslationQuestion Answering | CodeCode Available | 1 |
| FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations | Oct 23, 2020 | NERPOS | CodeCode Available | 1 |
| IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding | Sep 11, 2020 | BenchmarkingDiversity | CodeCode Available | 1 |
| GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples | Jul 1, 2020 | General ClassificationSentence | CodeCode Available | 1 |
| Discrete Latent Variable Representations for Low-Resource Text Classification | Jun 11, 2020 | ClassificationGeneral Classification | CodeCode Available | 1 |
| The Russian Drug Reaction Corpus and Neural Models for Drug Reactions and Effectiveness Detection in User Reviews | Apr 7, 2020 | General Classificationnamed-entity-recognition | CodeCode Available | 1 |
| Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data | Mar 16, 2020 | ClassificationData Augmentation | CodeCode Available | 1 |
| SciBERT: A Pretrained Language Model for Scientific Text | Mar 26, 2019 | Citation Intent ClassificationDependency Parsing | CodeCode Available | 1 |
| BioBERT: a pre-trained biomedical language representation model for biomedical text mining | Jan 25, 2019 | Drug–drug Interaction ExtractionFew-Shot Learning | CodeCode Available | 1 |
| Jointly Learning to Label Sentences and Tokens | Nov 14, 2018 | Grammatical Error DetectionSentence | CodeCode Available | 1 |
| ListOps: A Diagnostic Dataset for Latent Tree Learning | Apr 17, 2018 | DiagnosticListOps | CodeCode Available | 1 |
| Convolutional Neural Networks for Sentence Classification | Aug 25, 2014 | Emotion Recognition in ConversationGeneral Classification | CodeCode Available | 1 |
| Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks | Jan 5, 2025 | Adversarial RobustnessBenchmarking | CodeCode Available | 0 |
| Consolidating and Developing Benchmarking Datasets for the Nepali Natural Language Understanding Tasks | Nov 28, 2024 | BenchmarkingNatural Language Inference | —Unverified | 0 |
| Analyzing the Evolution of Graphs and Texts | Nov 9, 2024 | Representation LearningSentence Classification | —Unverified | 0 |
| Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers | Aug 6, 2024 | Classificationregression | —Unverified | 0 |
| Constructing the CORD-19 Vaccine Dataset | Jul 26, 2024 | Question AnsweringSentence | —Unverified | 0 |
| Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning | Jul 24, 2024 | Anomaly DetectionIn-Context Learning | CodeCode Available | 0 |
| MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | Jul 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models | Jun 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Estimating the Level of Dialectness Predicts Interannotator Agreement in Multi-dialect Arabic Datasets | May 18, 2024 | SentenceSentence Classification | CodeCode Available | 0 |