| Labeling supervised fine-tuning data with the scaling law | May 5, 2024 | coreference-resolutionCoreference Resolution | CodeCode Available | 7 |
| Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling | Apr 3, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 6 |
| N-Grammer: Augmenting Transformers with latent n-grams | Jul 13, 2022 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 4 |
| Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective | Oct 16, 2022 | Coreference ResolutionMultiple-choice | CodeCode Available | 4 |
| RAKG:Document-level Retrieval Augmented Knowledge Graph Construction | Apr 14, 2025 | coreference-resolutionCoreference Resolution | CodeCode Available | 3 |
| Attention Is All You Need | Jun 12, 2017 | Abstractive Text SummarizationAll | CodeCode Available | 3 |
| Scaling Instruction-Finetuned Language Models | Oct 20, 2022 | Coreference ResolutionCross-Lingual Question Answering | CodeCode Available | 3 |
| Finetuned Language Models Are Zero-Shot Learners | Sep 3, 2021 | ARCCommon Sense Reasoning | CodeCode Available | 3 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | Feb 17, 2022 | ARCCommon Sense Reasoning | CodeCode Available | 3 |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | Oct 11, 2018 | Citation Intent ClassificationCommon Sense Reasoning | CodeCode Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability predictionArticles | CodeCode Available | 3 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | May 23, 2023 | Common Sense ReasoningCommon Sense Reasoning (Zero-Shot) | CodeCode Available | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Apr 27, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| Ask Me Anything: A simple strategy for prompting language models | Oct 5, 2022 | Coreference ResolutionNatural Language Inference | CodeCode Available | 2 |
| Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends | Jul 31, 2024 | coreference-resolutionCoreference Resolution | CodeCode Available | 2 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Dec 28, 2022 | 8kCoreference Resolution | CodeCode Available | 2 |
| AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model | Aug 2, 2022 | Causal Language ModelingCommon Sense Reasoning | CodeCode Available | 2 |
| Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | Oct 23, 2019 | Answer GenerationCommon Sense Reasoning | CodeCode Available | 2 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference ResolutionCross-Lingual Transfer | CodeCode Available | 2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Jun 5, 2020 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 |
| A Cluster Ranking Model for Full Anaphora Resolution | Nov 21, 2019 | Coreference Resolution | CodeCode Available | 1 |
| 2∗n is better than n^2: Decomposing Event Coreference Resolution into Two Tractable Problems | Jul 1, 2023 | coreference-resolutionCoreference Resolution | CodeCode Available | 1 |
| Autoregressive Structured Prediction with Language Models | Oct 26, 2022 | Coreference ResolutionNamed Entity Recognition | CodeCode Available | 1 |
| A Case Study for Compliance as Code with Graphs and Language Models: Public release of the Regulatory Knowledge Graph | Feb 3, 2023 | coreference-resolutionCoreference Resolution | CodeCode Available | 1 |