| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Apr 27, 2023 | Common Sense Reasoning, Coreference Resolution | Code Available | 2 |
| BloombergGPT: A Large Language Model for Finance | Mar 30, 2023 | Causal Judgment, Common Sense Reasoning | Code Available | 0 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability prediction, Arithmetic Reasoning | Code Available | 6 |
| LLaMA: Open and Efficient Foundation Language Models | Feb 27, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 7 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Feb 7, 2023 | Common Sense Reasoning, Coreference Resolution | Code Available | 1 |
| Numeracy from Literacy: Data Science as an Emergent Skill from Large Language Models | Jan 31, 2023 | Descriptive, Feature Importance | Unverified | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | Dec 16, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| Implicit causality in GPT-2: a case study | Dec 8, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference Resolution, Cross-Lingual Transfer | Code Available | 2 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Oct 29, 2022 | Binary Classification, Question Answering | Code Available | 1 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Oct 28, 2022 | Common Sense Reasoning, Coreference Resolution | Unverified | 0 |
| DiscoSense: Commonsense Reasoning with Discourse Connectives | Oct 22, 2022 | Sentence Completion | Code Available | 0 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Oct 12, 2022 | Common Sense Reasoning, Data Augmentation | Code Available | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense Reasoning, Coreference Resolution | Code Available | 1 |
| Effidit: Your AI Writing Assistant | Aug 3, 2022 | Keywords to Sentences, Retrieval | Unverified | 0 |
| SC-Ques: A Sentence Completion Question Dataset for English as a Second Language Learners | Jun 24, 2022 | Sentence, Sentence Completion | Code Available | 0 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | Misconceptions, Sentence | Code Available | 5 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | May 1, 2022 | Sentence, Sentence Completion | Code Available | 1 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto Debugging, Code Generation | Code Available | 2 |
| Training Compute-Optimal Large Language Models | Mar 29, 2022 | Anachronisms, Analogical Similarity | Code Available | 6 |
| Efficient Language Modeling with Sparse all-MLP | Mar 14, 2022 | All, Common Sense Reasoning | Unverified | 0 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot Learning, Language Modeling | Code Available | 3 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| SeqPATE: Differentially Private Text Generation via Knowledge Distillation | Sep 29, 2021 | Knowledge Distillation, Sentence | Unverified | 0 |
| Language Models as a Knowledge Source for Cognitive Agents | Sep 17, 2021 | Language Modelling, Natural Language Inference | Unverified | 0 |