| Llama 2: Open Foundation and Fine-Tuned Chat Models | Jul 18, 2023 | Arithmetic Reasoning | CodeCode Available | 8 |
| LLaMA: Open and Efficient Foundation Language Models | Feb 27, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 7 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Training Compute-Optimal Large Language Models | Mar 29, 2022 | AnachronismsAnalogical Similarity | CodeCode Available | 6 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Dec 1, 2023 | 2D Pose EstimationCommon Sense Reasoning | CodeCode Available | 6 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | MisconceptionsSentence | CodeCode Available | 5 |
| Finetuned Language Models Are Zero-Shot Learners | Sep 3, 2021 | ARCCommon Sense Reasoning | CodeCode Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability predictionArticles | CodeCode Available | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense ReasoningGPU | CodeCode Available | 3 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference ResolutionCross-Lingual Transfer | CodeCode Available | 2 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | May 23, 2023 | Common Sense ReasoningCommon Sense Reasoning (Zero-Shot) | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Apr 27, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 2 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Jun 5, 2020 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Jun 1, 2021 | Hate Speech DetectionHurtful Sentence Completion | CodeCode Available | 1 |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Mar 24, 2021 | Common Sense ReasoningHellaSwag | CodeCode Available | 1 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Oct 29, 2022 | Binary ClassificationQuestion Answering | CodeCode Available | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| GePpeTto Carves Italian into a Language Model | Apr 29, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Feb 7, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Jul 26, 2019 | Common Sense ReasoningDocument Image Classification | CodeCode Available | 1 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | May 1, 2022 | SentenceSentence Completion | CodeCode Available | 1 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Oct 12, 2022 | Common Sense ReasoningData Augmentation | CodeCode Available | 1 |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | Dec 12, 2023 | CPUGPU | —Unverified | 0 |
| Ranking LLMs by compression | Jun 20, 2024 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks | Oct 7, 2020 | ClassificationGeneral Classification | —Unverified | 0 |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Jul 14, 2024 | Bias DetectionQuestion Answering | —Unverified | 0 |
| Clause Final Verb Prediction in Hindi: Evidence for Noisy Channel Model of Communication | Jun 1, 2021 | PredictionSentence | —Unverified | 0 |
| Computational Approaches to Sentence Completion | Jul 1, 2012 | Language ModellingQuestion Answering | —Unverified | 0 |
| Contextual LSTM (CLSTM) models for Large scale NLP tasks | Feb 19, 2016 | ArticlesParaphrase Generation | —Unverified | 0 |
| Defining and Evaluating Fair Natural Language Generation | Jul 28, 2020 | FairnessSentence | —Unverified | 0 |
| Dependency Language Models for Sentence Completion | Oct 1, 2013 | Language ModellingMachine Translation | —Unverified | 0 |
| Differentially Private n-gram Extraction | Aug 5, 2021 | Response GenerationSentence | —Unverified | 0 |
| Efficient Language Modeling with Sparse all-MLP | Mar 14, 2022 | AllCommon Sense Reasoning | —Unverified | 0 |
| Effidit: Your AI Writing Assistant | Aug 3, 2022 | Keywords to SentencesRetrieval | —Unverified | 0 |
| Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language | May 3, 2024 | PredictionSentence | —Unverified | 0 |
| Evaluating Gender Bias in Large Language Models | Nov 14, 2024 | Model SelectionSentence | —Unverified | 0 |
| Expect the unexpected: Harnessing Sentence Completion for Sarcasm Detection | Jul 19, 2017 | Sarcasm DetectionSentence | —Unverified | 0 |
| Exploiting Language Models as a Source of Knowledge for Cognitive Agents | Sep 5, 2023 | Natural Language InferenceQuestion Answering | —Unverified | 0 |
| Exploiting Linguistic Features for Sentence Completion | Aug 1, 2016 | SentenceSentence Completion | —Unverified | 0 |
| Filling Conversation Ellipsis for Better Social Dialog Understanding | Nov 25, 2019 | PredictionSemantic Role Labeling | —Unverified | 0 |
| Hybrid Model For Word Prediction Using Naive Bayes and Latent Information | Mar 2, 2018 | SentenceSentence Completion | —Unverified | 0 |
| iCap: Interactive Image Captioning with Predictive Text | Jan 31, 2020 | Image CaptioningSentence | —Unverified | 0 |
| Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models | Dec 21, 2023 | SentenceSentence Completion | —Unverified | 0 |
| Implicit causality in GPT-2: a case study | Dec 8, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |