| Llama 2: Open Foundation and Fine-Tuned Chat Models | Jul 18, 2023 | Arithmetic Reasoning | CodeCode Available | 8 |
| LLaMA: Open and Efficient Foundation Language Models | Feb 27, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 7 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Dec 1, 2023 | 2D Pose EstimationCommon Sense Reasoning | CodeCode Available | 6 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Training Compute-Optimal Large Language Models | Mar 29, 2022 | AnachronismsAnalogical Similarity | CodeCode Available | 6 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | MisconceptionsSentence | CodeCode Available | 5 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense ReasoningGPU | CodeCode Available | 3 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| Finetuned Language Models Are Zero-Shot Learners | Sep 3, 2021 | ARCCommon Sense Reasoning | CodeCode Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability predictionArticles | CodeCode Available | 3 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | May 23, 2023 | Common Sense ReasoningCommon Sense Reasoning (Zero-Shot) | CodeCode Available | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Apr 27, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference ResolutionCross-Lingual Transfer | CodeCode Available | 2 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Jun 5, 2020 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Feb 7, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Oct 29, 2022 | Binary ClassificationQuestion Answering | CodeCode Available | 1 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Oct 12, 2022 | Common Sense ReasoningData Augmentation | CodeCode Available | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | May 1, 2022 | SentenceSentence Completion | CodeCode Available | 1 |
| HONEST: Measuring Hurtful Sentence Completion in Language Models | Jun 1, 2021 | Hate Speech DetectionHurtful Sentence Completion | CodeCode Available | 1 |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Mar 24, 2021 | Common Sense ReasoningHellaSwag | CodeCode Available | 1 |
| GePpeTto Carves Italian into a Language Model | Apr 29, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Jul 26, 2019 | Common Sense ReasoningDocument Image Classification | CodeCode Available | 1 |
| Evaluating Gender Bias in Large Language Models | Nov 14, 2024 | Model SelectionSentence | —Unverified | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Oct 21, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Jul 14, 2024 | Bias DetectionQuestion Answering | —Unverified | 0 |
| Ranking LLMs by compression | Jun 20, 2024 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| Mixture-of-Subspaces in Low-Rank Adaptation | Jun 16, 2024 | Common Sense ReasoningImage Generation | CodeCode Available | 0 |
| Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language | May 3, 2024 | PredictionSentence | —Unverified | 0 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Feb 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models | Dec 21, 2023 | SentenceSentence Completion | —Unverified | 0 |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | Dec 12, 2023 | CPUGPU | —Unverified | 0 |
| The Falcon Series of Open Language Models | Nov 28, 2023 | DecoderMulti-task Language Understanding | —Unverified | 0 |
| mahaNLP: A Marathi Natural Language Processing Library | Nov 5, 2023 | Hate Speech DetectionNER | CodeCode Available | 0 |
| BTRec: BERT-Based Trajectory Recommendation for Personalized Tours | Oct 30, 2023 | Language ModellingSentence | CodeCode Available | 0 |
| Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Sep 16, 2023 | Age/Bias-conflictingBias Detection | CodeCode Available | 0 |
| Exploiting Language Models as a Source of Knowledge for Cognitive Agents | Sep 5, 2023 | Natural Language InferenceQuestion Answering | —Unverified | 0 |
| I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection | Aug 8, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Stay on topic with Classifier-Free Guidance | Jun 30, 2023 | Code GenerationCommon Sense Reasoning | —Unverified | 0 |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | May 30, 2023 | BenchmarkingIn-Context Learning | CodeCode Available | 0 |
| PaLM 2 Technical Report | May 17, 2023 | Code GenerationCommon Sense Reasoning | CodeCode Available | 0 |
| BloombergGPT: A Large Language Model for Finance | Mar 30, 2023 | Causal JudgmentCommon Sense Reasoning | CodeCode Available | 0 |
| Numeracy from Literacy: Data Science as an Emergent Skill from Large Language Models | Jan 31, 2023 | DescriptiveFeature Importance | —Unverified | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Implicit causality in GPT-2: a case study | Dec 8, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |