| Evaluating Gender Bias in Large Language Models | Nov 14, 2024 | Model SelectionSentence | —Unverified | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Oct 21, 2024 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Jul 14, 2024 | Bias DetectionQuestion Answering | —Unverified | 0 |
| Ranking LLMs by compression | Jun 20, 2024 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| Mixture-of-Subspaces in Low-Rank Adaptation | Jun 16, 2024 | Common Sense ReasoningImage Generation | CodeCode Available | 0 |
| Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language | May 3, 2024 | PredictionSentence | —Unverified | 0 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense ReasoningGPU | CodeCode Available | 3 |
| Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Feb 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 2 |
| Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models | Dec 21, 2023 | SentenceSentence Completion | —Unverified | 0 |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | Dec 12, 2023 | CPUGPU | —Unverified | 0 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Dec 1, 2023 | 2D Pose EstimationCommon Sense Reasoning | CodeCode Available | 6 |
| The Falcon Series of Open Language Models | Nov 28, 2023 | DecoderMulti-task Language Understanding | —Unverified | 0 |
| mahaNLP: A Marathi Natural Language Processing Library | Nov 5, 2023 | Hate Speech DetectionNER | CodeCode Available | 0 |
| BTRec: BERT-Based Trajectory Recommendation for Personalized Tours | Oct 30, 2023 | Language ModellingSentence | CodeCode Available | 0 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models | Sep 16, 2023 | Age/Bias-conflictingBias Detection | CodeCode Available | 0 |
| Exploiting Language Models as a Source of Knowledge for Cognitive Agents | Sep 5, 2023 | Natural Language InferenceQuestion Answering | —Unverified | 0 |
| I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection | Aug 8, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Llama 2: Open Foundation and Fine-Tuned Chat Models | Jul 18, 2023 | Arithmetic Reasoning | CodeCode Available | 8 |
| Stay on topic with Classifier-Free Guidance | Jun 30, 2023 | Code GenerationCommon Sense Reasoning | —Unverified | 0 |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | May 30, 2023 | BenchmarkingIn-Context Learning | CodeCode Available | 0 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | May 23, 2023 | Common Sense ReasoningCommon Sense Reasoning (Zero-Shot) | CodeCode Available | 2 |
| PaLM 2 Technical Report | May 17, 2023 | Code GenerationCommon Sense Reasoning | CodeCode Available | 0 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Apr 27, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 2 |
| BloombergGPT: A Large Language Model for Finance | Mar 30, 2023 | Causal JudgmentCommon Sense Reasoning | CodeCode Available | 0 |
| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| LLaMA: Open and Efficient Foundation Language Models | Feb 27, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 7 |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Feb 7, 2023 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| Numeracy from Literacy: Data Science as an Emergent Skill from Large Language Models | Jan 31, 2023 | DescriptiveFeature Importance | —Unverified | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | Dec 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Implicit causality in GPT-2: a case study | Dec 8, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Crosslingual Generalization through Multitask Finetuning | Nov 3, 2022 | Coreference ResolutionCross-Lingual Transfer | CodeCode Available | 2 |
| Two is Better than Many? Binary Classification as an Effective Approach to Multi-Choice Question Answering | Oct 29, 2022 | Binary ClassificationQuestion Answering | CodeCode Available | 1 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Oct 28, 2022 | Common Sense ReasoningCoreference Resolution | —Unverified | 0 |
| DiscoSense: Commonsense Reasoning with Discourse Connectives | Oct 22, 2022 | Sentence Completion | CodeCode Available | 0 |
| Task Compass: Scaling Multi-task Pre-training with Task Prefix | Oct 12, 2022 | Common Sense ReasoningData Augmentation | CodeCode Available | 1 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 |
| Effidit: Your AI Writing Assistant | Aug 3, 2022 | Keywords to SentencesRetrieval | —Unverified | 0 |
| SC-Ques: A Sentence Completion Question Dataset for English as a Second Language Learners | Jun 24, 2022 | SentenceSentence Completion | CodeCode Available | 0 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | MisconceptionsSentence | CodeCode Available | 5 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | May 1, 2022 | SentenceSentence Completion | CodeCode Available | 1 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 |
| Training Compute-Optimal Large Language Models | Mar 29, 2022 | AnachronismsAnalogical Similarity | CodeCode Available | 6 |
| Efficient Language Modeling with Sparse all-MLP | Mar 14, 2022 | AllCommon Sense Reasoning | —Unverified | 0 |
| Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | Jan 28, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 3 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| SeqPATE: Differentially Private Text Generation via Knowledge Distillation | Sep 29, 2021 | Knowledge DistillationSentence | —Unverified | 0 |
| Language Models as a Knowledge Source for Cognitive Agents | Sep 17, 2021 | Language ModellingNatural Language Inference | —Unverified | 0 |