| ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models | Aug 15, 2024 | In-Context LearningMMLU | CodeCode Available | 1 |
| A deeper look at depth pruning of LLMs | Jul 23, 2024 | MMLU | CodeCode Available | 1 |
| Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Jul 5, 2024 | General KnowledgeInstruction Following | CodeCode Available | 1 |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Jun 25, 2024 | ARCLanguage Modeling | CodeCode Available | 1 |
| Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging | Jun 24, 2024 | MMLUModel Compression | CodeCode Available | 1 |
| Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models | Jun 23, 2024 | Machine TranslationMMLU | CodeCode Available | 1 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Jun 20, 2024 | Collaborative InferenceLanguage Modeling | CodeCode Available | 1 |
| OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning | May 28, 2024 | MMLU | CodeCode Available | 1 |
| Instruction Tuning With Loss Over Instructions | May 23, 2024 | HumanEvalMMLU | CodeCode Available | 1 |
| LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Apr 2, 2024 | Argument MiningDecision Making | CodeCode Available | 1 |