| Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment | Aug 18, 2023 | MMLURed Teaming | CodeCode Available | 1 | 5 |
| LM2: Large Memory Models | Feb 9, 2025 | DecoderMMLU | CodeCode Available | 1 | 5 |
| An Open Source Data Contamination Report for Large Language Models | Oct 26, 2023 | HellaSwagLanguage Modeling | CodeCode Available | 1 | 5 |
| Efficient Online Data Mixing For Language Model Pre-Training | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Prompt Optimization via Adversarial In-Context Learning | Dec 5, 2023 | Arithmetic ReasoningData-to-Text Generation | CodeCode Available | 1 | 5 |
| Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | May 19, 2025 | BenchmarkingChatbot | CodeCode Available | 1 | 5 |
| LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain | Apr 2, 2024 | Argument MiningDecision Making | CodeCode Available | 1 | 5 |
| Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging | Jun 24, 2024 | MMLUModel Compression | CodeCode Available | 1 | 5 |
| CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning | Oct 14, 2024 | MathMathematical Reasoning | CodeCode Available | 1 | 5 |
| Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models | Jun 23, 2024 | Machine TranslationMMLU | CodeCode Available | 1 | 5 |