| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 | 5 |
| HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems | May 17, 2025 | Arithmetic Reasoning, Code Generation | Code Available | 1 | 5 |
| "Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process | May 4, 2023 | Moral Scenarios | Code Available | 1 | 5 |
| Evaluating the Moral Beliefs Encoded in LLMs | Jul 26, 2023 | Moral Scenarios, Survey | Code Available | 1 | 5 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Measurement of LLM's Philosophies of Human Nature | Apr 3, 2025 | Moral Scenarios | Code Available | 0 | 5 |
| M^3oralBench: A MultiModal Moral Benchmark for LVLMs | Dec 30, 2024 | Moral Scenarios | Code Available | 0 | 5 |
| MOKA: Moral Knowledge Augmentation for Moral Event Extraction | Nov 16, 2023 | Articles, Event Extraction | Code Available | 0 | 5 |
| SaGE: Evaluating Moral Consistency in Large Language Models | Feb 21, 2024 | Decision Making, HellaSwag | Code Available | 0 | 5 |
| Measuring Moral Inconsistencies in Large Language Models | Jan 26, 2024 | Decision Making, Language Modeling | Unverified | 0 | 0 |