| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 | 5 |
| HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems | May 17, 2025 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 | 5 |
| "Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process | May 4, 2023 | Moral Scenarios | CodeCode Available | 1 | 5 |
| Evaluating the Moral Beliefs Encoded in LLMs | Jul 26, 2023 | Moral ScenariosSurvey | CodeCode Available | 1 | 5 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Measurement of LLM's Philosophies of Human Nature | Apr 3, 2025 | Moral Scenarios | CodeCode Available | 0 | 5 |
| M^3oralBench: A MultiModal Moral Benchmark for LVLMs | Dec 30, 2024 | Moral Scenarios | CodeCode Available | 0 | 5 |
| MOKA: Moral Knowledge Augmentation for Moral Event Extraction | Nov 16, 2023 | ArticlesEvent Extraction | CodeCode Available | 0 | 5 |
| SaGE: Evaluating Moral Consistency in Large Language Models | Feb 21, 2024 | Decision MakingHellaSwag | CodeCode Available | 0 | 5 |
| Measuring Moral Inconsistencies in Large Language Models | Jan 26, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| Learning Tractable Probabilistic Models for Moral Responsibility and Blame | Oct 8, 2018 | Decision MakingManagement | —Unverified | 0 | 0 |
| Moral Sparks in Social Media Narratives | Oct 30, 2023 | EthicsInformativeness | —Unverified | 0 | 0 |
| Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents | Dec 31, 2024 | Moral Scenarios | —Unverified | 0 | 0 |
| Prompt and Prejudice | Aug 7, 2024 | Decision MakingMoral Scenarios | —Unverified | 0 | 0 |
| Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning | Jun 25, 2023 | counterfactualMath | —Unverified | 0 | 0 |
| The Moral Turing Test: Evaluating Human-LLM Alignment in Moral Decision-Making | Oct 9, 2024 | Decision MakingMoral Scenarios | —Unverified | 0 | 0 |
| Fine-Tuning Language Models for Ethical Ambiguity: A Comparative Study of Alignment with Human Responses | Oct 10, 2024 | Moral Scenarios | —Unverified | 0 | 0 |