| Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts | Mar 3, 2025 | Ethics | CodeCode Available | 0 |
| None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering | Mar 3, 2025 | Business EthicsEthics | —Unverified | 0 |
| Cyber for AI at SemEval-2025 Task 4: Forgotten but Not Lost: The Balancing Act of Selective Unlearning in Large Language Models | Mar 2, 2025 | Ethics | —Unverified | 0 |
| BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge | Mar 1, 2025 | EthicsModel Selection | —Unverified | 0 |
| Mapping Trustworthiness in Large Language Models: A Bibliometric Analysis Bridging Theory to Practice | Feb 27, 2025 | EthicsFairness | —Unverified | 0 |
| Measure of Morality: A Mathematical Theory of Egalitarian Ethics | Feb 25, 2025 | EthicsPhilosophy | —Unverified | 0 |
| Dynamic LLM Routing and Selection based on User Preferences: Balancing Performance, Cost, and Ethics | Feb 23, 2025 | Ethics | —Unverified | 0 |
| Revealing the Pragmatic Dilemma for Moral Reasoning Acquisition in Language Models | Feb 23, 2025 | Ethics | —Unverified | 0 |
| Multi-Agent Risks from Advanced AI | Feb 19, 2025 | Ethics | —Unverified | 0 |
| Toward Robust Non-Transferable Learning: A Survey and Benchmark | Feb 19, 2025 | EthicsSurvey | CodeCode Available | 0 |