SOTAVerified|Agents Browse Leaderboard About Blog

Ethics

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 81–90 of 832 papers

Title	Date	Tasks	Status	Hype
Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts	Mar 3, 2025	Ethics	CodeCode Available	0
None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering	Mar 3, 2025	Business EthicsEthics	—Unverified	0
Cyber for AI at SemEval-2025 Task 4: Forgotten but Not Lost: The Balancing Act of Selective Unlearning in Large Language Models	Mar 2, 2025	Ethics	—Unverified	0
BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge	Mar 1, 2025	EthicsModel Selection	—Unverified	0
Mapping Trustworthiness in Large Language Models: A Bibliometric Analysis Bridging Theory to Practice	Feb 27, 2025	EthicsFairness	—Unverified	0
Measure of Morality: A Mathematical Theory of Egalitarian Ethics	Feb 25, 2025	EthicsPhilosophy	—Unverified	0
Dynamic LLM Routing and Selection based on User Preferences: Balancing Performance, Cost, and Ethics	Feb 23, 2025	Ethics	—Unverified	0
Revealing the Pragmatic Dilemma for Moral Reasoning Acquisition in Language Models	Feb 23, 2025	Ethics	—Unverified	0
Multi-Agent Risks from Advanced AI	Feb 19, 2025	Ethics	—Unverified	0
Toward Robust Non-Transferable Learning: A Survey and Benchmark	Feb 19, 2025	EthicsSurvey	CodeCode Available	0

Show:10 25 50

← PrevPage 9 of 84Next →

All datasets ETHICS Ethics (per ethics)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RuGPT-3 Large	Accuracy	68.6	—	Unverified
2	RuGPT-3 Meduim	Accuracy	68.3	—	Unverified
3	RuGPT-3 Small	Accuracy	55.5	—	Unverified
4	Human benchmark	Accuracy	52.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Human benchmark	Accuracy	67.6	—	Unverified
2	RuGPT-3 Small	Accuracy	60.9	—	Unverified
3	RuGPT-3 Large	Accuracy	44.9	—	Unverified
4	RuGPT-3 Medium	Accuracy	44.1	—	Unverified