SOTAVerified|Agents Browse Leaderboard About Blog

Ethics

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 832 papers

Title	Date	Tasks	Status	Hype	Score
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher	Aug 12, 2023	EthicsRed Teaming	CodeCode Available	2	5
Getting pwn'd by AI: Penetration Testing with Large Language Models	Jul 24, 2023	EthicsTask Planning	CodeCode Available	2	5
Data-Centric Foundation Models in Computational Healthcare: A Survey	Jan 4, 2024	EthicsSurvey	CodeCode Available	2	5
PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation	Jul 8, 2024	EthicsLanguage Modeling	CodeCode Available	2	5
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics	Oct 9, 2023	EthicsFairness	CodeCode Available	1	5
Artificial Intelligence Ethics and Safety: practical tools for creating "good" models	Dec 14, 2021	Ethics	CodeCode Available	1	5
Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis	Sep 17, 2021	ArticlesEmotion Recognition	CodeCode Available	1	5
Ethics Sheets for AI Tasks	Jul 2, 2021	ArticlesEmotion Recognition	CodeCode Available	1	5
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark	Apr 6, 2023	Decision MakingEthics	CodeCode Available	1	5
Can Machines Learn Morality? The Delphi Experiment	Oct 14, 2021	DescriptiveEthics	CodeCode Available	1	5

Show:10 25 50

← PrevPage 2 of 84Next →

All datasets ETHICS Ethics (per ethics)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RuGPT-3 Large	Accuracy	68.6	—	Unverified
2	RuGPT-3 Meduim	Accuracy	68.3	—	Unverified
3	RuGPT-3 Small	Accuracy	55.5	—	Unverified
4	Human benchmark	Accuracy	52.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Human benchmark	Accuracy	67.6	—	Unverified
2	RuGPT-3 Small	Accuracy	60.9	—	Unverified
3	RuGPT-3 Large	Accuracy	44.9	—	Unverified
4	RuGPT-3 Medium	Accuracy	44.1	—	Unverified