SOTAVerified|Agents Browse Leaderboard About

Ethics

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 781–790 of 832 papers

Title	Date	Tasks	Status	Hype
Towards a multi-stakeholder value-based assessment framework for algorithmic systems	May 9, 2022	Ethics	CodeCode Available	0
How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities	Nov 15, 2023	EthicsFairness	CodeCode Available	0
MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation	Nov 2, 2022	counterfactualEthics	CodeCode Available	0
A Recommendation and Risk Classification System for Connecting Rough Sleepers to Essential Outreach Services	Jul 30, 2020	EthicsGeneral Classification	CodeCode Available	0
HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation	May 16, 2025	BenchmarkingEthics	CodeCode Available	0
Decorrelation using Optimal Transport	Jul 11, 2023	Binary ClassificationEthics	CodeCode Available	0
Surveying Professional Writers on AI: Limitations, Expectations, and Fears	Apr 7, 2025	EthicsMisinformation	CodeCode Available	0
ApplE: An Applied Ethics Ontology with Event Context	Feb 7, 2025	Ethics	CodeCode Available	0
What are People Talking about in #BlackLivesMatter and #StopAsianHate? Exploring and Categorizing Twitter Topics Emerging in Online Social Movements through the Latent Dirichlet Allocation Model	May 29, 2022	Ethics	CodeCode Available	0
Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts	Mar 3, 2025	Ethics	CodeCode Available	0

Show:10 25 50

← PrevPage 79 of 84Next →

All datasets ETHICS Ethics (per ethics)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RuGPT-3 Large	Accuracy	68.6	—	Unverified
2	RuGPT-3 Meduim	Accuracy	68.3	—	Unverified
3	RuGPT-3 Small	Accuracy	55.5	—	Unverified
4	Human benchmark	Accuracy	52.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Human benchmark	Accuracy	67.6	—	Unverified
2	RuGPT-3 Small	Accuracy	60.9	—	Unverified
3	RuGPT-3 Large	Accuracy	44.9	—	Unverified
4	RuGPT-3 Medium	Accuracy	44.1	—	Unverified