Ethics

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 832 papers

Title	Date	Tasks	Status	Hype	Score
XTRUST: On the Multilingual Trustworthiness of Large Language Models	Sep 24, 2024	EthicsFairness	CodeCode Available	1	5
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes	Aug 20, 2020	DescriptiveEthics	CodeCode Available	1	5
Automated Kantian Ethics: A Faithful Implementation	Jul 20, 2022	Ethics	CodeCode Available	1	5
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey	Aug 23, 2024	Ethics	CodeCode Available	1	5
Can Machines Learn Morality? The Delphi Experiment	Oct 14, 2021	DescriptiveEthics	CodeCode Available	1	5
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics	Sep 13, 2023	EthicsTruthfulQA	CodeCode Available	1	5
PASS: An ImageNet replacement for self-supervised pretraining without humans	Sep 27, 2021	BenchmarkingEthics	CodeCode Available	1	5
NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism	Feb 29, 2024	EthicsMultiple-choice	CodeCode Available	1	5
Large Language Models to Identify Social Determinants of Health in Electronic Health Records	Aug 11, 2023	Adversarial RobustnessEthics	CodeCode Available	1	5
Deontological Ethics By Monotonicity Shape Constraints	Jan 31, 2020	EthicsFairness	CodeCode Available	1	5
MoralBench: Moral Evaluation of LLMs	Jun 6, 2024	Ethics	CodeCode Available	1	5
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics	Oct 9, 2023	EthicsFairness	CodeCode Available	1	5
Artificial Intelligence Ethics and Safety: practical tools for creating "good" models	Dec 14, 2021	Ethics	CodeCode Available	1	5
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N	Aug 15, 2022	EthicsMulti-agent Reinforcement Learning	CodeCode Available	1	5
CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches	Sep 20, 2023	EthicsGraph Matching	CodeCode Available	1	5
Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models	Jun 5, 2023	Brain Tumor SegmentationEthics	CodeCode Available	1	5
Language Model Alignment in Multilingual Trolley Problems	Jul 2, 2024	Decision MakingEthics	CodeCode Available	1	5
VERB: Visualizing and Interpreting Bias Mitigation Techniques for Word Representations	Apr 6, 2021	Decision MakingDimensionality Reduction	CodeCode Available	1	5
A Framework for Understanding and Visualizing Strategies of RL Agents	Aug 17, 2022	EthicsStarcraft	CodeCode Available	0	5
Exploring and steering the moral compass of Large Language Models	May 27, 2024	AllDecision Making	CodeCode Available	0	5
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models	Oct 17, 2024	Ethics	CodeCode Available	0	5
Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model Multiplicity	Mar 14, 2022	EthicsFairness	CodeCode Available	0	5
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?	Jun 2, 2021	EthicsFew-Shot Learning	CodeCode Available	0	5
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models	Sep 19, 2024	EthicsMultiple-choice	CodeCode Available	0	5
HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation	May 16, 2025	BenchmarkingEthics	CodeCode Available	0	5

Show:10 25 50

← PrevPage 2 of 34Next →

All datasets ETHICS Ethics (per ethics)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RuGPT-3 Large	Accuracy	68.6	—	Unverified
2	RuGPT-3 Meduim	Accuracy	68.3	—	Unverified
3	RuGPT-3 Small	Accuracy	55.5	—	Unverified
4	Human benchmark	Accuracy	52.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Human benchmark	Accuracy	67.6	—	Unverified
2	RuGPT-3 Small	Accuracy	60.9	—	Unverified
3	RuGPT-3 Large	Accuracy	44.9	—	Unverified
4	RuGPT-3 Medium	Accuracy	44.1	—	Unverified