Ethics

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 832 papers

Title	Date	Tasks	Status	Hype	Score
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment	Apr 13, 2023	Ethics	CodeCode Available	5	5
TrustLLM: Trustworthiness in Large Language Models	Jan 10, 2024	EthicsFairness	CodeCode Available	4	5
A Survey on Evaluation of Large Language Models	Jul 6, 2023	EthicsSurvey	CodeCode Available	3	5
Visual Large Language Models for Generalized and Specialized Applications	Jan 6, 2025	Ethics	CodeCode Available	3	5
How Can Recommender Systems Benefit from Large Language Models: A Survey	Jun 9, 2023	EthicsFeature Engineering	CodeCode Available	3	5
Data-Centric Foundation Models in Computational Healthcare: A Survey	Jan 4, 2024	EthicsSurvey	CodeCode Available	2	5
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs	Feb 8, 2024	Ethics	CodeCode Available	2	5
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher	Aug 12, 2023	EthicsRed Teaming	CodeCode Available	2	5
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract AlgebraAnachronisms	CodeCode Available	2	5
Getting pwn'd by AI: Penetration Testing with Large Language Models	Jul 24, 2023	EthicsTask Planning	CodeCode Available	2	5
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law	May 2, 2024	DiagnosticEthics	CodeCode Available	2	5
Aligning AI With Shared Human Values	Aug 5, 2020	Ethicsreinforcement-learning	CodeCode Available	2	5
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook	Oct 11, 2024	EthicsFairness	CodeCode Available	2	5
PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation	Jul 8, 2024	EthicsLanguage Modeling	CodeCode Available	2	5
Teaching Software Engineering for AI-Enabled Systems	Jan 18, 2020	EthicsFairness	CodeCode Available	1	5
Ego4D: Around the World in 3,000 Hours of Egocentric Video	Oct 13, 2021	De-identificationEthics	CodeCode Available	1	5
Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis	Sep 17, 2021	ArticlesEmotion Recognition	CodeCode Available	1	5
Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governance	Jun 23, 2022	Data VisualizationEthics	CodeCode Available	1	5
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark	Apr 6, 2023	Decision MakingEthics	CodeCode Available	1	5
Evaluating the Clinical Realism of Synthetic Chest X-Rays Generated Using Progressively Growing GANs	Oct 7, 2020	Conditional Image GenerationData Augmentation	CodeCode Available	1	5
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection	Aug 21, 2023	Anomaly DetectionAttribute	CodeCode Available	1	5
Synthetically generated text for supervised text analysis	Mar 28, 2023	ArticlesEthics	CodeCode Available	1	5
Ethics Sheets for AI Tasks	Jul 2, 2021	ArticlesEmotion Recognition	CodeCode Available	1	5
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models	Jan 29, 2024	EthicsMultiple-choice	CodeCode Available	1	5
MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models	Mar 6, 2024	EthicsGeneral Knowledge	CodeCode Available	1	5
XTRUST: On the Multilingual Trustworthiness of Large Language Models	Sep 24, 2024	EthicsFairness	CodeCode Available	1	5
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes	Aug 20, 2020	DescriptiveEthics	CodeCode Available	1	5
Automated Kantian Ethics: A Faithful Implementation	Jul 20, 2022	Ethics	CodeCode Available	1	5
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey	Aug 23, 2024	Ethics	CodeCode Available	1	5
Can Machines Learn Morality? The Delphi Experiment	Oct 14, 2021	DescriptiveEthics	CodeCode Available	1	5
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics	Sep 13, 2023	EthicsTruthfulQA	CodeCode Available	1	5
PASS: An ImageNet replacement for self-supervised pretraining without humans	Sep 27, 2021	BenchmarkingEthics	CodeCode Available	1	5
NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism	Feb 29, 2024	EthicsMultiple-choice	CodeCode Available	1	5
Large Language Models to Identify Social Determinants of Health in Electronic Health Records	Aug 11, 2023	Adversarial RobustnessEthics	CodeCode Available	1	5
Deontological Ethics By Monotonicity Shape Constraints	Jan 31, 2020	EthicsFairness	CodeCode Available	1	5
MoralBench: Moral Evaluation of LLMs	Jun 6, 2024	Ethics	CodeCode Available	1	5
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics	Oct 9, 2023	EthicsFairness	CodeCode Available	1	5
Artificial Intelligence Ethics and Safety: practical tools for creating "good" models	Dec 14, 2021	Ethics	CodeCode Available	1	5
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N	Aug 15, 2022	EthicsMulti-agent Reinforcement Learning	CodeCode Available	1	5
CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches	Sep 20, 2023	EthicsGraph Matching	CodeCode Available	1	5
Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models	Jun 5, 2023	Brain Tumor SegmentationEthics	CodeCode Available	1	5
Language Model Alignment in Multilingual Trolley Problems	Jul 2, 2024	Decision MakingEthics	CodeCode Available	1	5
VERB: Visualizing and Interpreting Bias Mitigation Techniques for Word Representations	Apr 6, 2021	Decision MakingDimensionality Reduction	CodeCode Available	1	5
A Framework for Understanding and Visualizing Strategies of RL Agents	Aug 17, 2022	EthicsStarcraft	CodeCode Available	0	5
Exploring and steering the moral compass of Large Language Models	May 27, 2024	AllDecision Making	CodeCode Available	0	5
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models	Oct 17, 2024	Ethics	CodeCode Available	0	5
Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model Multiplicity	Mar 14, 2022	EthicsFairness	CodeCode Available	0	5
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?	Jun 2, 2021	EthicsFew-Shot Learning	CodeCode Available	0	5
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models	Sep 19, 2024	EthicsMultiple-choice	CodeCode Available	0	5
HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation	May 16, 2025	BenchmarkingEthics	CodeCode Available	0	5

Show:10 25 50

← PrevPage 1 of 17Next →

All datasets ETHICS Ethics (per ethics)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RuGPT-3 Large	Accuracy	68.6	—	Unverified
2	RuGPT-3 Meduim	Accuracy	68.3	—	Unverified
3	RuGPT-3 Small	Accuracy	55.5	—	Unverified
4	Human benchmark	Accuracy	52.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Human benchmark	Accuracy	67.6	—	Unverified
2	RuGPT-3 Small	Accuracy	60.9	—	Unverified
3	RuGPT-3 Large	Accuracy	44.9	—	Unverified
4	RuGPT-3 Medium	Accuracy	44.1	—	Unverified