Ethics

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 832 papers

Title	Date	Tasks	Status	Hype
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment	Apr 13, 2023	Ethics	CodeCode Available	5
TrustLLM: Trustworthiness in Large Language Models	Jan 10, 2024	EthicsFairness	CodeCode Available	4
Visual Large Language Models for Generalized and Specialized Applications	Jan 6, 2025	Ethics	CodeCode Available	3
A Survey on Evaluation of Large Language Models	Jul 6, 2023	EthicsSurvey	CodeCode Available	3
How Can Recommender Systems Benefit from Large Language Models: A Survey	Jun 9, 2023	EthicsFeature Engineering	CodeCode Available	3
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook	Oct 11, 2024	EthicsFairness	CodeCode Available	2
PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation	Jul 8, 2024	EthicsLanguage Modeling	CodeCode Available	2
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law	May 2, 2024	DiagnosticEthics	CodeCode Available	2
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs	Feb 8, 2024	Ethics	CodeCode Available	2
Data-Centric Foundation Models in Computational Healthcare: A Survey	Jan 4, 2024	EthicsSurvey	CodeCode Available	2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher	Aug 12, 2023	EthicsRed Teaming	CodeCode Available	2
Getting pwn'd by AI: Penetration Testing with Large Language Models	Jul 24, 2023	EthicsTask Planning	CodeCode Available	2
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract AlgebraAnachronisms	CodeCode Available	2
Aligning AI With Shared Human Values	Aug 5, 2020	Ethicsreinforcement-learning	CodeCode Available	2
XTRUST: On the Multilingual Trustworthiness of Large Language Models	Sep 24, 2024	EthicsFairness	CodeCode Available	1
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey	Aug 23, 2024	Ethics	CodeCode Available	1
Language Model Alignment in Multilingual Trolley Problems	Jul 2, 2024	Decision MakingEthics	CodeCode Available	1
MoralBench: Moral Evaluation of LLMs	Jun 6, 2024	Ethics	CodeCode Available	1
MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models	Mar 6, 2024	EthicsGeneral Knowledge	CodeCode Available	1
NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism	Feb 29, 2024	EthicsMultiple-choice	CodeCode Available	1
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models	Jan 29, 2024	EthicsMultiple-choice	CodeCode Available	1
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics	Oct 9, 2023	EthicsFairness	CodeCode Available	1
CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches	Sep 20, 2023	EthicsGraph Matching	CodeCode Available	1
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics	Sep 13, 2023	EthicsTruthfulQA	CodeCode Available	1
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection	Aug 21, 2023	Anomaly DetectionAttribute	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 34Next →

All datasets ETHICS Ethics (per ethics)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RuGPT-3 Large	Accuracy	68.6	—	Unverified
2	RuGPT-3 Meduim	Accuracy	68.3	—	Unverified
3	RuGPT-3 Small	Accuracy	55.5	—	Unverified
4	Human benchmark	Accuracy	52.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Human benchmark	Accuracy	67.6	—	Unverified
2	RuGPT-3 Small	Accuracy	60.9	—	Unverified
3	RuGPT-3 Large	Accuracy	44.9	—	Unverified
4	RuGPT-3 Medium	Accuracy	44.1	—	Unverified