SOTAVerified

Ethics

Papers

Showing 150 of 832 papers

Title | Status | Hype
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment | Code | 5
TrustLLM: Trustworthiness in Large Language Models | Code | 4
Visual Large Language Models for Generalized and Specialized Applications | Code | 3
A Survey on Evaluation of Large Language Models | Code | 3
How Can Recommender Systems Benefit from Large Language Models: A Survey | Code | 3
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Code | 2
PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Code | 2
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | Code | 2
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs | Code | 2
Data-Centric Foundation Models in Computational Healthcare: A Survey | Code | 2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Code | 2
Getting pwn'd by AI: Penetration Testing with Large Language Models | Code | 2
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2
Aligning AI With Shared Human Values | Code | 2
XTRUST: On the Multilingual Trustworthiness of Large Language Models | Code | 1
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey | Code | 1
Language Model Alignment in Multilingual Trolley Problems | Code | 1
MoralBench: Moral Evaluation of LLMs | Code | 1
MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models | Code | 1
NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism | Code | 1
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models | Code | 1
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics | Code | 1
CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches | Code | 1
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics | Code | 1
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection | Code | 1
Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Code | 1
Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Code | 1
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark | Code | 1
Synthetically generated text for supervised text analysis | Code | 1
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N | Code | 1
Automated Kantian Ethics: A Faithful Implementation | Code | 1
Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governance | Code | 1
Artificial Intelligence Ethics and Safety: practical tools for creating "good" models | Code | 1
Can Machines Learn Morality? The Delphi Experiment | Code | 1
Ego4D: Around the World in 3,000 Hours of Egocentric Video | Code | 1
PASS: An ImageNet replacement for self-supervised pretraining without humans | Code | 1
Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis | Code | 1
Ethics Sheets for AI Tasks | Code | 1
VERB: Visualizing and Interpreting Bias Mitigation Techniques for Word Representations | Code | 1
Evaluating the Clinical Realism of Synthetic Chest X-Rays Generated Using Progressively Growing GANs | Code | 1
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes | Code | 1
Deontological Ethics By Monotonicity Shape Constraints | Code | 1
Teaching Software Engineering for AI-Enabled Systems | Code | 1
The Ethical Implications of AI in Creative Industries: A Focus on AI-Generated Art | - | 0
Feeling Machines: Ethics, Culture, and the Rise of Emotional AI | - | 0
Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics | - | 0
SocialCredit+ | - | 0
"I Hadn't Thought About That": Creators of Human-like AI Weigh in on Ethics And Neurodivergence | - | 0
Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations | - | 0
MIRA: Medical Time Series Foundation Model for Real-World Health Data | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | RuGPT-3 Large | Accuracy | 68.6 | - | Unverified
2 | RuGPT-3 Medium | Accuracy | 68.3 | - | Unverified
3 | RuGPT-3 Small | Accuracy | 55.5 | - | Unverified
4 | Human benchmark | Accuracy | 52.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Human benchmark | Accuracy | 67.6 | - | Unverified
2 | RuGPT-3 Small | Accuracy | 60.9 | - | Unverified
3 | RuGPT-3 Large | Accuracy | 44.9 | - | Unverified
4 | RuGPT-3 Medium | Accuracy | 44.1 | - | Unverified