SOTAVerified

Ethics

Papers

Showing 150 of 832 papers

TitleStatusHype
RAFT: Reward rAnked FineTuning for Generative Foundation Model AlignmentCode5
TrustLLM: Trustworthiness in Large Language ModelsCode4
A Survey on Evaluation of Large Language ModelsCode3
Visual Large Language Models for Generalized and Specialized ApplicationsCode3
How Can Recommender Systems Benefit from Large Language Models: A SurveyCode3
Data-Centric Foundation Models in Computational Healthcare: A SurveyCode2
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMsCode2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via CipherCode2
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Getting pwn'd by AI: Penetration Testing with Large Language ModelsCode2
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and LawCode2
Aligning AI With Shared Human ValuesCode2
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 OutlookCode2
PsycoLLM: Enhancing LLM for Psychological Understanding and EvaluationCode2
Teaching Software Engineering for AI-Enabled SystemsCode1
Ego4D: Around the World in 3,000 Hours of Egocentric VideoCode1
Ethics Sheet for Automatic Emotion Recognition and Sentiment AnalysisCode1
Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governanceCode1
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI BenchmarkCode1
Evaluating the Clinical Realism of Synthetic Chest X-Rays Generated Using Progressively Growing GANsCode1
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly DetectionCode1
Synthetically generated text for supervised text analysisCode1
Ethics Sheets for AI TasksCode1
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language ModelsCode1
MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language ModelsCode1
XTRUST: On the Multilingual Trustworthiness of Large Language ModelsCode1
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life AnecdotesCode1
Automated Kantian Ethics: A Faithful ImplementationCode1
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive SurveyCode1
Can Machines Learn Morality? The Delphi ExperimentCode1
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and EthicsCode1
PASS: An ImageNet replacement for self-supervised pretraining without humansCode1
NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese JournalismCode1
Large Language Models to Identify Social Determinants of Health in Electronic Health RecordsCode1
Deontological Ethics By Monotonicity Shape ConstraintsCode1
MoralBench: Moral Evaluation of LLMsCode1
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsCode1
Artificial Intelligence Ethics and Safety: practical tools for creating "good" modelsCode1
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-NCode1
CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning ApproachesCode1
Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion modelsCode1
Language Model Alignment in Multilingual Trolley ProblemsCode1
VERB: Visualizing and Interpreting Bias Mitigation Techniques for Word RepresentationsCode1
A Framework for Understanding and Visualizing Strategies of RL AgentsCode0
Exploring and steering the moral compass of Large Language ModelsCode0
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language ModelsCode0
Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model MultiplicityCode0
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?Code0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language ModelsCode0
HumaniBench: A Human-Centric Framework for Large Multimodal Models EvaluationCode0
Show:102550
← PrevPage 1 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified