SOTAVerified

Ethics

Papers

Showing 51-75 of 832 papers

Title | Status | Hype
Modeling Emotions and Ethics with Large Language Models | Code | 0
More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness | Code | 0
MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning | Code | 0
Informed AI Regulation: Comparing the Ethical Frameworks of Leading LLM Chatbots Using an Ethics-Based Audit to Assess Moral Reasoning and Normative Values | Code | 0
Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale | Code | 0
HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation | Code | 0
How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Code | 0
Learning Human Action Recognition Representations Without Real Humans | Code | 0
MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation | Code | 0
Semantics derived automatically from language corpora contain human-like biases | Code | 0
RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert Collaboration | Code | 0
Responsible Design Patterns for Machine Learning Pipelines | Code | 0
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models | Code | 0
Exploring and steering the moral compass of Large Language Models | Code | 0
Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model Multiplicity | Code | 0
ACL Ready: RAG Based Assistant for the ACL Checklist | Code | 0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models | Code | 0
Achieving Distributive Justice in Federated Learning via Uncertainty Quantification | Code | 0
EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval | Code | 0
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions? | Code | 0
Decorrelation using Optimal Transport | Code | 0
Data Defenses Against Large Language Models | Code | 0
Defining a Sandbox for Responsible AI | Code | 0
CleftGAN: Adapting A Style-Based Generative Adversarial Network To Create Images Depicting Cleft Lip Deformity | Code | 0
Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude | Code | 0
Page 3 of 34

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | RuGPT-3 Large | Accuracy | 68.6 | - | Unverified
2 | RuGPT-3 Medium | Accuracy | 68.3 | - | Unverified
3 | RuGPT-3 Small | Accuracy | 55.5 | - | Unverified
4 | Human benchmark | Accuracy | 52.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Human benchmark | Accuracy | 67.6 | - | Unverified
2 | RuGPT-3 Small | Accuracy | 60.9 | - | Unverified
3 | RuGPT-3 Large | Accuracy | 44.9 | - | Unverified
4 | RuGPT-3 Medium | Accuracy | 44.1 | - | Unverified