SOTAVerified

Ethics

Papers

Showing 801832 of 832 papers

TitleStatusHype
On Measuring Gender Bias in Translation of Gender-neutral PronounsCode0
Zero-shot Visual Commonsense Immorality PredictionCode0
Progressive Generalization Risk Reduction for Data-Efficient Causal Effect EstimationCode0
Uncertain Machine Ethical Decisions Using Hypothetical RetrospectionCode0
A Logic-based Multi-agent System for Ethical Monitoring and Evaluation of DialoguesCode0
Teaching Tech to Talk: K-12 Conversational Artificial Intelligence Literacy Curriculum and Development ToolsCode0
A Framework for Understanding and Visualizing Strategies of RL AgentsCode0
Beyond Labels: Aligning Large Language Models with Human-like ReasoningCode0
Some Issues in Predictive Ethics Modeling: An Annotated Contrast Set of "Moral Stories"Code0
Achieving Distributive Justice in Federated Learning via Uncertainty QuantificationCode0
Responsible Design Patterns for Machine Learning PipelinesCode0
Toward Robust Non-Transferable Learning: A Survey and BenchmarkCode0
AI Ethics on Blockchain: Topic Analysis on Twitter Data for Blockchain SecurityCode0
The Odyssey of the Fittest: Can Agents Survive and Still Be Good?Code0
CleftGAN: Adapting A Style-Based Generative Adversarial Network To Create Images Depicting Cleft Lip DeformityCode0
Risks from Language Models for Automated Mental Healthcare: Ethics and Structure for ImplementationCode0
A Taxation Perspective for Fair Re-rankingCode0
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language ModelsCode0
Mining Disinformation and Fake News: Concepts, Methods, and Recent AdvancementsCode0
The Only Way is Ethics: A Guide to Ethical Research with Large Language ModelsCode0
RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert CollaborationCode0
Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model MultiplicityCode0
Modeling Emotions and Ethics with Large Language ModelsCode0
Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection FrameworkCode0
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?Code0
Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and ClaudeCode0
Towards Effective Paraphrasing for Information DisguiseCode0
MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active LearningCode0
Towards Empathic Deep Q-LearningCode0
Defining a Sandbox for Responsible AICode0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language ModelsCode0
TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty SimulationsCode0
Show:102550
← PrevPage 17 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified