SOTAVerified

Ethics

Papers

Showing 126-150 of 832 papers

Title | Status | Hype
Cyber for AI at SemEval-2025 Task 4: Forgotten but Not Lost: The Balancing Act of Selective Unlearning in Large Language Models | - | 0
BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge | - | 0
Mapping Trustworthiness in Large Language Models: A Bibliometric Analysis Bridging Theory to Practice | - | 0
Measure of Morality: A Mathematical Theory of Egalitarian Ethics | - | 0
Revealing the Pragmatic Dilemma for Moral Reasoning Acquisition in Language Models | - | 0
Dynamic LLM Routing and Selection based on User Preferences: Balancing Performance, Cost, and Ethics | - | 0
Multi-Agent Risks from Advanced AI | - | 0
The 20 Laws of AI Power: Mastering the Future of Autonomous Intelligence | - | 0
Toward Robust Non-Transferable Learning: A Survey and Benchmark | Code | 0
Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making | - | 0
Coarse Set Theory for AI Ethics and Decision-Making: A Mathematical Framework for Granular Evaluations | - | 0
Fairness in Agentic AI: A Unified Framework for Ethical and Equitable Multi-Agent System | - | 0
Educating a Responsible AI Workforce: Piloting a Curricular Module on AI Policy in a Graduate Machine Learning Course | - | 0
The Odyssey of the Fittest: Can Agents Survive and Still Be Good? | Code | 0
ApplE: An Applied Ethics Ontology with Event Context | Code | 0
Control Search Rankings, Control the World: What is a Good Search Engine? | - | 0
Ethical Considerations for the Military Use of Artificial Intelligence in Visual Reconnaissance | - | 0
Superhuman Game AI Disclosure: Expertise and Context Moderate Effects on Trust and Fairness | - | 0
Unlocking the Black Box: Analysing the EU Artificial Intelligence Act's Framework for Explainability in AI | - | 0
Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) | - | 0
Development of Application-Specific Large Language Models to Facilitate Research Ethics Review | - | 0
Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude | Code | 0
How to Avoid Both the Repugnant and Sadistic Conclusions without Dropping Standard Axioms in Population Economics | - | 0
AI Toolkit: Libraries and Essays for Exploring the Technology and Ethics of AI | - | 0
Stylomech: Unveiling Authorship via Computational Stylometry in English and Romanized Sinhala | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | RuGPT-3 Large | Accuracy | 68.6 | - | Unverified
2 | RuGPT-3 Medium | Accuracy | 68.3 | - | Unverified
3 | RuGPT-3 Small | Accuracy | 55.5 | - | Unverified
4 | Human benchmark | Accuracy | 52.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Human benchmark | Accuracy | 67.6 | - | Unverified
2 | RuGPT-3 Small | Accuracy | 60.9 | - | Unverified
3 | RuGPT-3 Large | Accuracy | 44.9 | - | Unverified
4 | RuGPT-3 Medium | Accuracy | 44.1 | - | Unverified