SOTAVerified

Ethics

Papers

Showing 76100 of 832 papers

TitleStatusHype
SciFi-Benchmark: How Would AI-Powered Robots Behave in Science Fiction Literature?0
Hedonic Adaptation in the Age of AI: A Perspective on Diminishing Satisfaction Returns in Technology Adoption0
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies0
Dubito Ergo Sum: Exploring AI Ethics0
Decoding the Black Box: Integrating Moral Imagination with Technical AI Governance0
Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering PromptsCode0
None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering0
Cyber for AI at SemEval-2025 Task 4: Forgotten but Not Lost: The Balancing Act of Selective Unlearning in Large Language Models0
BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge0
Mapping Trustworthiness in Large Language Models: A Bibliometric Analysis Bridging Theory to Practice0
Measure of Morality: A Mathematical Theory of Egalitarian Ethics0
Revealing the Pragmatic Dilemma for Moral Reasoning Acquisition in Language Models0
Dynamic LLM Routing and Selection based on User Preferences: Balancing Performance, Cost, and Ethics0
Multi-Agent Risks from Advanced AI0
The 20 Laws of AI Power: Mastering the Future of Autonomous Intelligence0
Toward Robust Non-Transferable Learning: A Survey and BenchmarkCode0
Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making0
Educating a Responsible AI Workforce: Piloting a Curricular Module on AI Policy in a Graduate Machine Learning Course0
Fairness in Agentic AI: A Unified Framework for Ethical and Equitable Multi-Agent System0
Coarse Set Theory for AI Ethics and Decision-Making: A Mathematical Framework for Granular Evaluations0
The Odyssey of the Fittest: Can Agents Survive and Still Be Good?Code0
ApplE: An Applied Ethics Ontology with Event ContextCode0
Ethical Considerations for the Military Use of Artificial Intelligence in Visual Reconnaissance0
Control Search Rankings, Control the World: What is a Good Search Engine?0
Superhuman Game AI Disclosure: Expertise and Context Moderate Effects on Trust and Fairness0
Show:102550
← PrevPage 4 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified