SOTAVerified

Ethics

Papers

Showing 101125 of 832 papers

TitleStatusHype
Systematic Literature Review: Explainable AI Definitions and Challenges in Education0
Trapped by Expectations: Functional Fixedness in LLM-Enabled Chat Search0
The role of ethical consumption in promoting democratic sustainability: revisiting neoclassical economics through Kantian ethics0
AI Regulation and Capitalist Growth: Balancing Innovation, Ethics, and Global Governance0
BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models0
Are clinicians ethically obligated to disclose their use of medical machine learning systems to patients?0
When LLM Therapists Become Salespeople: Evaluating Large Language Models for Ethical Motivational Interviewing0
e-person Architecture and Framework for Human-AI Co-adventure Relationship0
Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories0
AI Identity, Empowerment, and Mindfulness in Mitigating Unethical AI Use0
Computational Thinking with Computer Vision: Developing AI Competency in an Introductory Computer Science Course0
Towards Responsible AI Music: an Investigation of Trustworthy Features for Creative Systems0
Three Kinds of AI Ethics0
CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings0
Generative AI for Software Architecture. Applications, Trends, Challenges, and Future Directions0
Privacy Ethics Alignment in AI: A Stakeholder-Centric Based Framework for Ethical AI0
Ethical AI for Young Digital Citizens: A Call to Action on Privacy Governance0
MinorBench: A hand-built benchmark for content-based risks for children0
SciFi-Benchmark: How Would AI-Powered Robots Behave in Science Fiction Literature?0
Hedonic Adaptation in the Age of AI: A Perspective on Diminishing Satisfaction Returns in Technology Adoption0
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies0
Decoding the Black Box: Integrating Moral Imagination with Technical AI Governance0
Dubito Ergo Sum: Exploring AI Ethics0
Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering PromptsCode0
None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering0
Show:102550
← PrevPage 5 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified