SOTAVerified

Ethics

Papers

Showing 150 of 832 papers

TitleStatusHype
The Ethical Implications of AI in Creative Industries: A Focus on AI-Generated Art0
Feeling Machines: Ethics, Culture, and the Rise of Emotional AI0
Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics0
"I Hadn't Thought About That": Creators of Human-like AI Weigh in on Ethics And Neurodivergence0
SocialCredit+0
Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations0
MIRA: Medical Time Series Foundation Model for Real-World Health Data0
Surgeons Awareness, Expectations, and Involvement with Artificial Intelligence: a Survey Pre and Post the GPT Era0
FG 2025 TrustFAA: the First Workshop on Towards Trustworthy Facial Affect Analysis: Advancing Insights of Fairness, Explainability, and Safety (TrustFAA)0
A Comprehensive Study on Medical Image Segmentation using Deep Neural Networks0
Multi Layered Autonomy and AI Ecologies in Robotic Art Installations0
HADA: Human-AI Agent Decision Alignment Architecture0
Higher-Order Responsibility0
Position: Olfaction Standardization is Essential for the Advancement of Embodied Artificial Intelligence0
Bottom-Up Perspectives on AI Governance: Insights from User Reviews of AI Products0
Responsible Data Stewardship: Generative AI and the Digital Waste Problem0
My Answer Is NOT 'Fair': Mitigating Social Bias in Vision-Language Models via Fair and Biased Residuals0
Ten Principles of AI Agent Economics0
When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social DilemmasCode0
Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models0
The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas0
A Toolkit for Compliance, a Toolkit for Justice: Drawing on Cross-sectoral Expertise to Develop a Pro-justice EU AI Act Toolkit0
AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals0
A Participatory Strategy for AI Ethics in Education and Rehabilitation grounded in the Capability Approach0
Internal and External Impacts of Natural Language Processing Papers0
Inter(sectional) Alia(s): Ambiguity in Voice Agent Identity via Intersectional Japanese Self-Referents0
More-than-Human Storytelling: Designing Longitudinal Narrative Engagements with Generative AI0
Exploring Moral Exercises for Human Oversight of AI systems: Insights from Three Pilot Studies0
Kaleidoscope Gallery: Exploring Ethics and Generative AI Through Art0
HumaniBench: A Human-Centric Framework for Large Multimodal Models EvaluationCode0
Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach0
Clicking some of the silly options: Exploring Player Motivation in Static and Dynamic Educational Interactive Narratives0
Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 20300
Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt0
AI-powered virtual eye: perspective, challenges and opportunities0
Uncertain Machine Ethics Planning0
The Cognitive Foundations of Economic Exchange: A Modular Framework Grounded in Behavioral Evidence0
The GenAI Generation: Student Views of Awareness, Preparedness, and Concern0
Securing the Future of IVR: AI-Driven Innovation with Agile Security, Data Regulation, and Ethical AI Integration0
Federated learning, ethics, and the double black box problem in medical AI0
Generative AI in Education: Student Skills and Lecturer Roles0
The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach0
AI Ethics and Social Norms: Exploring ChatGPT's Capabilities From What to How0
Approaches to Responsible Governance of GenAI in Organizations0
Evaluation Framework for AI Systems in "the Wild"0
Achieving Distributive Justice in Federated Learning via Uncertainty QuantificationCode0
Giving AI a voice: how does AI think it should be treated?0
Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions0
FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models0
Framework, Standards, Applications and Best practices of Responsible AI : A Comprehensive Survey0
Show:102550
← PrevPage 1 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified