SOTAVerified

Ethics

Papers

Showing 76100 of 832 papers

TitleStatusHype
Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 20300
Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt0
AI-powered virtual eye: perspective, challenges and opportunities0
Uncertain Machine Ethics Planning0
The Cognitive Foundations of Economic Exchange: A Modular Framework Grounded in Behavioral Evidence0
The GenAI Generation: Student Views of Awareness, Preparedness, and Concern0
Securing the Future of IVR: AI-Driven Innovation with Agile Security, Data Regulation, and Ethical AI Integration0
Federated learning, ethics, and the double black box problem in medical AI0
Generative AI in Education: Student Skills and Lecturer Roles0
The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach0
AI Ethics and Social Norms: Exploring ChatGPT's Capabilities From What to How0
Evaluation Framework for AI Systems in "the Wild"0
Approaches to Responsible Governance of GenAI in Organizations0
Achieving Distributive Justice in Federated Learning via Uncertainty QuantificationCode0
Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions0
Giving AI a voice: how does AI think it should be treated?0
FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models0
Framework, Standards, Applications and Best practices of Responsible AI : A Comprehensive Survey0
How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses0
Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models0
LOKA Protocol: A Decentralized Framework for Trustworthy and Ethical AI Agent Ecosystems0
Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks0
Artificial Intelligence and the Dual Paradoxes: Examining the Interplay of Efficiency, Resource Consumption, and Labor Dynamics0
Surveying Professional Writers on AI: Limitations, Expectations, and FearsCode0
A moving target in AI-assisted decision-making: Dataset shift, model updating, and the problem of update opacity0
Show:102550
← PrevPage 4 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified