SOTAVerified

Ethics

Papers

Showing 251275 of 832 papers

TitleStatusHype
Ethical considerations when planning, implementing and releasing health economic model software: a new proposal0
Towards a Formalisation of Value-based Actions and Consequentialist Ethics0
Specifying Agent Ethics (Blue Sky Ideas)0
The Ethics of AI in Education0
Antisocial Analagous Behavior, Alignment and Human Impact of Google AI Systems: Evaluating through the lens of modified Antisocial Behavior Criteria by Human Interaction, Independent LLM Analysis, and AI Self-Reflection0
Protected group bias and stereotypes in Large Language Models0
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression0
Embracing the Generative AI Revolution: Advancing Tertiary Education in Cybersecurity with GPT0
Safeguarding Marketing Research: The Generation, Identification, and Mitigation of AI-Fabricated Disinformation0
Evaluation Ethics of LLMs in Legal Domain0
Data Ethics Emergency Drill: A Toolbox for Discussing Responsible AI for Industry Teams0
AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic0
A Moral Imperative: The Need for Continual Superalignment of Large Language Models0
AI Ethics: A Bibliometric Analysis, Critical Issues, and Key Gaps0
Legally Binding but Unfair? Towards Assessing Fairness of Privacy Policies0
Responsible Artificial Intelligence: A Structured Literature Review0
And Then the Hammer Broke: Reflections on Machine Ethics from Feminist Philosophy of Science0
MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language ModelsCode1
The Case for Animal-Friendly AI0
NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese JournalismCode1
Artificial Intelligence and Diabetes Mellitus: An Inside Look Through the Retina0
Exploring ChatGPT and its Impact on Society0
AI Ethics and Governance in Practice: An Introduction0
AI Fairness in Practice0
Uncovering Latent Human Wellbeing in Language Model Embeddings0
Show:102550
← PrevPage 11 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified