SOTAVerified

Ethics

Papers

Showing 151175 of 832 papers

TitleStatusHype
Awes, Laws, and Flaws From Today's LLM Research0
XXAI: Towards eXplicitly eXplainable Artificial Intelligence0
A Word on Machine Ethics: A Response to Jiang et al. (2021)0
Balancing Innovation and Ethics in AI-Driven Software Development0
Balancing Power and Ethics: A Framework for Addressing Human Rights Concerns in Military AI0
Balancing Progress and Responsibility: A Synthesis of Sustainability Trade-Offs of AI-Based Systems0
Basic principles and concept design of a real-time clinical decision support system for managing medical emergencies on missions to Mars0
BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models0
Behavior Matters: An Alternative Perspective on Promoting Responsible Data Science0
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies0
Benchmarking Deepart Detection0
Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 20300
Beneficent Intelligence: A Capability Approach to Modeling Benefit, Assistance, and Associated Moral Failures through AI Systems0
Applying Transparency in Artificial Intelligence based Personalization Systems0
Best Practices in the Creation and Use of Emotion Lexicons0
Beyond Accuracy: A Critical Review of Fairness in Machine Learning for Mobile and Wearable Computing0
Beyond Bias and Compliance: Towards Individual Agency and Plurality of Ethics in AI0
Beyond Fairness Metrics: Roadblocks and Challenges for Ethical AI in Practice0
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing0
Beyond Keywords: A Context-based Hybrid Approach to Mining Ethical Concern-related App Reviews0
An Audit Framework for Adopting AI-Nudging on Children0
Beyond Near- and Long-Term: Towards a Clearer Account of Research Priorities in AI Ethics and Society0
Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories0
Towards interactive evaluations for interaction harms in human-AI systems0
A Universal Knowledge Model and Cognitive Architecture for Prototyping AGI0
Show:102550
← PrevPage 7 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified