SOTAVerified

Ethics

Papers

Showing 201-225 of 832 papers

Title | Status | Hype
Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions | - | 0
Students' Perceptions and Use of Generative AI Tools for Programming Across Different Computing Courses | - | 0
Behavior Matters: An Alternative Perspective on Promoting Responsible Data Science | - | 0
Moral Alignment for LLM Agents | - | 0
Tesla's Autopilot: Ethics and Tragedy | - | 0
RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert Collaboration | Code | 0
Brain Surgery: Ensuring GDPR Compliance in Large Language Models via Concept Erasure | - | 0
Generative AI Carries Non-Democratic Biases and Stereotypes: Representation of Women, Black Individuals, Age Groups, and People with Disability in AI-Generated Images across Occupations | - | 0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models | Code | 0
ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs | - | 0
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | - | 0
Generative AI for Requirements Engineering: A Systematic Literature Review | - | 0
Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions | - | 0
Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics | - | 0
3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos | - | 0
Can Large Language Models Replace Human Subjects? A Large-Scale Replication of Scenario-Based Experiments in Psychology and Management | - | 0
A Survey for Large Language Models in Biomedicine | - | 0
Ethical AI Governance: Methods for Evaluating Trustworthy AI | - | 0
Awes, Laws, and Flaws From Today's LLM Research | - | 0
Clinical Insights: A Comprehensive Review of Language Models in Medicine | - | 0
Beyond Labels: Aligning Large Language Models with Human-like Reasoning | Code | 0
AI-Driven Review Systems: Evaluating LLMs in Scalable and Bias-Aware Academic Reviews | - | 0
Balancing Innovation and Ethics in AI-Driven Software Development | - | 0
ACL Ready: RAG Based Assistant for the ACL Checklist | Code | 0
A Conceptual Framework for Ethical Evaluation of Machine Learning Systems | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | RuGPT-3 Large | Accuracy | 68.6 | - | Unverified
2 | RuGPT-3 Medium | Accuracy | 68.3 | - | Unverified
3 | RuGPT-3 Small | Accuracy | 55.5 | - | Unverified
4 | Human benchmark | Accuracy | 52.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Human benchmark | Accuracy | 67.6 | - | Unverified
2 | RuGPT-3 Small | Accuracy | 60.9 | - | Unverified
3 | RuGPT-3 Large | Accuracy | 44.9 | - | Unverified
4 | RuGPT-3 Medium | Accuracy | 44.1 | - | Unverified