SOTAVerified

Ethics

Papers

Showing 201–250 of 832 papers

| Title | Status | Hype |
| --- | --- | --- |
| Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions | | 0 |
| Students' Perceptions and Use of Generative AI Tools for Programming Across Different Computing Courses | | 0 |
| Behavior Matters: An Alternative Perspective on Promoting Responsible Data Science | | 0 |
| Moral Alignment for LLM Agents | | 0 |
| Tesla's Autopilot: Ethics and Tragedy | | 0 |
| RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert Collaboration | Code | 0 |
| Brain Surgery: Ensuring GDPR Compliance in Large Language Models via Concept Erasure | | 0 |
| Generative AI Carries Non-Democratic Biases and Stereotypes: Representation of Women, Black Individuals, Age Groups, and People with Disability in AI-Generated Images across Occupations | | 0 |
| Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models | Code | 0 |
| ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs | | 0 |
| MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | | 0 |
| Generative AI for Requirements Engineering: A Systematic Literature Review | | 0 |
| Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions | | 0 |
| Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics | | 0 |
| 3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos | | 0 |
| Can Large Language Models Replace Human Subjects? A Large-Scale Replication of Scenario-Based Experiments in Psychology and Management | | 0 |
| A Survey for Large Language Models in Biomedicine | | 0 |
| Ethical AI Governance: Methods for Evaluating Trustworthy AI | | 0 |
| Awes, Laws, and Flaws From Today's LLM Research | | 0 |
| Clinical Insights: A Comprehensive Review of Language Models in Medicine | | 0 |
| Beyond Labels: Aligning Large Language Models with Human-like Reasoning | Code | 0 |
| AI-Driven Review Systems: Evaluating LLMs in Scalable and Bias-Aware Academic Reviews | | 0 |
| Balancing Innovation and Ethics in AI-Driven Software Development | | 0 |
| ACL Ready: RAG Based Assistant for the ACL Checklist | Code | 0 |
| A Conceptual Framework for Ethical Evaluation of Machine Learning Systems | | 0 |
| Responsible AI Question Bank: A Comprehensive Tool for AI Risk Assessment | | 0 |
| On the Limitations and Prospects of Machine Unlearning for Generative AI | | 0 |
| Interactive embodied evolution for socially adept Artificial General Creatures | | 0 |
| An evidence-based methodology for human rights impact assessment (HRIA) in the development of AI data-intensive systems | | 0 |
| Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review | | 0 |
| FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications | | 0 |
| Why Machines Can't Be Moral: Turing's Halting Problem and the Moral Limits of Artificial Intelligence | | 0 |
| Virtue Ethics For Ethically Tunable Robotic Assistants | | 0 |
| Arondight: Red Teaming Large Vision Language Models with Auto-generated Multi-modal Jailbreak Prompts | | 0 |
| Report on the Conference on Ethical and Responsible Design in the National AI Institutes: A Summary of Challenges | | 0 |
| Reducing Barriers to the Use of Marginalised Music Genres in AI | | 0 |
| Assurance of AI Systems From a Dependability Perspective | | 0 |
| Ethics of Generating Synthetic MRI Vocal Tract Views from the Face | | 0 |
| Why should we ever automate moral decision making? | | 0 |
| Promoting AI Competencies for Medical Students: A Scoping Review on Frameworks, Programs, and Tools | | 0 |
| Advancements in Recommender Systems: A Comprehensive Analysis Based on Data, Algorithms, and Evaluation | | 0 |
| Challenges and Best Practices in Corporate AI Governance: Lessons from the Biopharmaceutical Industry | | 0 |
| The Switch, the Ladder, and the Matrix: Models for Classifying AI Systems | | 0 |
| Some Issues in Predictive Ethics Modeling: An Annotated Contrast Set of "Moral Stories" | Code | 0 |
| Reinforcement Learning and Machine ethics: a systematic review | | 0 |
| The African Woman is Rhythmic and Soulful: An Investigation of Implicit Biases in LLM Open-ended Text Generation | | 0 |
| SecGenAI: Enhancing Security of Cloud-based Generative AI Applications within Australian Critical Technologies of National Interest | | 0 |
| AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations | | 0 |
| Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models? | | 0 |
| Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing | | 0 |

Benchmark Results

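| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RuGPT-3 Large | Accuracy | 68.6 | | Unverified |
| 2 | RuGPT-3 Medium | Accuracy | 68.3 | | Unverified |
| 3 | RuGPT-3 Small | Accuracy | 55.5 | | Unverified |
| 4 | Human benchmark | Accuracy | 52.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Human benchmark | Accuracy | 67.6 | | Unverified |
| 2 | RuGPT-3 Small | Accuracy | 60.9 | | Unverified |
| 3 | RuGPT-3 Large | Accuracy | 44.9 | | Unverified |
| 4 | RuGPT-3 Medium | Accuracy | 44.1 | | Unverified |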