SOTAVerified

Ethics

Papers

Showing 151200 of 832 papers

TitleStatusHype
From Efficiency to Equity: Measuring Fairness in Preference Learning0
Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection FrameworkCode0
Computational Grounding of Responsibility Attribution and Anticipation in LTLf0
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language ModelsCode0
Data Defenses Against Large Language ModelsCode0
Building Better: Avoiding Pitfalls in Developing Language Resources when Data is Scarce0
A Comparative Analysis on Ethical Benchmarking in Large Language Models0
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 OutlookCode2
TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty SimulationsCode0
Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions0
Students' Perceptions and Use of Generative AI Tools for Programming Across Different Computing Courses0
Behavior Matters: An Alternative Perspective on Promoting Responsible Data Science0
Moral Alignment for LLM Agents0
Tesla's Autopilot: Ethics and Tragedy0
XTRUST: On the Multilingual Trustworthiness of Large Language ModelsCode1
RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert CollaborationCode0
Brain Surgery: Ensuring GDPR Compliance in Large Language Models via Concept Erasure0
Generative AI Carries Non-Democratic Biases and Stereotypes: Representation of Women, Black Individuals, Age Groups, and People with Disability in AI-Generated Images across Occupations0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language ModelsCode0
ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs0
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications0
Generative AI for Requirements Engineering: A Systematic Literature Review0
Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions0
Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics0
3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos0
A Survey for Large Language Models in Biomedicine0
Can Large Language Models Replace Human Subjects? A Large-Scale Replication of Scenario-Based Experiments in Psychology and Management0
Ethical AI Governance: Methods for Evaluating Trustworthy AI0
Awes, Laws, and Flaws From Today's LLM Research0
Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive SurveyCode1
Clinical Insights: A Comprehensive Review of Language Models in Medicine0
Beyond Labels: Aligning Large Language Models with Human-like ReasoningCode0
AI-Driven Review Systems: Evaluating LLMs in Scalable and Bias-Aware Academic Reviews0
Balancing Innovation and Ethics in AI-Driven Software Development0
ACL Ready: RAG Based Assistant for the ACL ChecklistCode0
A Conceptual Framework for Ethical Evaluation of Machine Learning Systems0
Responsible AI Question Bank: A Comprehensive Tool for AI Risk Assessment0
On the Limitations and Prospects of Machine Unlearning for Generative AI0
Interactive embodied evolution for socially adept Artificial General Creatures0
An evidence-based methodology for human rights impact assessment (HRIA) in the development of AI data-intensive systems0
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review0
FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications0
Why Machines Can't Be Moral: Turing's Halting Problem and the Moral Limits of Artificial Intelligence0
Virtue Ethics For Ethically Tunable Robotic Assistants0
Arondight: Red Teaming Large Vision Language Models with Auto-generated Multi-modal Jailbreak Prompts0
Report on the Conference on Ethical and Responsible Design in the National AI Institutes: A Summary of Challenges0
Assurance of AI Systems From a Dependability Perspective0
Reducing Barriers to the Use of Marginalised Music Genres in AI0
Ethics of Generating Synthetic MRI Vocal Tract Views from the Face0
Advancements in Recommender Systems: A Comprehensive Analysis Based on Data, Algorithms, and Evaluation0
Show:102550
← PrevPage 4 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified