SOTAVerified

Ethics

Papers

Showing 151175 of 832 papers

TitleStatusHype
From Efficiency to Equity: Measuring Fairness in Preference Learning0
Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection FrameworkCode0
Computational Grounding of Responsibility Attribution and Anticipation in LTLf0
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language ModelsCode0
Data Defenses Against Large Language ModelsCode0
Building Better: Avoiding Pitfalls in Developing Language Resources when Data is Scarce0
A Comparative Analysis on Ethical Benchmarking in Large Language Models0
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 OutlookCode2
TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty SimulationsCode0
Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions0
Students' Perceptions and Use of Generative AI Tools for Programming Across Different Computing Courses0
Behavior Matters: An Alternative Perspective on Promoting Responsible Data Science0
Moral Alignment for LLM Agents0
Tesla's Autopilot: Ethics and Tragedy0
XTRUST: On the Multilingual Trustworthiness of Large Language ModelsCode1
RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert CollaborationCode0
Brain Surgery: Ensuring GDPR Compliance in Large Language Models via Concept Erasure0
Generative AI Carries Non-Democratic Biases and Stereotypes: Representation of Women, Black Individuals, Age Groups, and People with Disability in AI-Generated Images across Occupations0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language ModelsCode0
ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs0
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications0
Generative AI for Requirements Engineering: A Systematic Literature Review0
Introducing ELLIPS: An Ethics-Centered Approach to Research on LLM-Based Inference of Psychiatric Conditions0
Declarative Integration and Management of Large Language Models through Finite Automata: Application to Automation, Communication, and Ethics0
3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos0
Show:102550
← PrevPage 7 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified