SOTAVerified

An Evaluation of GPT-4 on the ETHICS Dataset

2023-09-19Unverified0· sign in to hype

Sergey Rodionov, Zarathustra Amadeus Goertzel, Ben Goertzel

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This report summarizes a short study of the performance of GPT-4 on the ETHICS dataset. The ETHICS dataset consists of five sub-datasets covering different fields of ethics: Justice, Deontology, Virtue Ethics, Utilitarianism, and Commonsense Ethics. The moral judgments were curated so as to have a high degree of agreement with the aim of representing shared human values rather than moral dilemmas. GPT-4's performance is much better than that of previous models and suggests that learning to work with common human values is not the hard problem for AI ethics.

Tasks

Reproductions