| GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Aug 12, 2023 | EthicsRed Teaming | CodeCode Available | 2 | 5 |
| Getting pwn'd by AI: Penetration Testing with Large Language Models | Jul 24, 2023 | EthicsTask Planning | CodeCode Available | 2 | 5 |
| Data-Centric Foundation Models in Computational Healthcare: A Survey | Jan 4, 2024 | EthicsSurvey | CodeCode Available | 2 | 5 |
| PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Jul 8, 2024 | EthicsLanguage Modeling | CodeCode Available | 2 | 5 |
| A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics | Oct 9, 2023 | EthicsFairness | CodeCode Available | 1 | 5 |
| Artificial Intelligence Ethics and Safety: practical tools for creating "good" models | Dec 14, 2021 | Ethics | CodeCode Available | 1 | 5 |
| Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis | Sep 17, 2021 | ArticlesEmotion Recognition | CodeCode Available | 1 | 5 |
| Ethics Sheets for AI Tasks | Jul 2, 2021 | ArticlesEmotion Recognition | CodeCode Available | 1 | 5 |
| Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark | Apr 6, 2023 | Decision MakingEthics | CodeCode Available | 1 | 5 |
| Can Machines Learn Morality? The Delphi Experiment | Oct 14, 2021 | DescriptiveEthics | CodeCode Available | 1 | 5 |