| Data Defenses Against Large Language Models | Oct 17, 2024 | Ethics | CodeCode Available | 0 |
| A Low-Cost Ethics Shaping Approach for Designing Reinforcement Learning Agents | Dec 12, 2017 | Ethicsreinforcement-learning | CodeCode Available | 0 |
| A Group-Specific Approach to NLP for Hate Speech Detection | Apr 21, 2023 | Common Sense ReasoningEthics | CodeCode Available | 0 |
| Semantics derived automatically from language corpora contain human-like biases | Aug 25, 2016 | BIG-bench Machine LearningEthics | CodeCode Available | 0 |
| When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas | May 25, 2025 | EthicsNavigate | CodeCode Available | 0 |
| An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives | Oct 25, 2023 | EthicsExperimental Design | CodeCode Available | 0 |
| Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing | Apr 17, 2023 | EthicsSurvey | CodeCode Available | 0 |
| A History of Philosophy in Colombia through Topic Modelling | Dec 5, 2024 | ArticlesEthics | CodeCode Available | 0 |
| Informed AI Regulation: Comparing the Ethical Frameworks of Leading LLM Chatbots Using an Ethics-Based Audit to Assess Moral Reasoning and Normative Values | Jan 9, 2024 | Decision MakingEthics | CodeCode Available | 0 |
| TAPE: Assessing Few-shot Russian Language Understanding | Oct 23, 2022 | Adversarial AttackAdversarial Text | CodeCode Available | 0 |