| The State of AI Ethics Report (Volume 5) | Aug 9, 2021 | Ethics, Fairness | Unverified | 0 |
| The State of Documentation Practices of Third-party Machine Learning Models and Datasets | Dec 22, 2023 | Ethics | Unverified | 0 |
| The subtle language of exclusion: Identifying the Toxic Speech of Trans-exclusionary Radical Feminists | Jul 1, 2022 | Ethics | Unverified | 0 |
| The Switch, the Ladder, and the Matrix: Models for Classifying AI Systems | Jul 7, 2024 | Ethics | Unverified | 0 |
| The Tragedy of the AI Commons | Jun 9, 2020 | Ethics | Unverified | 0 |
| Modeling Users and Online Communities for Abuse Detection: A Position on Ethics and Explainability | Mar 31, 2021 | Abuse Detection, Abusive Language | Unverified | 0 |
| The Virtuous Machine - Old Ethics for New Technology? | Jun 27, 2018 | Autonomous Driving, Ethics | Unverified | 0 |
| Three Kinds of AI Ethics | Mar 24, 2025 | Ethics | Unverified | 0 |
| To Be Forgotten or To Be Fair: Unveiling Fairness Implications of Machine Unlearning Methods | Feb 7, 2023 | Ethics, Fairness | Unverified | 0 |
| Too sick for surveillance: Can federal HIV service data improve federal HIV surveillance efforts? | Apr 20, 2023 | Ethics | Unverified | 0 |
| Toward Constraint Compliant Goal Formulation and Planning | May 21, 2024 | Ethics | Unverified | 0 |
| Toward Ethical AIED | Mar 11, 2022 | Ethics | Unverified | 0 |
| Towards a Feminist Metaethics of AI | Nov 10, 2023 | Ethics | Unverified | 0 |
| Towards a Formalisation of Value-based Actions and Consequentialist Ethics | Mar 25, 2024 | Ethics | Unverified | 0 |
| Towards a Framework Combining Machine Ethics and Machine Explainability | Jan 3, 2019 | Decision Making, Ethics | Unverified | 0 |
| Towards a Governance Framework for Brain Data | Sep 24, 2021 | Ethics | Unverified | 0 |
| Towards AI Logic for Social Reasoning | Oct 9, 2021 | Ethics | Unverified | 0 |
| Towards an Accountable and Reproducible Federated Learning: A FactSheets Approach | Feb 25, 2022 | Ethics, Federated Learning | Unverified | 0 |
| Towards an Environmental Ethics of Artificial Intelligence | Dec 19, 2024 | Ethics | Unverified | 0 |
| Towards An Ethics-Audit Bot | Mar 29, 2021 | Ethics | Unverified | 0 |
| Designing monitoring strategies for deployed machine learning algorithms: navigating performativity through a causal lens | Nov 20, 2023 | Causal Inference, Ethics | Unverified | 0 |
| Towards a Practical Ethics of Generative AI in Creative Production Processes | Nov 18, 2024 | Ethics, Navigate | Unverified | 0 |
| Towards a Praxis for Intercultural Ethics in Explainable AI | Apr 24, 2023 | Ethics, Explainable Artificial Intelligence (XAI) | Unverified | 0 |
| Exploring and steering the moral compass of Large Language Models | May 27, 2024 | All, Decision Making | Code Available | 0 |
| EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval | Oct 2, 2023 | Ethics, Information Retrieval | Code Available | 0 |
| Morality is Non-Binary: Building a Pluralist Moral Sentence Embedding Space using Contrastive Learning | Jan 30, 2024 | Contrastive Learning, Ethics | Code Available | 0 |
| Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale | Jan 25, 2021 | Ethics | Code Available | 0 |
| Learning Human Action Recognition Representations Without Real Humans | Nov 10, 2023 | Action Recognition, Ethics | Code Available | 0 |
| ACL Ready: RAG Based Assistant for the ACL Checklist | Aug 7, 2024 | Ethics, Language Modeling | Code Available | 0 |
| More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness | Apr 29, 2024 | Ethics, Language Modelling | Code Available | 0 |
| Towards a multi-stakeholder value-based assessment framework for algorithmic systems | May 9, 2022 | Ethics | Code Available | 0 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | Ethics, Fairness | Code Available | 0 |
| MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation | Nov 2, 2022 | counterfactual, Ethics | Code Available | 0 |
| A Recommendation and Risk Classification System for Connecting Rough Sleepers to Essential Outreach Services | Jul 30, 2020 | Ethics, General Classification | Code Available | 0 |
| HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation | May 16, 2025 | Benchmarking, Ethics | Code Available | 0 |
| Decorrelation using Optimal Transport | Jul 11, 2023 | Binary Classification, Ethics | Code Available | 0 |
| Surveying Professional Writers on AI: Limitations, Expectations, and Fears | Apr 7, 2025 | Ethics, Misinformation | Code Available | 0 |
| ApplE: An Applied Ethics Ontology with Event Context | Feb 7, 2025 | Ethics | Code Available | 0 |
| What are People Talking about in #BlackLivesMatter and #StopAsianHate? Exploring and Categorizing Twitter Topics Emerging in Online Social Movements through the Latent Dirichlet Allocation Model | May 29, 2022 | Ethics | Code Available | 0 |
| Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts | Mar 3, 2025 | Ethics | Code Available | 0 |
| Data Defenses Against Large Language Models | Oct 17, 2024 | Ethics | Code Available | 0 |
| A Low-Cost Ethics Shaping Approach for Designing Reinforcement Learning Agents | Dec 12, 2017 | Ethics, reinforcement-learning | Code Available | 0 |
| A Group-Specific Approach to NLP for Hate Speech Detection | Apr 21, 2023 | Common Sense Reasoning, Ethics | Code Available | 0 |
| Semantics derived automatically from language corpora contain human-like biases | Aug 25, 2016 | BIG-bench Machine Learning, Ethics | Code Available | 0 |
| When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas | May 25, 2025 | Ethics, Navigate | Code Available | 0 |
| An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives | Oct 25, 2023 | Ethics, Experimental Design | Code Available | 0 |
| Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing | Apr 17, 2023 | Ethics, Survey | Code Available | 0 |
| A History of Philosophy in Colombia through Topic Modelling | Dec 5, 2024 | Articles, Ethics | Code Available | 0 |
| Informed AI Regulation: Comparing the Ethical Frameworks of Leading LLM Chatbots Using an Ethics-Based Audit to Assess Moral Reasoning and Normative Values | Jan 9, 2024 | Decision Making, Ethics | Code Available | 0 |
| TAPE: Assessing Few-shot Russian Language Understanding | Oct 23, 2022 | Adversarial Attack, Adversarial Text | Code Available | 0 |