| Modeling Emotions and Ethics with Large Language Models | Apr 15, 2024 | Decision MakingEthics | CodeCode Available | 0 | 5 |
| More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness | Apr 29, 2024 | EthicsLanguage Modelling | CodeCode Available | 0 | 5 |
| MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning | Dec 30, 2021 | Active LearningEthics | CodeCode Available | 0 | 5 |
| Informed AI Regulation: Comparing the Ethical Frameworks of Leading LLM Chatbots Using an Ethics-Based Audit to Assess Moral Reasoning and Normative Values | Jan 9, 2024 | Decision MakingEthics | CodeCode Available | 0 | 5 |
| Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale | Jan 25, 2021 | Ethics | CodeCode Available | 0 | 5 |
| HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation | May 16, 2025 | BenchmarkingEthics | CodeCode Available | 0 | 5 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | EthicsFairness | CodeCode Available | 0 | 5 |
| Learning Human Action Recognition Representations Without Real Humans | Nov 10, 2023 | Action RecognitionEthics | CodeCode Available | 0 | 5 |
| MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation | Nov 2, 2022 | counterfactualEthics | CodeCode Available | 0 | 5 |
| Semantics derived automatically from language corpora contain human-like biases | Aug 25, 2016 | BIG-bench Machine LearningEthics | CodeCode Available | 0 | 5 |
| RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert Collaboration | Sep 23, 2024 | ChatbotEthics | CodeCode Available | 0 | 5 |
| Responsible Design Patterns for Machine Learning Pipelines | May 31, 2023 | EthicsManagement | CodeCode Available | 0 | 5 |
| Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models | Oct 17, 2024 | Ethics | CodeCode Available | 0 | 5 |
| Exploring and steering the moral compass of Large Language Models | May 27, 2024 | AllDecision Making | CodeCode Available | 0 | 5 |
| Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model Multiplicity | Mar 14, 2022 | EthicsFairness | CodeCode Available | 0 | 5 |
| ACL Ready: RAG Based Assistant for the ACL Checklist | Aug 7, 2024 | EthicsLanguage Modeling | CodeCode Available | 0 | 5 |
| Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models | Sep 19, 2024 | EthicsMultiple-choice | CodeCode Available | 0 | 5 |
| Achieving Distributive Justice in Federated Learning via Uncertainty Quantification | Apr 22, 2025 | EthicsFairness | CodeCode Available | 0 | 5 |
| EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval | Oct 2, 2023 | EthicsInformation Retrieval | CodeCode Available | 0 | 5 |
| Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions? | Jun 2, 2021 | EthicsFew-Shot Learning | CodeCode Available | 0 | 5 |
| Decorrelation using Optimal Transport | Jul 11, 2023 | Binary ClassificationEthics | CodeCode Available | 0 | 5 |
| Data Defenses Against Large Language Models | Oct 17, 2024 | Ethics | CodeCode Available | 0 | 5 |
| Defining a Sandbox for Responsible AI | Sep 25, 2018 | Ethics | CodeCode Available | 0 | 5 |
| CleftGAN: Adapting A Style-Based Generative Adversarial Network To Create Images Depicting Cleft Lip Deformity | Oct 12, 2023 | Data AugmentationEthics | CodeCode Available | 0 | 5 |
| Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude | Jan 17, 2025 | AttributeDecision Making | CodeCode Available | 0 | 5 |