| RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment | Apr 13, 2023 | Ethics | CodeCode Available | 5 |
| TrustLLM: Trustworthiness in Large Language Models | Jan 10, 2024 | EthicsFairness | CodeCode Available | 4 |
| Visual Large Language Models for Generalized and Specialized Applications | Jan 6, 2025 | Ethics | CodeCode Available | 3 |
| A Survey on Evaluation of Large Language Models | Jul 6, 2023 | EthicsSurvey | CodeCode Available | 3 |
| How Can Recommender Systems Benefit from Large Language Models: A Survey | Jun 9, 2023 | EthicsFeature Engineering | CodeCode Available | 3 |
| On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Oct 11, 2024 | EthicsFairness | CodeCode Available | 2 |
| PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Jul 8, 2024 | EthicsLanguage Modeling | CodeCode Available | 2 |
| A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | May 2, 2024 | DiagnosticEthics | CodeCode Available | 2 |
| JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs | Feb 8, 2024 | Ethics | CodeCode Available | 2 |
| Data-Centric Foundation Models in Computational Healthcare: A Survey | Jan 4, 2024 | EthicsSurvey | CodeCode Available | 2 |
| GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Aug 12, 2023 | EthicsRed Teaming | CodeCode Available | 2 |
| Getting pwn'd by AI: Penetration Testing with Large Language Models | Jul 24, 2023 | EthicsTask Planning | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| Aligning AI With Shared Human Values | Aug 5, 2020 | Ethicsreinforcement-learning | CodeCode Available | 2 |
| XTRUST: On the Multilingual Trustworthiness of Large Language Models | Sep 24, 2024 | EthicsFairness | CodeCode Available | 1 |
| Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey | Aug 23, 2024 | Ethics | CodeCode Available | 1 |
| Language Model Alignment in Multilingual Trolley Problems | Jul 2, 2024 | Decision MakingEthics | CodeCode Available | 1 |
| MoralBench: Moral Evaluation of LLMs | Jun 6, 2024 | Ethics | CodeCode Available | 1 |
| MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models | Mar 6, 2024 | EthicsGeneral Knowledge | CodeCode Available | 1 |
| NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism | Feb 29, 2024 | EthicsMultiple-choice | CodeCode Available | 1 |
| E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models | Jan 29, 2024 | EthicsMultiple-choice | CodeCode Available | 1 |
| A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics | Oct 9, 2023 | EthicsFairness | CodeCode Available | 1 |
| CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches | Sep 20, 2023 | EthicsGraph Matching | CodeCode Available | 1 |
| Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics | Sep 13, 2023 | EthicsTruthfulQA | CodeCode Available | 1 |
| TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection | Aug 21, 2023 | Anomaly DetectionAttribute | CodeCode Available | 1 |
| Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Aug 11, 2023 | Adversarial RobustnessEthics | CodeCode Available | 1 |
| Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Jun 5, 2023 | Brain Tumor SegmentationEthics | CodeCode Available | 1 |
| Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark | Apr 6, 2023 | Decision MakingEthics | CodeCode Available | 1 |
| Synthetically generated text for supervised text analysis | Mar 28, 2023 | ArticlesEthics | CodeCode Available | 1 |
| AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N | Aug 15, 2022 | EthicsMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Automated Kantian Ethics: A Faithful Implementation | Jul 20, 2022 | Ethics | CodeCode Available | 1 |
| Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governance | Jun 23, 2022 | Data VisualizationEthics | CodeCode Available | 1 |
| Artificial Intelligence Ethics and Safety: practical tools for creating "good" models | Dec 14, 2021 | Ethics | CodeCode Available | 1 |
| Can Machines Learn Morality? The Delphi Experiment | Oct 14, 2021 | DescriptiveEthics | CodeCode Available | 1 |
| Ego4D: Around the World in 3,000 Hours of Egocentric Video | Oct 13, 2021 | De-identificationEthics | CodeCode Available | 1 |
| PASS: An ImageNet replacement for self-supervised pretraining without humans | Sep 27, 2021 | BenchmarkingEthics | CodeCode Available | 1 |
| Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis | Sep 17, 2021 | ArticlesEmotion Recognition | CodeCode Available | 1 |
| Ethics Sheets for AI Tasks | Jul 2, 2021 | ArticlesEmotion Recognition | CodeCode Available | 1 |
| VERB: Visualizing and Interpreting Bias Mitigation Techniques for Word Representations | Apr 6, 2021 | Decision MakingDimensionality Reduction | CodeCode Available | 1 |
| Evaluating the Clinical Realism of Synthetic Chest X-Rays Generated Using Progressively Growing GANs | Oct 7, 2020 | Conditional Image GenerationData Augmentation | CodeCode Available | 1 |
| Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes | Aug 20, 2020 | DescriptiveEthics | CodeCode Available | 1 |
| Deontological Ethics By Monotonicity Shape Constraints | Jan 31, 2020 | EthicsFairness | CodeCode Available | 1 |
| Teaching Software Engineering for AI-Enabled Systems | Jan 18, 2020 | EthicsFairness | CodeCode Available | 1 |
| The Ethical Implications of AI in Creative Industries: A Focus on AI-Generated Art | Jul 8, 2025 | EthicsMisinformation | —Unverified | 0 |
| Feeling Machines: Ethics, Culture, and the Rise of Emotional AI | Jun 14, 2025 | EthicsNavigate | —Unverified | 0 |
| Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics | Jun 14, 2025 | Computational EfficiencyEthics | —Unverified | 0 |
| SocialCredit+ | Jun 12, 2025 | Credit scoreEthics | —Unverified | 0 |
| "I Hadn't Thought About That": Creators of Human-like AI Weigh in on Ethics And Neurodivergence | Jun 12, 2025 | Ethics | —Unverified | 0 |
| Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations | Jun 12, 2025 | Ethics | —Unverified | 0 |
| MIRA: Medical Time Series Foundation Model for Real-World Health Data | Jun 9, 2025 | EthicsMissing Values | —Unverified | 0 |