| RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment | Apr 13, 2023 | Ethics | Code Available | 5 |
| TrustLLM: Trustworthiness in Large Language Models | Jan 10, 2024 | Ethics, Fairness | Code Available | 4 |
| Visual Large Language Models for Generalized and Specialized Applications | Jan 6, 2025 | Ethics | Code Available | 3 |
| How Can Recommender Systems Benefit from Large Language Models: A Survey | Jun 9, 2023 | Ethics, Feature Engineering | Code Available | 3 |
| A Survey on Evaluation of Large Language Models | Jul 6, 2023 | Ethics, Survey | Code Available | 3 |
| Aligning AI With Shared Human Values | Aug 5, 2020 | Ethics, reinforcement-learning | Code Available | 2 |
| Getting pwn'd by AI: Penetration Testing with Large Language Models | Jul 24, 2023 | Ethics, Task Planning | Code Available | 2 |
| JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs | Feb 8, 2024 | Ethics | Code Available | 2 |
| Data-Centric Foundation Models in Computational Healthcare: A Survey | Jan 4, 2024 | Ethics, Survey | Code Available | 2 |
| GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Aug 12, 2023 | Ethics, Red Teaming | Code Available | 2 |
| A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | May 2, 2024 | Diagnostic, Ethics | Code Available | 2 |
| On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Oct 11, 2024 | Ethics, Fairness | Code Available | 2 |
| PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Jul 8, 2024 | Ethics, Language Modeling | Code Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey | Aug 23, 2024 | Ethics | Code Available | 1 |
| Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Aug 11, 2023 | Adversarial Robustness, Ethics | Code Available | 1 |
| Ego4D: Around the World in 3,000 Hours of Egocentric Video | Oct 13, 2021 | De-identification, Ethics | Code Available | 1 |
| Ethics Sheets for AI Tasks | Jul 2, 2021 | Articles, Emotion Recognition | Code Available | 1 |
| MoralBench: Moral Evaluation of LLMs | Jun 6, 2024 | Ethics | Code Available | 1 |
| Can Machines Learn Morality? The Delphi Experiment | Oct 14, 2021 | Descriptive, Ethics | Code Available | 1 |
| Deontological Ethics By Monotonicity Shape Constraints | Jan 31, 2020 | Ethics, Fairness | Code Available | 1 |
| CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches | Sep 20, 2023 | Ethics, Graph Matching | Code Available | 1 |
| Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Jun 5, 2023 | Brain Tumor Segmentation, Ethics | Code Available | 1 |
| Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis | Sep 17, 2021 | Articles, Emotion Recognition | Code Available | 1 |
| Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark | Apr 6, 2023 | Decision Making, Ethics | Code Available | 1 |