| RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment | Apr 13, 2023 | Ethics | CodeCode Available | 5 |
| TrustLLM: Trustworthiness in Large Language Models | Jan 10, 2024 | EthicsFairness | CodeCode Available | 4 |
| Visual Large Language Models for Generalized and Specialized Applications | Jan 6, 2025 | Ethics | CodeCode Available | 3 |
| A Survey on Evaluation of Large Language Models | Jul 6, 2023 | EthicsSurvey | CodeCode Available | 3 |
| How Can Recommender Systems Benefit from Large Language Models: A Survey | Jun 9, 2023 | EthicsFeature Engineering | CodeCode Available | 3 |
| On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Oct 11, 2024 | EthicsFairness | CodeCode Available | 2 |
| PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Jul 8, 2024 | EthicsLanguage Modeling | CodeCode Available | 2 |
| A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | May 2, 2024 | DiagnosticEthics | CodeCode Available | 2 |
| JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs | Feb 8, 2024 | Ethics | CodeCode Available | 2 |
| Data-Centric Foundation Models in Computational Healthcare: A Survey | Jan 4, 2024 | EthicsSurvey | CodeCode Available | 2 |
| GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Aug 12, 2023 | EthicsRed Teaming | CodeCode Available | 2 |
| Getting pwn'd by AI: Penetration Testing with Large Language Models | Jul 24, 2023 | EthicsTask Planning | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| Aligning AI With Shared Human Values | Aug 5, 2020 | Ethicsreinforcement-learning | CodeCode Available | 2 |
| XTRUST: On the Multilingual Trustworthiness of Large Language Models | Sep 24, 2024 | EthicsFairness | CodeCode Available | 1 |
| Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey | Aug 23, 2024 | Ethics | CodeCode Available | 1 |
| Language Model Alignment in Multilingual Trolley Problems | Jul 2, 2024 | Decision MakingEthics | CodeCode Available | 1 |
| MoralBench: Moral Evaluation of LLMs | Jun 6, 2024 | Ethics | CodeCode Available | 1 |
| MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models | Mar 6, 2024 | EthicsGeneral Knowledge | CodeCode Available | 1 |
| NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism | Feb 29, 2024 | EthicsMultiple-choice | CodeCode Available | 1 |
| E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models | Jan 29, 2024 | EthicsMultiple-choice | CodeCode Available | 1 |
| A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics | Oct 9, 2023 | EthicsFairness | CodeCode Available | 1 |
| CATS: Conditional Adversarial Trajectory Synthesis for Privacy-Preserving Trajectory Data Publication Using Deep Learning Approaches | Sep 20, 2023 | EthicsGraph Matching | CodeCode Available | 1 |
| Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics | Sep 13, 2023 | EthicsTruthfulQA | CodeCode Available | 1 |
| TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection | Aug 21, 2023 | Anomaly DetectionAttribute | CodeCode Available | 1 |