| Towards a multi-stakeholder value-based assessment framework for algorithmic systems | May 9, 2022 | Ethics | CodeCode Available | 0 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | EthicsFairness | CodeCode Available | 0 |
| MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation | Nov 2, 2022 | counterfactualEthics | CodeCode Available | 0 |
| A Recommendation and Risk Classification System for Connecting Rough Sleepers to Essential Outreach Services | Jul 30, 2020 | EthicsGeneral Classification | CodeCode Available | 0 |
| HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation | May 16, 2025 | BenchmarkingEthics | CodeCode Available | 0 |
| Decorrelation using Optimal Transport | Jul 11, 2023 | Binary ClassificationEthics | CodeCode Available | 0 |
| Surveying Professional Writers on AI: Limitations, Expectations, and Fears | Apr 7, 2025 | EthicsMisinformation | CodeCode Available | 0 |
| ApplE: An Applied Ethics Ontology with Event Context | Feb 7, 2025 | Ethics | CodeCode Available | 0 |
| What are People Talking about in #BlackLivesMatter and #StopAsianHate? Exploring and Categorizing Twitter Topics Emerging in Online Social Movements through the Latent Dirichlet Allocation Model | May 29, 2022 | Ethics | CodeCode Available | 0 |
| Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts | Mar 3, 2025 | Ethics | CodeCode Available | 0 |