| Position: Editing Large Language Models Poses Serious Safety Risks | Feb 5, 2025 | knowledge editing, Model Editing | —Unverified | 0 | 0 |
| Post-hoc Concept Bottleneck Models | May 31, 2022 | Model Editing | —Unverified | 0 | 0 |
| Potential and Challenges of Model Editing for Social Debiasing | Feb 21, 2024 | Model Editing | —Unverified | 0 | 0 |
| Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias based on Bayesian Theory | Aug 20, 2024 | Model Editing | —Unverified | 0 | 0 |
| Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks | Jan 31, 2024 | counterfactual, knowledge editing | —Unverified | 0 | 0 |
| A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment | Apr 22, 2025 | Model Editing | —Unverified | 0 | 0 |
| Can We Edit Multimodal Large Language Models? | Oct 12, 2023 | Model Editing | —Unverified | 0 | 0 |
| REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing | May 25, 2025 | knowledge editing, Language Modeling | —Unverified | 0 | 0 |
| Can LLMs be Fooled? Investigating Vulnerabilities in LLMs | Jul 30, 2024 | Model Editing | —Unverified | 0 | 0 |
| REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models | Apr 20, 2025 | Attribute, Image Generation | —Unverified | 0 | 0 |