| A Unified Framework for Model Editing | Mar 21, 2024 | Memorization, model | Code Available | 1 |
| BadEdit: Backdooring large language models by model editing | Mar 20, 2024 | Backdoor Attack, knowledge editing | Code Available | 1 |
| Editing Massive Concepts in Text-to-Image Diffusion Models | Mar 20, 2024 | Model Editing | Code Available | 1 |
| Efficiently Quantifying and Mitigating Ripple Effects in Model Editing | Mar 12, 2024 | Model Editing | Unverified | 0 |
| pyvene: A Library for Understanding and Improving PyTorch Models via Interventions | Mar 12, 2024 | Model Editing | Code Available | 5 |
| Rebuilding ROME : Resolving Model Collapse during Sequential Model Editing | Mar 11, 2024 | model, Model Editing | Code Available | 1 |
| Consecutive Batch Model Editing with HooK Layers | Mar 8, 2024 | model, Model Editing | Code Available | 0 |
| "Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models | Feb 29, 2024 | Misinformation, Model Editing | Code Available | 0 |
| Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models | Feb 28, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries | Feb 23, 2024 | Model Editing, Response Generation | Code Available | 0 |