| Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching | Apr 3, 2025 | Answer Generation, EEG | Code Available | 0 |
| Long-form evaluation of model editing | Feb 14, 2024 | Form, model | Code Available | 0 |
| LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Learning Where to Edit Vision Transformers | Nov 4, 2024 | Meta-Learning, Model Editing | Code Available | 0 |
| Resolving Lexical Bias in Edit Scoping with Projector Editor Networks | Aug 19, 2024 | Contrastive Learning, Model Editing | Code Available | 0 |
| The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse | Feb 15, 2024 | Benchmarking, Model Editing | Code Available | 0 |
| MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA | Dec 19, 2023 | Document Classification, Hallucination | Code Available | 0 |
| Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models | Mar 29, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Understanding the Collapse of LLMs in Model Editing | Jun 17, 2024 | Model Editing | Code Available | 0 |
| Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models | Feb 28, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| Language Anisotropic Cross-Lingual Model Editing | May 25, 2022 | model, Model Editing | Code Available | 0 |
| On the Robustness of Editing Large Language Models | Feb 8, 2024 | Model Editing, Text Generation | Code Available | 0 |
| Cross-lingual Editing in Multilingual Language Models | Jan 19, 2024 | Model Editing | Code Available | 0 |
| Unveiling Concept Attribution in Diffusion Models | Dec 3, 2024 | Model Editing | Code Available | 0 |
| Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs | Jun 16, 2025 | Diversity, Model Editing | Code Available | 0 |
| Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing | Oct 9, 2024 | Machine Translation, Model Editing | Code Available | 0 |
| Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm | Jun 25, 2025 | Model Editing | Code Available | 0 |
| Model Editing at Scale leads to Gradual and Catastrophic Forgetting | Jan 15, 2024 | Model Editing, Specificity | Code Available | 0 |
| Editing Common Sense in Transformers | May 24, 2023 | Common Sense Reasoning, Model Editing | Code Available | 0 |
| A cost-effective method for improving and re-purposing large, pre-trained GANs by fine-tuning their class-embeddings | Oct 10, 2019 | Diversity, Model Editing | Code Available | 0 |
| How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries | Feb 23, 2024 | Model Editing, Response Generation | Code Available | 0 |
| What does the Knowledge Neuron Thesis Have to do with Knowledge? | May 3, 2024 | Model Editing | Code Available | 0 |
| Model Editing for LLMs4Code: How Far are We? | Nov 11, 2024 | 16k, Code Generation | Code Available | 0 |
| Gradient Rewiring for Editable Graph Neural Network Training | Oct 21, 2024 | Graph Neural Network, Model Editing | Code Available | 0 |
| WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models | May 23, 2024 | Hallucination, Model Editing | Code Available | 0 |
| Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? | Jun 27, 2024 | Model Editing, Philosophy | Code Available | 0 |
| Scalable Model Editing via Customized Expert Networks | Apr 3, 2024 | Hallucination, model | Code Available | 0 |
| Consecutive Batch Model Editing with HooK Layers | Mar 8, 2024 | model, Model Editing | Code Available | 0 |
| UniErase: Unlearning Token as a Universal Erasure Primitive for Language Models | May 21, 2025 | Machine Unlearning, Model Editing | Code Available | 0 |
| Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing | Dec 17, 2024 | Misinformation, Model Editing | Code Available | 0 |
| NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization | May 17, 2025 | Attribute, Model Editing | Code Available | 0 |
| Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification | Dec 21, 2024 | image-classification, Image Classification | Code Available | 0 |
| Should We Really Edit Language Models? On the Evaluation of Edited Language Models | Oct 24, 2024 | General Knowledge, Model Editing | Code Available | 0 |
| CoME: An Unlearning-based Approach to Conflict-free Model Editing | Feb 20, 2025 | Model Editing | Code Available | 0 |
| "Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models | Feb 29, 2024 | Misinformation, Model Editing | Code Available | 0 |
| Can We Edit Multimodal Large Language Models? | Oct 12, 2023 | Model Editing | Code Available | 0 |
| Parameter-tuning-free data entry error unlearning with adaptive selective synaptic dampening | Feb 6, 2024 | Model Editing | Code Available | 0 |
| Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models | Oct 25, 2024 | backdoor defense, Model Editing | Code Available | 0 |
| ELDER: Enhancing Lifelong Model Editing with Mixture-of-LoRA | Aug 19, 2024 | Model Editing | Code Available | 0 |
| Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models | Jan 19, 2024 | Model Editing, Red Teaming | Code Available | 0 |
| Drop Dropout on Single-Epoch Language Model Pretraining | May 30, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models | May 31, 2024 | Hallucination, Model Editing | Code Available | 0 |
| Stealth edits to large language models | Jun 18, 2024 | Language Modelling, Model Editing | Code Available | 0 |