| Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values | Jun 30, 2022 | Additive modelsBIG-bench Machine Learning | CodeCode Available | 5 |
| A Comprehensive Study of Knowledge Editing for Large Language Models | Jan 2, 2024 | knowledge editingModel Editing | CodeCode Available | 5 |
| pyvene: A Library for Understanding and Improving PyTorch Models via Interventions | Mar 12, 2024 | Model Editing | CodeCode Available | 5 |
| Locating and Editing Factual Associations in GPT | Feb 10, 2022 | counterfactualModel Editing | CodeCode Available | 3 |
| Sparse Autoencoders Find Highly Interpretable Features in Language Models | Sep 15, 2023 | counterfactualLanguage Modelling | CodeCode Available | 3 |
| MEMORYLLM: Towards Self-Updatable Large Language Models | Feb 7, 2024 | Model Editing | CodeCode Available | 3 |
| AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models | Oct 3, 2024 | knowledge editingModel Editing | CodeCode Available | 3 |
| Neuron-Level Sequential Editing for Large Language Models | Oct 5, 2024 | Model Editing | CodeCode Available | 3 |
| Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) | Feb 16, 2024 | Model Editing | CodeCode Available | 2 |
| Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity | May 22, 2024 | Language ModellingModel Editing | CodeCode Available | 2 |
| UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | May 20, 2025 | GPULifelong learning | CodeCode Available | 2 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 |
| Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis | Sep 21, 2024 | Model EditingPrediction | CodeCode Available | 2 |
| PMET: Precise Model Editing in a Transformer | Aug 17, 2023 | General Knowledgemodel | CodeCode Available | 1 |
| Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models | Oct 16, 2023 | Model Editing | CodeCode Available | 1 |
| Perturbation-Restrained Sequential Model Editing | May 27, 2024 | Continual Learningmodel | CodeCode Available | 1 |
| Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors | Nov 20, 2022 | Model EditingWorld Knowledge | CodeCode Available | 1 |
| Model Editing with Canonical Examples | Feb 9, 2024 | Language Modellingmodel | CodeCode Available | 1 |
| On Mechanistic Knowledge Localization in Text-to-Image Generative Models | May 2, 2024 | Model Editing | CodeCode Available | 1 |
| Rebuilding ROME : Resolving Model Collapse during Sequential Model Editing | Mar 11, 2024 | modelModel Editing | CodeCode Available | 1 |
| Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning | May 6, 2024 | knowledge editingLifelong learning | CodeCode Available | 1 |
| Learning to Model Editing Processes | May 24, 2022 | Machine Translationmodel | CodeCode Available | 1 |
| Model Editing by Standard Fine-Tuning | Feb 16, 2024 | Computational Efficiencymodel | CodeCode Available | 1 |
| Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue | Jan 9, 2024 | Model EditingNatural Language Inference | CodeCode Available | 1 |
| Massive Editing for Large Language Models via Meta Learning | Nov 8, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| ModelPS: An Interactive and Collaborative Platform for Editing Pre-trained Models at Scale | May 18, 2021 | Model Editing | CodeCode Available | 1 |
| Large Scale Knowledge Washing | May 26, 2024 | DecoderMemorization | CodeCode Available | 1 |
| Large Language Models Relearn Removed Concepts | Jan 3, 2024 | Model Editing | CodeCode Available | 1 |
| BadEdit: Backdooring large language models by model editing | Mar 20, 2024 | Backdoor Attackknowledge editing | CodeCode Available | 1 |
| Evaluating the Ripple Effects of Knowledge Editing in Language Models | Jul 24, 2023 | Diagnosticknowledge editing | CodeCode Available | 1 |
| Learn From Model Beyond Fine-Tuning: A Survey | Oct 12, 2023 | Meta-Learningmodel | CodeCode Available | 1 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Interpreting and Controlling Vision Foundation Models via Text Explanations | Oct 16, 2023 | Model EditingVisual Reasoning | CodeCode Available | 1 |
| Injecting Universal Jailbreak Backdoors into LLMs in Minutes | Feb 9, 2025 | Model Editing | CodeCode Available | 1 |
| DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models | Oct 31, 2023 | MemorizationModel Editing | CodeCode Available | 1 |
| Editing Massive Concepts in Text-to-Image Diffusion Models | Mar 20, 2024 | Model Editing | CodeCode Available | 1 |
| Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark | May 27, 2023 | Model EditingSpecificity | CodeCode Available | 1 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Fast Model Editing at Scale | Oct 21, 2021 | GPULanguage Modelling | CodeCode Available | 1 |
| Editing Large Language Models: Problems, Methods, and Opportunities | May 22, 2023 | Model Editing | CodeCode Available | 1 |
| Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models | Jan 10, 2023 | Denoisingknowledge editing | CodeCode Available | 1 |
| Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks | Sep 29, 2023 | Model Editing | CodeCode Available | 1 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| DUnE: Dataset for Unified Editing | Nov 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Editing Implicit Assumptions in Text-to-Image Diffusion Models | Mar 14, 2023 | AttributeModel Editing | CodeCode Available | 1 |
| Locating and Editing Factual Associations in Mamba | Apr 4, 2024 | MambaModel Editing | CodeCode Available | 1 |
| A Unified Framework for Model Editing | Mar 21, 2024 | Memorizationmodel | CodeCode Available | 1 |
| History Matters: Temporal Knowledge Editing in Large Language Model | Dec 9, 2023 | knowledge editingLanguage Modeling | CodeCode Available | 1 |
| Reinforced Lifelong Editing for Language Models | Feb 9, 2025 | Model Editing | CodeCode Available | 1 |
| Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 | May 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |