| What does the Knowledge Neuron Thesis Have to do with Knowledge? | May 3, 2024 | Model Editing | Code Available | 0 |
| On Mechanistic Knowledge Localization in Text-to-Image Generative Models | May 2, 2024 | Model Editing | Code Available | 1 |
| Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 | May 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models | Apr 21, 2024 | Generative Adversarial Network, Model Editing | Code Available | 1 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactual, model | Code Available | 2 |
| Locating and Editing Factual Associations in Mamba | Apr 4, 2024 | Mamba, Model Editing | Code Available | 1 |
| Scalable Model Editing via Customized Expert Networks | Apr 3, 2024 | Hallucination, model | Code Available | 0 |
| Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering | Mar 28, 2024 | Hallucination, In-Context Learning | Code Available | 1 |
| Robust and Scalable Model Editing for Large Language Models | Mar 26, 2024 | Model Editing | Code Available | 1 |
| A Unified Framework for Model Editing | Mar 21, 2024 | Memorization, model | Code Available | 1 |
| BadEdit: Backdooring large language models by model editing | Mar 20, 2024 | Backdoor Attack, knowledge editing | Code Available | 1 |
| Editing Massive Concepts in Text-to-Image Diffusion Models | Mar 20, 2024 | Model Editing | Code Available | 1 |
| Efficiently Quantifying and Mitigating Ripple Effects in Model Editing | Mar 12, 2024 | Model Editing | Unverified | 0 |
| pyvene: A Library for Understanding and Improving PyTorch Models via Interventions | Mar 12, 2024 | Model Editing | Code Available | 5 |
| Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing | Mar 11, 2024 | model, Model Editing | Code Available | 1 |
| Consecutive Batch Model Editing with HooK Layers | Mar 8, 2024 | model, Model Editing | Code Available | 0 |
| "Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models | Feb 29, 2024 | Misinformation, Model Editing | Code Available | 0 |
| Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models | Feb 28, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries | Feb 23, 2024 | Model Editing, Response Generation | Code Available | 0 |
| Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond | Feb 22, 2024 | Meta-Learning, Model Editing | Unverified | 0 |
| Knowledge Graph Enhanced Large Language Model Editing | Feb 21, 2024 | Knowledge Graphs, Language Modeling | Unverified | 0 |
| Potential and Challenges of Model Editing for Social Debiasing | Feb 21, 2024 | Model Editing | Unverified | 0 |
| Dense Passage Retrieval: Is it Retrieving? | Feb 16, 2024 | Model Editing, Passage Retrieval | Unverified | 0 |
| Model Editing by Standard Fine-Tuning | Feb 16, 2024 | Computational Efficiency, model | Code Available | 1 |
| Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) | Feb 16, 2024 | Model Editing | Code Available | 2 |
| Towards Uncovering How Large Language Model Works: An Explainability Perspective | Feb 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse | Feb 15, 2024 | Benchmarking, Model Editing | Code Available | 0 |
| Long-form evaluation of model editing | Feb 14, 2024 | Form, model | Code Available | 0 |
| Rethinking Machine Unlearning for Large Language Models | Feb 13, 2024 | Machine Unlearning, Management | Unverified | 0 |
| Model Editing with Canonical Examples | Feb 9, 2024 | Language Modelling, model | Code Available | 1 |
| On the Robustness of Editing Large Language Models | Feb 8, 2024 | Model Editing, Text Generation | Code Available | 0 |
| MEMORYLLM: Towards Self-Updatable Large Language Models | Feb 7, 2024 | Model Editing | Code Available | 3 |
| Parameter-tuning-free data entry error unlearning with adaptive selective synaptic dampening | Feb 6, 2024 | Model Editing | Code Available | 0 |
| Continual Learning for Large Language Models: A Survey | Feb 2, 2024 | Continual Learning, Continual Pretraining | Unverified | 0 |
| Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks | Jan 31, 2024 | counterfactual, knowledge editing | Unverified | 0 |
| SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering | Jan 31, 2024 | Model Editing, Word Embeddings | Code Available | 0 |
| From Understanding to Utilization: A Survey on Explainability for Large Language Models | Jan 23, 2024 | Model Editing | Unverified | 0 |
| Cross-lingual Editing in Multilingual Language Models | Jan 19, 2024 | Model Editing | Code Available | 0 |
| Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models | Jan 19, 2024 | Model Editing, Red Teaming | Code Available | 0 |
| Model Editing at Scale leads to Gradual and Catastrophic Forgetting | Jan 15, 2024 | Model Editing, Specificity | Code Available | 0 |
| Editing Arbitrary Propositions in LLMs without Subject Labels | Jan 15, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue | Jan 9, 2024 | Model Editing, Natural Language Inference | Code Available | 1 |
| MPN: Leveraging Multilingual Patch Neuron for Cross-lingual Model Editing | Jan 6, 2024 | Model Editing | Unverified | 0 |
| Large Language Models Relearn Removed Concepts | Jan 3, 2024 | Model Editing | Code Available | 1 |
| A Comprehensive Study of Knowledge Editing for Large Language Models | Jan 2, 2024 | knowledge editing, Model Editing | Code Available | 5 |
| MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA | Dec 19, 2023 | Document Classification, Hallucination | Code Available | 0 |
| History Matters: Temporal Knowledge Editing in Large Language Model | Dec 9, 2023 | knowledge editing, Language Modeling | Code Available | 1 |
| Neuron Patching: Semantic-based Neuron-level Language Model Repair for Code Generation | Dec 8, 2023 | Code Generation, Language Modeling | Unverified | 0 |
| DemoCaricature: Democratising Caricature Generation with a Rough Sketch | Dec 7, 2023 | Caricature, Model Editing | Unverified | 0 |