| Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs | Jul 22, 2024 | Model Editing, Red Teaming | Code Available | 1 |
| Model editing for distribution shifts in uranium oxide morphological analysis | Jul 22, 2024 | Deep Learning, Model Editing | Unverified | 0 |
| Mitigating Backdoor Attacks using Activation-Guided Model Editing | Jul 10, 2024 | Machine Unlearning, Model Editing | Unverified | 0 |
| LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Jun 28, 2024 | Mixture-of-Experts, Model Editing | Unverified | 0 |
| Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? | Jun 27, 2024 | Model Editing, Philosophy | Code Available | 0 |
| Sequential Editing for Lifelong Training of Speech Recognition Models | Jun 25, 2024 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| How Well Can Knowledge Edit Methods Edit Perplexing Knowledge? | Jun 25, 2024 | knowledge editing, Model Editing | Unverified | 0 |
| Stealth edits to large language models | Jun 18, 2024 | Language Modelling, Model Editing | Code Available | 0 |
| Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance | Jun 17, 2024 | knowledge editing, Model Editing | Unverified | 0 |
| Understanding the Collapse of LLMs in Model Editing | Jun 17, 2024 | Model Editing | Code Available | 0 |