SOTAVerified

Model Editing

Papers

Showing 125 of 193 papers

TitleStatusHype
pyvene: A Library for Understanding and Improving PyTorch Models via InterventionsCode5
A Comprehensive Study of Knowledge Editing for Large Language ModelsCode5
Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and ValuesCode5
Neuron-Level Sequential Editing for Large Language ModelsCode3
AlphaEdit: Null-Space Constrained Knowledge Editing for Language ModelsCode3
MEMORYLLM: Towards Self-Updatable Large Language ModelsCode3
Sparse Autoencoders Find Highly Interpretable Features in Language ModelsCode3
Locating and Editing Factual Associations in GPTCode3
UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language ModelsCode2
Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron AnalysisCode2
Model Editing as a Robust and Denoised variant of DPO: A Case Study on ToxicityCode2
Decomposing and Editing Predictions by Modeling Model ComputationCode2
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)Code2
BiasEdit: Debiasing Stereotyped Language Models via Model EditingCode1
SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion ModelsCode1
The Mirage of Model Editing: Revisiting Evaluation in the WildCode1
Reinforced Lifelong Editing for Language ModelsCode1
Injecting Universal Jailbreak Backdoors into LLMs in MinutesCode1
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEditCode1
Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMsCode1
Perturbation-Restrained Sequential Model EditingCode1
Large Scale Knowledge WashingCode1
Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt LearningCode1
On Mechanistic Knowledge Localization in Text-to-Image Generative ModelsCode1
Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3Code1
Show:102550
← PrevPage 1 of 8Next →

No leaderboard results yet.