| A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment | Apr 22, 2025 | Model Editing | —Unverified | 0 |
| REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models | Apr 20, 2025 | AttributeImage Generation | —Unverified | 0 |
| When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers | Apr 15, 2025 | Binary ClassificationDomain Generalization | —Unverified | 0 |
| NAACL2025 Tutorial: Adaptation of Large Language Models | Apr 4, 2025 | Code GenerationModel Editing | —Unverified | 0 |
| Efficient Model Editing with Task-Localized Sparse Fine-tuning | Apr 3, 2025 | DisentanglementModel Editing | CodeCode Available | 0 |
| Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching | Apr 3, 2025 | Answer GenerationEEG | CodeCode Available | 0 |
| Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models | Mar 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Exploiting Edited Large Language Models as General Scientific Optimizers | Mar 8, 2025 | Model Editing | —Unverified | 0 |
| GeoEdit: Geometric Knowledge Editing for Large Language Models | Feb 27, 2025 | General Knowledgeknowledge editing | —Unverified | 0 |
| A Causal Lens for Evaluating Faithfulness Metrics | Feb 26, 2025 | Decision MakingFact Checking | —Unverified | 0 |