| Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment | May 17, 2025 | Model EditingTask Arithmetic | CodeCode Available | 0 |
| BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing | May 2, 2025 | knowledge editingModel Editing | —Unverified | 0 |
| A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment | Apr 22, 2025 | Model Editing | —Unverified | 0 |
| REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models | Apr 20, 2025 | AttributeImage Generation | —Unverified | 0 |
| When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers | Apr 15, 2025 | Binary ClassificationDomain Generalization | —Unverified | 0 |
| NAACL2025 Tutorial: Adaptation of Large Language Models | Apr 4, 2025 | Code GenerationModel Editing | —Unverified | 0 |
| Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching | Apr 3, 2025 | Answer GenerationEEG | CodeCode Available | 0 |
| Efficient Model Editing with Task-Localized Sparse Fine-tuning | Apr 3, 2025 | DisentanglementModel Editing | CodeCode Available | 0 |
| Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models | Mar 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactualLanguage Modeling | CodeCode Available | 1 |