Task Arithmetic

A task vector specifies a direction in the weight space of a pre-trained model, such that moving in that direction improves performance on the task. We build task vectors by subtracting the weights of a pre-trained model from the weights of the same model after fine-tuning on a task. We show that these task vectors can be modified and combined through arithmetic operations such as negation and addition, and that the behavior of the resulting model is steered accordingly.
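The operations described above can be illustrated with a minimal sketch. It assumes a model's weights are represented as a dictionary mapping parameter names to flat lists of floats; the function names (`task_vector`, `negate`, `apply_task_vectors`), the `scale` parameter, and the toy weights are hypothetical and chosen only for illustration, not taken from any particular library or paper implementation.

```python
# Toy sketch of task arithmetic on weights stored as {name: [floats]}.
# All names and values here are illustrative assumptions.

def task_vector(pretrained, finetuned):
    """Task vector = fine-tuned weights minus pre-trained weights."""
    return {
        name: [f - p for f, p in zip(finetuned[name], pretrained[name])]
        for name in pretrained
    }

def negate(vector):
    """Negating a task vector points away from the task."""
    return {name: [-v for v in values] for name, values in vector.items()}

def apply_task_vectors(pretrained, vectors, scale=1.0):
    """Add the (scaled) sum of task vectors to the pre-trained weights."""
    edited = {name: list(values) for name, values in pretrained.items()}
    for vec in vectors:
        for name in edited:
            edited[name] = [w + scale * v for w, v in zip(edited[name], vec[name])]
    return edited

# Hypothetical weights for one pre-trained model and two fine-tuned copies.
pretrained = {"w": [1.0, 2.0, 3.0]}
finetuned_a = {"w": [1.5, 2.0, 2.0]}
finetuned_b = {"w": [1.0, 3.0, 3.5]}

tau_a = task_vector(pretrained, finetuned_a)
tau_b = task_vector(pretrained, finetuned_b)

# Addition: one model edited toward both tasks.
multi_task = apply_task_vectors(pretrained, [tau_a, tau_b])

# Negation: move away from task A.
forget_a = apply_task_vectors(pretrained, [negate(tau_a)])
```

In practice the same arithmetic would be applied per parameter tensor of a real checkpoint, and the scaling coefficient is typically tuned on held-out data; both points are assumptions of this sketch rather than details given in the description above.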

Papers


Showing 26–50 of 61 papers

Title	Date	Tasks	Status
MCU: Improving Machine Unlearning through Mode Connectivity	May 16, 2025	Image Classification	Unverified
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging	May 11, 2025	Task Arithmetic	Unverified
Investigating Task Arithmetic for Zero-Shot Information Retrieval	May 1, 2025	Information Retrieval, Re-Ranking	Code Available
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers	Apr 15, 2025	Binary Classification, Domain Generalization	Unverified
Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs	Apr 15, 2025	Task Arithmetic	Code Available
Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning	Apr 15, 2025	Multi-Task Learning, Scene Understanding	Unverified
Efficient Model Editing with Task-Localized Sparse Fine-tuning	Apr 3, 2025	Disentanglement, Model Editing	Code Available
OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models	Apr 2, 2025	Task Arithmetic	Unverified
Disentangling Task Interference within Neurons: Model Merging in Alignment with Neuronal Mechanisms	Mar 7, 2025	Task Arithmetic	Unverified
Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge	Feb 27, 2025	GSM8K, HumanEval	Unverified
Neural Networks Remember More: The Power of Parameter Isolation and Combination	Feb 16, 2025	Continual Learning, Task Arithmetic	Unverified
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing	Feb 6, 2025	Task Arithmetic	Unverified
Efficient Model Editing with Task Vector Bases: A Theoretical Framework and Scalable Approach	Feb 3, 2025	Model Editing, Negation	Code Available
Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts	Jan 25, 2025	Navigate, Task Arithmetic	Unverified
Soup to go: mitigating forgetting during continual learning with model averaging	Jan 9, 2025	Continual Learning, Task Arithmetic	Unverified
BADTV: Unveiling Backdoor Threats in Third-Party Task Vectors	Jan 4, 2025	Backdoor Attack, Task Arithmetic	Unverified
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach	Jan 1, 2025	Domain Adaptation, Domain Generalization	Unverified
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach	Dec 16, 2024	Language Modeling	Unverified
Multi-Task Model Merging via Adaptive Weight Disentanglement	Nov 27, 2024	Disentanglement, model	Code Available
Task Arithmetic Through The Lens Of One-Shot Federated Learning	Nov 27, 2024	Federated Learning, Multi-Task Learning	Unverified
Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics	Nov 25, 2024	Knowledge Distillation, Multi-Task Learning	Unverified
ATM: Improving Model Merging by Alternating Tuning and Merging	Nov 5, 2024	Federated Learning, Multi-Task Learning	Unverified
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging	Oct 29, 2024	Mixture-of-Experts, Multi-Task Learning	Unverified
The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse	Oct 16, 2024	Linear Mode Connectivity, Task Arithmetic	Unverified
What Matters for Model Merging at Scale?	Oct 4, 2024	model, Task Arithmetic	Unverified

