SOTAVerified

Task Arithmetic

A task vector specifies a direction in the weight space of a pre-trained model such that movement in that direction improves performance on the task. We build task vectors by subtracting the weights of a pre-trained model from the weights of the same model after fine-tuning on a task. We show that these task vectors can be modified and combined through arithmetic operations such as negation and addition, and that the behavior of the resulting model is steered accordingly.
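The construction above can be sketched in a few lines. This is a minimal illustration, not the reference implementation: the function names (`task_vector`, `apply_vector`, `negate`, `add`) and the representation of weights as `{parameter_name: list of floats}` are assumptions made here for clarity; a real implementation would operate on framework tensors (e.g. a model's state dict).

```python
# Minimal sketch of task arithmetic on model weights.
# Weights are {parameter_name: list of floats}; a real
# implementation would use framework tensors instead.

def task_vector(pretrained, finetuned):
    """tau = theta_finetuned - theta_pretrained, per parameter."""
    return {k: [f - p for f, p in zip(finetuned[k], pretrained[k])]
            for k in pretrained}

def apply_vector(pretrained, vector, scale=1.0):
    """Move the pre-trained weights along a (scaled) task vector."""
    return {k: [p + scale * v for p, v in zip(pretrained[k], vector[k])]
            for k in pretrained}

def negate(vector):
    """Negation steers the model away from a task (forgetting)."""
    return {k: [-v for v in vs] for k, vs in vector.items()}

def add(v1, v2):
    """Addition combines task vectors into one multi-task edit."""
    return {k: [a + b for a, b in zip(v1[k], v2[k])] for k in v1}

# Toy example: two fine-tunes of the same two-weight "model".
pre  = {"w": [0.0, 0.0]}
ft_a = {"w": [1.0, 0.0]}   # fine-tuned on task A
ft_b = {"w": [0.0, 1.0]}   # fine-tuned on task B

merged = apply_vector(pre, add(task_vector(pre, ft_a),
                               task_vector(pre, ft_b)))
print(merged)  # {'w': [1.0, 1.0]}
```

The `scale` parameter reflects a common practice in this line of work: the summed task vector is applied with a scaling coefficient tuned on held-out data rather than at full strength.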

Papers

Showing 26-50 of 61 papers

Title | Status | Hype
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach | | 0
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach | | 0
Task Arithmetic Through The Lens Of One-Shot Federated Learning | | 0
Multi-Task Model Merging via Adaptive Weight Disentanglement | Code | 0
Task Singular Vectors: Reducing Task Interference in Model Merging | Code | 2
Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics | | 0
ATM: Improving Model Merging by Alternating Tuning and Merging | | 0
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging | | 0
The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse | | 0
NegMerge: Consensual Weight Negation for Strong Machine Unlearning | Code | 1
What Matters for Model Merging at Scale? | | 0
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Code | 0
Task Arithmetic for Language Expansion in Speech Translation | | 0
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic | Code | 1
Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic | Code | 1
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling | Code | 1
On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion | | 0
MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic | | 0
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | | 0
HPE-CogVLM: Advancing Vision Language Models with a Head Pose Grounding Task | | 0
Localizing Task Information for Improved Model Merging and Compression | Code | 2
To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models | | 0
No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement | Code | 0
Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging | Code | 1
Ethos: Rectifying Language Models in Orthogonal Parameter Space | | 0
Page 2 of 3

No leaderboard results yet.