SOTAVerified

Task Arithmetic

A task vector specifies a direction in the weight space of a pre-trained model, such that movement in that direction improves performance on the task. We build task vectors by subtracting the weights of a pre-trained model from the weights of the same model after fine-tuning on a task. We show that these task vectors can be modified and combined together through arithmetic operations such as negation and addition, and the behavior of the resulting model is steered accordingly.

Papers

Showing 4150 of 61 papers

TitleStatusHype
MCU: Improving Machine Unlearning through Mode Connectivity0
MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic0
Neural Networks Remember More: The Power of Parameter Isolation and Combination0
On Fairness of Task Arithmetic: The Role of Task Vectors0
On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion0
OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models0
Scalable Strategies for Continual Learning with Replay0
Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning0
Soup to go: mitigating forgetting during continual learning with model averaging0
Subspace-Boosted Model Merging0
Show:102550
← PrevPage 5 of 7Next →

No leaderboard results yet.