
L2 Regularization

See Weight Decay.

$L_{2}$ Regularization, or Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss and a penalty on the $L_{2}$ norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a hyperparameter that determines the strength of the penalty, encouraging smaller weights.
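
As a minimal sketch of this penalized objective (assuming a linear model with squared-error loss; the function name `l2_regularized_loss` and the toy data are purely illustrative):

```python
import numpy as np

def l2_regularized_loss(w, x, y, lam):
    """L_original(w) + lambda * w^T w for a linear model with squared error."""
    preds = x @ w
    original_loss = np.mean((preds - y) ** 2)  # primary loss L_original(w)
    penalty = lam * np.dot(w, w)               # L2 penalty: lambda * w^T w
    return original_loss + penalty

# Toy usage with random data.
rng = np.random.default_rng(0)
x, y = rng.normal(size=(100, 5)), rng.normal(size=100)
w = rng.normal(size=5)
print(l2_regularized_loss(w, x, y, lam=0.01))
```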

Weight decay can also be incorporated directly into the weight update rule, rather than implicitly through the objective function. In common usage, "weight decay" refers to the implementation that modifies the update rule directly, whereas "L2 regularization" refers to the implementation specified in the objective function. For plain SGD the two are equivalent up to a rescaling of $\lambda$, but for adaptive optimizers such as Adam they are not, which is why the distinction matters in practice.
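A sketch of the two update rules under vanilla SGD (the function names here are illustrative; for the $\lambda w^{T}w$ penalty above, the penalty's gradient contribution is $2\lambda w$):

```python
import numpy as np

def sgd_step_l2(w, grad, lr, lam):
    # L2 regularization: the penalty's gradient (2 * lam * w for the
    # lambda * w^T w penalty above) is folded into the loss gradient.
    return w - lr * (grad + 2 * lam * w)

def sgd_step_weight_decay(w, grad, lr, lam):
    # Weight decay: shrink the weights directly in the update rule,
    # decoupled from the loss gradient.
    return (1 - lr * lam) * w - lr * grad
```

With SGD these produce the same trajectory once $\lambda$ is rescaled (`sgd_step_l2` with $\lambda$ matches `sgd_step_weight_decay` with $2\lambda$); with Adam-style optimizers the decoupled form is what AdamW implements.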

Papers

Showing 1–10 of 128 papers

Title | Status | Hype
Maintaining Plasticity in Deep Continual Learning | Code | 2
The Transient Nature of Emergent In-Context Learning in Transformers | Code | 1
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks | Code | 1
It's Enough: Relaxing Diagonal Constraints in Linear Autoencoders for Recommendation | Code | 1
Motion Correction and Volumetric Reconstruction for Fetal Functional Magnetic Resonance Imaging Data | Code | 1
Towards Unsupervised Deep Image Enhancement with Generative Adversarial Network | Code | 1
Neural Pruning via Growing Regularization | Code | 1
Label-Only Membership Inference Attacks | Code | 1
Distributionally Robust Neural Networks | Code | 1
Quantifying Generalization in Reinforcement Learning | Code | 1
