
L2 Regularization

See Weight Decay.

$L_{2}$ Regularization, or Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss and a penalty on the squared $L_{2}$ norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a hyperparameter that determines the strength of the penalty; larger values encourage smaller weights.
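As a concrete illustration, here is a minimal NumPy sketch of this penalized objective for a linear model with squared-error loss. The data `X`, targets `y`, weights `w`, and penalty strength `lam` are illustrative placeholders, not anything defined above.

```python
import numpy as np

def l2_regularized_loss(w, X, y, lam):
    """L_new(w) = L_original(w) + lam * w^T w for a linear model."""
    residual = X @ w - y
    original_loss = 0.5 * np.mean(residual ** 2)  # primary loss L_original
    penalty = lam * (w @ w)                       # lam * ||w||_2^2
    return original_loss + penalty

def l2_regularized_grad(w, X, y, lam):
    """Gradient of L_new with respect to w."""
    residual = X @ w - y
    grad_original = X.T @ residual / len(y)       # gradient of L_original
    return grad_original + 2.0 * lam * w          # penalty contributes 2*lam*w

# Toy usage: gradient descent on the regularized loss shrinks the
# weights more aggressively as lam grows.
rng = np.random.default_rng(0)
X, y = rng.normal(size=(100, 5)), rng.normal(size=100)
w = rng.normal(size=5)
for _ in range(200):
    w -= 0.1 * l2_regularized_grad(w, X, y, lam=0.1)
print(np.linalg.norm(w))
```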

Weight decay can also be incorporated directly into the weight update rule, rather than implicitly through the objective function:

$$w \leftarrow \left(1 - \lambda\right)w - \alpha\nabla L_{original}\left(w\right)$$

where $\alpha$ is the learning rate. In common usage, "weight decay" refers to this implementation, where the decay is specified directly in the update rule, whereas "$L_{2}$ regularization" usually refers to the implementation specified in the objective function; a sketch contrasting the two follows below.
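The distinction can be made concrete with two hypothetical SGD steps, one folding the penalty's gradient into the objective's gradient ($L_{2}$ regularization) and one shrinking the weights directly in the update rule (decoupled weight decay). All names here are illustrative placeholders.

```python
import numpy as np

def sgd_step_l2(w, grad_original, lr, lam):
    """L2 regularization: add the penalty's gradient, 2*lam*w, to the
    gradient of the original objective, then take an ordinary SGD step."""
    return w - lr * (grad_original + 2.0 * lam * w)

def sgd_step_weight_decay(w, grad_original, lr, lam):
    """Decoupled weight decay: shrink the weights by a factor (1 - lr*lam)
    in the update rule itself, separately from the gradient step."""
    return (1.0 - lr * lam) * w - lr * grad_original
```

For plain SGD the two updates coincide up to a rescaling of $\lambda$, but they diverge under adaptive optimizers such as Adam, where the $L_{2}$ gradient term gets rescaled per parameter while decoupled decay does not.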

Papers

Showing 1–10 of 128 papers

| Title | Status | Hype |
|---|---|---|
| Overcoming catastrophic forgetting in neural networks | | 0 |
| From large-eddy simulations to deep learning: A U-net model for fast urban canopy flow predictions | Code | 0 |
| DACN: Dual-Attention Convolutional Network for Hyperspectral Image Super-Resolution | Code | 0 |
| Geometry of Learning -- L2 Phase Transitions in Deep and Shallow Neural Networks | | 0 |
| Deep Learning in Renewable Energy Forecasting: A Cross-Dataset Evaluation of Temporal and Spatial Models | | 0 |
| Understand the Effect of Importance Weighting in Deep Learning on Dataset Shift | | 0 |
| Semantic segmentation for building houses from wooden cubes | | 0 |
| GPT Meets Graphs and KAN Splines: Testing Novel Frameworks on Multitask Fine-Tuned GPT-2 with LoRA | | 0 |
| CtrTab: Tabular Data Synthesis with High-Dimensional and Limited Data | | 0 |
| Low-rank bias, weight decay, and model merging in neural networks | | 0 |
