L2 Regularization

See Weight Decay.

$L_{2}$ Regularization, or Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss function and a penalty on the $L_{2}$ norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda w^{T} w$$

where $\lambda$ is a hyperparameter controlling the strength of the penalty; larger values encourage smaller weights.
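As a concrete illustration, here is a minimal NumPy sketch of computing this penalized objective (the function name `l2_regularized_loss` and the mean-squared-error primary loss are assumptions chosen for illustration):

```python
import numpy as np

def l2_regularized_loss(w, X, y, lam):
    """Primary loss (here: mean squared error, chosen for illustration)
    plus the L2 penalty lambda * w^T w on the weights."""
    residuals = X @ w - y
    primary_loss = np.mean(residuals ** 2)  # L_original(w)
    penalty = lam * (w @ w)                 # lambda * w^T w
    return primary_loss + penalty
```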

Weight decay can also be incorporated directly into the weight update rule, rather than implicitly through the objective function. Often, weight decay refers to the implementation where the penalty is specified directly in the update rule, whereas $L_{2}$ regularization usually refers to the implementation specified in the objective function; a sketch of both is given below.
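The following is a minimal sketch of the two formulations under plain gradient descent (the function names are hypothetical, `grad` stands for the gradient of the unpenalized loss, and `lam` plays the role of $\lambda$):

```python
def step_l2_regularization(w, grad, lr, lam):
    """L2 regularization: the penalty sits in the objective, so its
    gradient, 2 * lam * w, is added to the loss gradient before the step."""
    return w - lr * (grad + 2.0 * lam * w)

def step_weight_decay(w, grad, lr, lam):
    """Weight decay: shrink the weights directly in the update rule,
    then take the usual gradient step on the unpenalized loss."""
    return (1.0 - lr * lam) * w - lr * grad
```

For vanilla gradient descent the two updates coincide up to a rescaling of $\lambda$; with adaptive optimizers such as Adam they generally differ, which is why the distinction matters in practice.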

