L2 Regularization

$L_{2}$ Regularization, also known as Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss and a penalty on the $L_{2}$ norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a hyperparameter that determines the strength of the penalty (larger values encourage smaller weights).
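As a concrete illustration, here is a minimal sketch of the penalized objective for a linear model with squared-error loss; the model choice and the names X, y, w, and lam are assumptions for this example, not part of the definition above.

```python
import numpy as np

def l2_regularized_loss(w, X, y, lam):
    # Primary loss L_original(w): mean squared error of a linear model.
    residual = X @ w - y
    l_original = 0.5 * np.mean(residual ** 2)
    # Penalty term lambda * w^T w on the L2 norm of the weights.
    penalty = lam * (w @ w)
    return l_original + penalty
```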

Weight decay can also be incorporated directly into the weight update rule, rather than only implicitly through the objective function. Often, weight decay refers to the implementation in which the decay is specified directly in the update rule, whereas L2 regularization usually refers to the implementation specified in the objective function.
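The sketch below contrasts the two formulations for a single SGD step; the names w, grad, lr, lam, and decay are illustrative assumptions.

```python
def sgd_step_l2(w, grad, lr, lam):
    # L2 regularization: the penalty lambda * w^T w is part of the
    # objective, so its gradient 2 * lam * w is added to the loss gradient.
    return w - lr * (grad + 2 * lam * w)

def sgd_step_weight_decay(w, grad, lr, decay):
    # Weight decay: the shrinkage is specified directly in the update rule,
    # multiplying the weights by (1 - lr * decay) alongside the gradient step.
    return (1 - lr * decay) * w - lr * grad
```

For plain SGD the two coincide (with decay $= 2\lambda$), but for adaptive optimizers such as Adam they generally do not, which is why the distinction matters in practice.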

Papers

Showing 31–40 of 128 papers:

Analysis of High-dimensional Gaussian Labeled-unlabeled Mixture Model via Message-passing Algorithm
A Bayesian encourages dropout
Dropout Regularization Versus $\ell_2$-Penalization in the Linear Model
Automatic Discovery and Optimization of Parts for Image Classification
Derivative-based regularization for regression
Deep Optimization model for Screen Content Image Quality Assessment using Neural Networks
Attentive Recurrent Tensor Model for Community Question Answering
Distribution-Dependent Sample Complexity of Large Margin Learning
Automatic Parameter Tying in Neural Networks
A Max-Affine Spline Perspective of Recurrent Neural Networks
