L2 Regularization

See Weight Decay.

$L_{2}$ Regularization, or Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss and a penalty on the $L_{2}$ norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a hyperparameter controlling the strength of the penalty: larger values encourage smaller weights.
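
As a sketch, the penalty can be added to the training loss by hand. The following minimal PyTorch example does exactly that; the model, data, and the value of `lam` are illustrative:

```python
import torch

# Illustrative setup: any model, loss, and data would do here.
model = torch.nn.Linear(10, 1)
criterion = torch.nn.MSELoss()
x, y = torch.randn(32, 10), torch.randn(32, 1)
lam = 1e-4  # penalty strength (lambda in the formula above); illustrative value

optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

optimizer.zero_grad()
# Primary loss plus the penalty lambda * w^T w, summed over all parameters
# (in practice, biases are often excluded from the penalty).
l2_penalty = sum(p.pow(2).sum() for p in model.parameters())
loss = criterion(model(x), y) + lam * l2_penalty
loss.backward()
optimizer.step()
```

In practice most frameworks fold this into the optimizer; in PyTorch, passing `weight_decay` to `torch.optim.SGD` applies an equivalent per-step shrinkage (up to the factor-of-2 scaling of $\lambda$) without building the penalty into the loss.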

Weight decay can also be incorporated directly into the weight update rule, rather than implicitly through the objective function. The term weight decay often refers to this direct implementation in the update rule, whereas L2 regularization usually refers to the implementation specified in the objective function.
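
To make the difference concrete, here is a minimal NumPy sketch of a single gradient step under both formulations; the values of `w`, `grad`, `lr`, and `lam` are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal(5)     # current weights
grad = rng.standard_normal(5)  # gradient of the primary loss at w
lr, lam = 0.1, 1e-2            # learning rate and penalty strength

# L2 regularization: the penalty's gradient (2 * lam * w) is folded
# into the loss gradient before the optimizer sees it.
w_l2 = w - lr * (grad + 2 * lam * w)

# Weight decay: the update rule itself shrinks the weights.
w_wd = w - lr * grad - lr * lam * w

# Under plain SGD the two coincide once lambda is rescaled by a factor of 2.
assert np.allclose(w_l2, w - lr * grad - lr * (2 * lam) * w)
```

Under plain SGD the two formulations are equivalent up to a rescaling of $\lambda$, but they diverge for adaptive optimizers: for example, PyTorch's `torch.optim.AdamW` implements the decoupled update-rule form, whereas the `weight_decay` argument of `torch.optim.Adam` folds the penalty into the gradient.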

Papers

Showing 51–60 of 128 papers

Title | Status | Hype
Motion Correction and Volumetric Reconstruction for Fetal Functional Magnetic Resonance Imaging Data | Code | 1
How Infinitely Wide Neural Networks Can Benefit from Multi-task Learning -- an Exact Macroscopic Characterization | Code | 0
Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning | — | 0
Disturbing Target Values for Neural Network Regularization | Code | 0
Regularized Training of Nearest Neighbor Language Models | — | 0
Sequence Length is a Domain: Length-based Overfitting in Transformer Models | Code | 0
Saddle-to-Saddle Dynamics in Deep Linear Networks: Small Initialization Training, Symmetry, and Sparsity | — | 0
Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation | — | 0
The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective | Code | 0
Learning with Hyperspherical Uniformity | Code | 0
