L2 Regularization

See Weight Decay.

$L_{2}$ Regularization, or Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss and a penalty on the (squared) $L_{2}$ norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a hyperparameter determining the strength of the penalty: larger values encourage smaller weights.
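
As a minimal sketch of this penalized objective, the example below adds the term $\lambda w^{T}w$ to a squared-error loss for a linear model in plain NumPy. The function and variable names (`l2_regularized_loss`, `lam`, etc.) are illustrative, not taken from any particular library.

```python
import numpy as np

def l2_regularized_loss(w, X, y, lam):
    """Squared-error loss for a linear model plus the L2 penalty lam * w^T w."""
    residuals = X @ w - y                     # predictions minus targets
    original_loss = np.mean(residuals ** 2)   # L_original(w)
    penalty = lam * (w @ w)                   # lam * ||w||_2^2
    return original_loss + penalty

# Toy usage with random data
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = rng.normal(size=100)
w = rng.normal(size=5)
print(l2_regularized_loss(w, X, y, lam=0.01))
```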

Weight decay can also be incorporated directly into the weight update rule, rather than implicitly through the objective function. In common usage, weight decay refers to the implementation in which the decay term is specified directly in the update rule, whereas L2 regularization refers to the implementation in which the penalty is specified in the objective function.
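
For concreteness, here is a minimal sketch (plain NumPy, illustrative function names) contrasting the two implementations for a single gradient-descent step: adding the gradient of the L2 penalty to the loss gradient, versus shrinking the weights directly in the update rule.

```python
import numpy as np

def step_l2_regularization(w, grad_original, lam, lr):
    """L2 regularization: the penalty lam * w^T w adds 2 * lam * w to the loss gradient."""
    return w - lr * (grad_original + 2.0 * lam * w)

def step_weight_decay(w, grad_original, decay, lr):
    """Weight decay: shrink the weights directly in the update rule."""
    return (1.0 - decay) * w - lr * grad_original

# For plain SGD the two coincide when decay = 2 * lr * lam
w = np.array([1.0, -2.0, 3.0])
g = np.array([0.1, 0.0, -0.2])
lr, lam = 0.1, 0.05
assert np.allclose(step_l2_regularization(w, g, lam, lr),
                   step_weight_decay(w, g, decay=2 * lr * lam, lr=lr))
```

With plain SGD the two updates coincide up to a rescaling of the coefficient, but with adaptive optimizers such as Adam, which rescale the loss gradient, adding the penalty to the loss and decaying the weights directly are no longer equivalent.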

Papers


Monkeypox disease recognition model based on improved SE-InceptionV3 (code available)
Convergence of a L2 regularized Policy Gradient Algorithm for the Multi Armed Bandit (code available)
An Experiment on Feature Selection using Logistic Regression
Prevalidated ridge regression is a highly-efficient drop-in replacement for logistic regression for high-dimensional data (code available)
Reverse Engineering Deep ReLU Networks An Optimization-based Algorithm
Gradient-based bilevel optimization for multi-penalty Ridge regression through matrix differential calculus (code available)
The Transient Nature of Emergent In-Context Learning in Transformers (code available)
On sparse regression, Lp-regularization, and automated model discovery
Maintaining Plasticity in Continual Learning via Regenerative Regularization
Less is More -- Towards parsimonious multi-task models using structured sparsity (code available)
Maintaining Plasticity in Deep Continual Learning (code available)
Dropout Regularization Versus ℓ2-Penalization in the Linear Model
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks (code available)
It's Enough: Relaxing Diagonal Constraints in Linear Autoencoders for Recommendation (code available)
Electromyography Signal Classification Using Deep Learning
Maximum margin learning of t-SPNs for cell classification with filtered input
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Planting and Mitigating Memorized Content in Predictive-Text Language Models (code available)
Globally Gated Deep Linear Networks
Linking Neural Collapse and L2 Normalization with Improved Out-of-Distribution Detection in Deep Neural Networks
On the utility and protection of optimization with differential privacy and classic regularization techniques
Perturbation of Deep Autoencoder Weights for Model Compression and Classification of Tabular Data
Guidelines for the Regularization of Gammas in Batch Normalization for Deep Residual Networks
A Note on the Regularity of Images Generated by Convolutional Neural Networks
A Closer Look at Rehearsal-Free Continual Learning