
L2 Regularization

See Weight Decay.

$L_{2}$ Regularization, or Weight Decay, is a regularization technique applied to the weights of a neural network. We minimize a loss function comprising both the primary loss and a penalty on the $L_{2}$ Norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a hyperparameter that controls the strength of the penalty, encouraging smaller weights.
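
As a concrete illustration, here is a minimal PyTorch-style sketch of this objective. The model, data, and value of $\lambda$ are illustrative placeholders, not taken from any particular paper:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)          # placeholder model (assumption)
criterion = nn.MSELoss()
lam = 1e-4                        # penalty strength, lambda (assumption)

inputs = torch.randn(32, 10)      # random stand-in data
targets = torch.randn(32, 1)

primary_loss = criterion(model(inputs), targets)
# lambda * w^T w, summed over all weight tensors of the model
l2_penalty = sum((p ** 2).sum() for p in model.parameters())
loss = primary_loss + lam * l2_penalty
loss.backward()
```

In frameworks such as PyTorch, the same effect is more commonly obtained by passing a weight_decay argument to the optimizer rather than adding the term to the loss by hand.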

Weight decay can also be incorporated directly into the weight update rule, rather than implicitly through the objective function. Often "weight decay" refers to the implementation in which the decay is specified directly in the update rule, whereas "L2 regularization" usually refers to the implementation specified in the objective function.
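
The following NumPy sketch contrasts the two formulations for a single gradient step; all names (grad_loss, lr, lam) are illustrative assumptions, with grad_loss standing in for the gradient of the primary loss:

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=5)
grad_loss = rng.normal(size=5)   # placeholder for dL_original/dw
lr, lam = 0.1, 1e-2

# (a) L2 regularization: the penalty enters through the objective,
#     so its gradient 2 * lam * w is added to the loss gradient.
w_l2 = w - lr * (grad_loss + 2 * lam * w)

# (b) Weight decay: the weights are shrunk directly in the update rule,
#     decoupled from the loss gradient.
w_wd = (1 - lr * lam) * w - lr * grad_loss
```

For vanilla SGD the two updates coincide up to a rescaling of $\lambda$; the distinction matters mainly for optimizers that adapt the step size per parameter.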

Papers

Showing 26–50 of 128 papers

Learning with Hyperspherical Uniformity (code available)
How Infinitely Wide Neural Networks Can Benefit from Multi-task Learning -- an Exact Macroscopic Characterization (code available)
DACN: Dual-Attention Convolutional Network for Hyperspectral Image Super-Resolution (code available)
Gradient-based bilevel optimization for multi-penalty Ridge regression through matrix differential calculus (code available)
Learning a smooth kernel regularizer for convolutional neural networks (code available)
Neurogenesis-Inspired Dictionary Learning: Online Model Adaption in a Changing World (code available)
From large-eddy simulations to deep learning: A U-net model for fast urban canopy flow predictions (code available)
What is the Effect of Importance Weighting in Deep Learning? (code available)
Data-dependent Gaussian Prior Objective for Language Generation
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning
Customers Churn Prediction in Financial Institution Using Artificial Neural Network
CtrTab: Tabular Data Synthesis with High-Dimensional and Limited Data
An Experiment on Feature Selection using Logistic Regression
Action Classification with Locality-constrained Linear Coding
Correlated Initialization for Correlated Data
A New Angle on L2 Regularization
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Comparative Study of Bitcoin Price Prediction
An efficient distributed learning algorithm based on effective local functional approximations
A Comparative Study of Neural Network Compression
A Bayesian traction force microscopy method with automated denoising in a user-friendly software package
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Empirical Study on Airline Delay Analysis and Prediction
Exponentially Weighted l_2 Regularization Strategy in Constructing Reinforced Second-order Fuzzy Rule-based Model
Emergence of Implicit Filter Sparsity in Convolutional Neural Networks