SOTAVerified

Second-order methods

Use second-order statistics to process data.

Papers

Showing 101150 of 181 papers

TitleStatusHype
Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations0
Second-Order Kernel Online Convex Optimization with Adaptive Sketching0
Second Order Methods for Bandit Optimization and Control0
Second-Order Min-Max Optimization with Lazy Hessians0
Second-order Neural Network Training Using Complex-step Directional Derivative0
Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study0
SLIM-QN: A Stochastic, Light, Momentumized Quasi-Newton Optimizer for Deep Neural Networks0
SONIA: A Symmetric Blockwise Truncated Optimization Algorithm0
SP2: A Second Order Stochastic Polyak Method0
Stochastic Dimension-reduced Second-order Methods for Policy Optimization0
Noise and Fluctuation of Finite Learning Rate Stochastic Gradient Descent0
Stochastic Heavy Ball0
Stochastic Newton Proximal Extragradient Method0
Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function0
Stochastic Quasi-Newton Langevin Monte Carlo0
Stochastic Second-order Methods for Non-convex Optimization with Inexact Hessian and Gradient0
Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function0
Stochastic Subspace Cubic Newton Method0
Structured second-order methods via natural gradient descent0
Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods0
Sub-Sampled Newton Methods II: Local Convergence Rates0
Sub-sampled Newton Methods with Non-uniform Sampling0
The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks0
The Many Faces of Exponential Weights in Online Learning0
Unlocking FedNL: Self-Contained Compute-Optimized Implementation0
Utility Maximization for Large-Scale Cell-Free Massive MIMO Downlink0
When Does Preconditioning Help or Hurt Generalization?0
KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products0
A block coordinate descent optimizer for classification problems exploiting convexity0
Accelerated Projected Gradient Method for the Optimization of Cell-Free Massive MIMO Downlink0
Accelerated Training of Federated Learning via Second-Order Methods0
Accelerating SGD for Distributed Deep-Learning Using Approximated Hessian Matrix0
Accelerating Stochastic Probabilistic Inference0
A Chaos Theory Approach to Understand Neural Network Optimization0
A comparison of second-order methods for deep convolutional neural networks0
Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization0
A Learn-to-Optimize Approach for Coordinate-Wise Step Sizes for Quasi-Newton Methods0
Adaptive Optimization Algorithms for Machine Learning0
Adaptive Second Order Coresets for Data-efficient Machine Learning0
Adaptive Single-Pass Stochastic Gradient Descent in Input Sparsity Time0
A Decentralized Quasi-Newton Method for Dual Formulations of Consensus Optimization0
A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training0
A Distributed Second-Order Algorithm You Can Trust0
Adversarial Vulnerability as a Consequence of On-Manifold Inseparibility0
A Flexible Tensor Block Coordinate Ascent Scheme for Hypergraph Matching0
A Generic Approach for Escaping Saddle points0
Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems0
A Homogenization Approach for Gradient-Dominated Stochastic Optimization0
Alternating direction method of multipliers for regularized multiclass support vector machines0
A Mini-Block Fisher Method for Deep Neural Networks0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.