| Towards Practical Second-Order Optimizers in Deep Learning: Insights from Fisher Information Analysis | Apr 26, 2025 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Automatic Gradient Descent: Deep Learning without Hyperparameters | Apr 11, 2023 | Deep LearningSecond-order methods | CodeCode Available | 2 |
| Differentiable Expected Hypervolume Improvement for Parallel Multi-Objective Bayesian Optimization | Jun 9, 2020 | Bayesian OptimizationSecond-order methods | CodeCode Available | 2 |
| SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation | Feb 25, 2025 | Second-order methods | CodeCode Available | 1 |
| Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning | Jun 5, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective | Feb 5, 2024 | Second-order methods | CodeCode Available | 1 |
| Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC | Dec 9, 2023 | Second-order methods | CodeCode Available | 1 |
| MKOR: Momentum-Enabled Kronecker-Factor-Based Optimizer Using Rank-1 Updates | Jun 2, 2023 | Second-order methods | CodeCode Available | 1 |
| Symmetry Teleportation for Accelerated Optimization | May 21, 2022 | Second-order methods | CodeCode Available | 1 |
| Near out-of-distribution detection for low-resolution radar micro-Doppler signatures | May 12, 2022 | Contrastive LearningGeometry-aware processing | CodeCode Available | 1 |
| Communication-Efficient Stochastic Zeroth-Order Optimization for Federated Learning | Jan 24, 2022 | Federated LearningSecond-order methods | CodeCode Available | 1 |
| Second-Order Neural ODE Optimizer | Sep 29, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| M-FAC: Efficient Matrix-Free Approximations of Second-Order Information | Jul 7, 2021 | Network PruningSecond-order methods | CodeCode Available | 1 |
| ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning | Jun 1, 2020 | BIG-bench Machine LearningSecond-order methods | CodeCode Available | 1 |
| MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation | Mar 27, 2020 | Bilevel OptimizationNeural Architecture Search | CodeCode Available | 1 |
| Low Rank Saddle Free Newton: A Scalable Method for Stochastic Nonconvex Optimization | Feb 7, 2020 | Second-order methods | CodeCode Available | 1 |
| Second-Order Stochastic Optimization for Machine Learning in Linear Time | Feb 12, 2016 | BIG-bench Machine LearningSecond-order methods | CodeCode Available | 1 |
| NysAct: A Scalable Preconditioned Gradient Descent using Nystrom Approximation | Jun 10, 2025 | Second-order methods | CodeCode Available | 0 |
| KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products | Jun 4, 2025 | image-classificationImage Classification | —Unverified | 0 |
| Accelerated Training of Federated Learning via Second-Order Methods | May 29, 2025 | Federated LearningSecond-order methods | —Unverified | 0 |
| Representation Meets Optimization: Training PINNs and PIKANs for Gray-Box Discovery in Systems Pharmacology | Apr 10, 2025 | Computational EfficiencyKolmogorov-Arnold Networks | —Unverified | 0 |
| FedOSAA: Improving Federated Learning with One-Step Anderson Acceleration | Mar 14, 2025 | Federated LearningSecond-order methods | —Unverified | 0 |
| Online Covariance Matrix Estimation in Sketched Newton Methods | Feb 10, 2025 | parameter estimationSecond-order methods | CodeCode Available | 0 |
| A Survey of Optimization Methods for Training DL Models: Theoretical Perspective on Convergence and Generalization | Jan 24, 2025 | Distributed OptimizationNavigate | —Unverified | 0 |
| Distributed Quasi-Newton Method for Fair and Fast Federated Learning | Jan 18, 2025 | FairnessFederated Learning | —Unverified | 0 |
| Preconditioners for the Stochastic Training of Neural Fields | Jan 1, 2025 | Image ReconstructionNeRF | —Unverified | 0 |
| GP-FL: Model-Based Hessian Estimation for Second-Order Over-the-Air Federated Learning | Dec 5, 2024 | Federated LearningSecond-order methods | —Unverified | 0 |
| A Learn-to-Optimize Approach for Coordinate-Wise Step Sizes for Quasi-Newton Methods | Nov 25, 2024 | Second-order methods | —Unverified | 0 |
| Gradient Norm Regularization Second-Order Algorithms for Solving Nonconvex-Strongly Concave Minimax Problems | Nov 24, 2024 | Second-order methods | —Unverified | 0 |
| Don't Be So Positive: Negative Step Sizes in Second-Order Methods | Nov 18, 2024 | Second-order methods | —Unverified | 0 |
| Exact, Tractable Gauss-Newton Optimization in Deep Reversible Architectures Reveal Poor Generalization | Nov 12, 2024 | Second-order methods | CodeCode Available | 0 |
| Improving Stochastic Cubic Newton with Momentum | Oct 25, 2024 | Second-order methods | —Unverified | 0 |
| Second-Order Min-Max Optimization with Lazy Hessians | Oct 12, 2024 | Second-order methods | —Unverified | 0 |
| Unlocking FedNL: Self-Contained Compute-Optimized Implementation | Oct 11, 2024 | Federated LearningSecond-order methods | —Unverified | 0 |
| Adversarial Vulnerability as a Consequence of On-Manifold Inseparibility | Oct 9, 2024 | AttributeDimensionality Reduction | —Unverified | 0 |
| FLeNS: Federated Learning with Enhanced Nesterov-Newton Sketch | Sep 23, 2024 | Dimensionality ReductionEdge-computing | CodeCode Available | 0 |
| Alternating Iteratively Reweighted _1 and Subspace Newton Algorithms for Nonconvex Sparse Optimization | Jul 24, 2024 | Second-order methods | CodeCode Available | 0 |
| Fed-Sophia: A Communication-Efficient Second-Order Federated Learning Algorithm | Jun 10, 2024 | Federated LearningSecond-order methods | —Unverified | 0 |
| Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization | Jun 4, 2024 | Second-order methods | —Unverified | 0 |
| Stochastic Newton Proximal Extragradient Method | Jun 3, 2024 | Second-order methods | —Unverified | 0 |
| Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks | May 24, 2024 | Second-order methods | —Unverified | 0 |
| A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks | Apr 10, 2024 | DiversityImage Generation | CodeCode Available | 0 |
| Inverse-Free Fast Natural Gradient Descent Method for Deep Learning | Mar 6, 2024 | Deep Learningimage-classification | —Unverified | 0 |
| SGD with Partial Hessian for Deep Neural Networks Optimization | Mar 5, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Second Order Methods for Bandit Optimization and Control | Feb 14, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks | Feb 6, 2024 | Second-order methods | —Unverified | 0 |
| On The Temporal Domain of Differential Equation Inspired Graph Neural Networks | Jan 20, 2024 | Second-order methods | —Unverified | 0 |
| Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate | Jan 5, 2024 | Second-order methodssubspace methods | —Unverified | 0 |
| Adapting Newton's Method to Neural Networks through a Summary of Higher-Order Derivatives | Dec 6, 2023 | Second-order methods | CodeCode Available | 0 |
| A Computationally Efficient Sparsified Online Newton Method | Nov 16, 2023 | Second-order methods | CodeCode Available | 0 |