SOTAVerified

Stochastic Optimization

Stochastic Optimization is the task of optimizing a given objective functional by generating and using random variables. It is usually an iterative process in which randomly generated candidates progressively approach a minimum or maximum of the objective functional. Stochastic Optimization is typically applied in non-convex settings, where deterministic methods such as linear or quadratic programming and their variants cannot be used.

Source: ASOC: An Adaptive Parameter-free Stochastic Optimization Techinique for Continuous Variables
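
The iterative process described above can be illustrated with a minimal sketch. It is a plain random local search, not the ASOC method from the cited paper; the Rastrigin test function, the function names, and the step size, iteration count, and seed are all assumptions chosen for illustration.

```python
import math
import random


def rastrigin(x):
    """Non-convex test function with many local minima (assumed example objective)."""
    return 10 * len(x) + sum(xi * xi - 10 * math.cos(2 * math.pi * xi) for xi in x)


def random_search(objective, x0, step=0.5, iterations=5000, seed=0):
    """Minimize `objective` by sampling Gaussian perturbations of the best point so far.

    A candidate replaces the current best only if it lowers the objective,
    so the recorded best value never increases.
    """
    rng = random.Random(seed)  # fixed seed keeps the run reproducible
    best_x = list(x0)
    best_f = objective(best_x)
    history = [best_f]  # best objective value after each iteration
    for _ in range(iterations):
        # Generate a random candidate near the current best point.
        candidate = [xi + rng.gauss(0.0, step) for xi in best_x]
        f = objective(candidate)
        if f < best_f:
            best_x, best_f = candidate, f
        history.append(best_f)
    return best_x, best_f, history


if __name__ == "__main__":
    x, fx, hist = random_search(rastrigin, x0=[3.0, -2.5])
    print(f"start value: {hist[0]:.3f}, final value: {fx:.3f}, at x = {x}")
```

Because only improving candidates are accepted, this sketch can still stall in a local minimum; methods such as simulated annealing occasionally accept worse candidates to reduce that risk.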

Papers

Showing 1001–1050 of 1387 papers

| Title | Status | Hype |
|---|---|---|
| A Walk with SGD: How SGD Explores Regions of Deep Network Loss? | | 0 |
| Rethinking learning rate schedules for stochastic optimization | | 0 |
| Accelerating first order optimization algorithms | | 0 |
| signSGD via Zeroth-Order Oracle | | 0 |
| On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics | | 0 |
| The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares | Code | 0 |
| Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | | 0 |
| On the Convergence of Adam and Beyond | Code | 0 |
| Constrained Deep Networks: Lagrangian Optimization via Log-Barrier Extensions | Code | 0 |
| Adaptive Sequential Machine Learning | | 0 |
| Online Variance Reduction with Mixtures | Code | 0 |
| An Upper Bound for Minimum True Matches in Graph Isomorphism with Simulated Annealing | | 0 |
| On the Influence of Bias-Correction on Distributed Stochastic Optimization | | 0 |
| Stochastic Optimization of Sorting Networks via Continuous Relaxations | Code | 0 |
| The importance of better models in stochastic optimization | Code | 0 |
| Traversing the noise of dynamic mini-batch sub-sampled loss functions: A visual guide | | 0 |
| Distributed stochastic optimization with gradient tracking over strongly-connected networks | | 0 |
| Inefficiency of K-FAC for Large Batch Size Training | | 0 |
| DeepOBS: A Deep Learning Optimizer Benchmark Suite | Code | 0 |
| Accelerating Minibatch Stochastic Gradient Descent using Typicality Sampling | | 0 |
| An Empirical Study of Large-Batch Stochastic Gradient Descent with Structured Covariance Noise | | 0 |
| Active Probabilistic Inference on Matrices for Pre-Conditioning in Stochastic Optimization | Code | 0 |
| Stochastic Conditional Gradient++ | | 0 |
| Quantized Frank-Wolfe: Faster Optimization, Lower Communication, and Projection Free | | 0 |
| Evolutionary Algorithms for the Chance-Constrained Knapsack Problem | | 0 |
| The Complexity of Making the Gradient Small in Stochastic Convex Optimization | | 0 |
| Extreme Tensoring for Low-Memory Preconditioning | | 0 |
| An adaptive stochastic optimization algorithm for resource allocation | | 0 |
| Progressive Focus Search for the Static and Stochastic VRPTW with both Random Customers and Reveal Times | | 0 |
| Distribution-Dependent Analysis of Gibbs-ERM Principle | | 0 |
| Stochastic Zeroth-order Discretizations of Langevin Diffusions for Bayesian Inference | | 0 |
| Uniform-in-Time Weak Error Analysis for Stochastic Gradient Descent Algorithms via Diffusion Approximation | | 0 |
| Decentralized Stochastic Optimization and Gossip Algorithms with Compressed Communication | Code | 0 |
| Multilevel Monte Carlo Variational Inference | | 0 |
| Sharp Analysis for Nonconvex SGD Escaping from Saddle Points | | 0 |
| Stochastic Frank-Wolfe for Composite Convex Minimization | Code | 0 |
| Reparameterizable Subset Sampling via Continuous Relaxations | Code | 0 |
| Personalized Treatment Selection using Causal Heterogeneity | Code | 0 |
| Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization | Code | 0 |
| Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise | | 0 |
| DADAM: A Consensus-based Distributed Adaptive Gradient Method for Online Optimization | Code | 0 |
| New nonasymptotic convergence rates of stochastic proximal point algorithm for convex optimization problems | | 0 |
| Block-Randomized Stochastic Proximal Gradient for Low-Rank Tensor Factorization | | 0 |
| Stochastic Approximation Algorithms for Principal Component Analysis | | 0 |
| SPI-Optimizer: an integral-Separated PI Controller for Stochastic Optimization | Code | 0 |
| Latent Dirichlet Allocation in Generative Adversarial Networks | | 0 |
| Stochastic Gradient Descent for Spectral Embedding with Implicit Orthogonality Constraint | | 0 |
| On stochastic gradient Langevin dynamics with dependent data streams in the logconcave case | | 0 |
| On Uncensored Mean First-Passage-Time Performance Experiments with Multiwalk in R^p: a New Stochastic Optimization Algorithm | | 0 |
| Stochastic Model Pruning via Weight Dropping Away and Back | | 0 |
Page 21 of 28

Benchmark Results

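| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AvaGrad | Accuracy | 81.24 | | Unverified |
| 2 | AdaShift | Accuracy | 81.12 | | Unverified |
| 3 | Adam (eps-adjusted) | Accuracy | 81.04 | | Unverified |
| 4 | SGD | Accuracy | 80.95 | | Unverified |
| 5 | AdamW | Accuracy | 79.87 | | Unverified |
| 6 | AdaBound | Accuracy | 77.24 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Adam (eps-adjusted) | Accuracy | 96.36 | | Unverified |
| 2 | AvaGrad | Accuracy | 96.2 | | Unverified |
| 3 | SGD | Accuracy | 96.14 | | Unverified |
| 4 | AdaShift | Accuracy | 95.92 | | Unverified |
| 5 | AdamW | Accuracy | 95.89 | | Unverified |
| 6 | AdaBound | Accuracy | 94.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SGD - cosine LR schedule | Accuracy | 95.55 | | Unverified |
| 2 | Lookahead | Accuracy | 95.27 | | Unverified |
| 3 | SGD | Accuracy | 95.23 | | Unverified |
| 4 | ADAM | Accuracy | 94.84 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AvaGrad | Top 1 Accuracy | 76.51 | | Unverified |
| 2 | SGD | Top 1 Accuracy | 75.99 | | Unverified |
| 3 | AdamW | Top 1 Accuracy | 72.9 | | Unverified |
| 4 | AdaBound | Top 1 Accuracy | 72.01 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AdaBound | Bit per Character (BPC) | 2.86 | | Unverified |
| 2 | AdaShift | Bit per Character (BPC) | 1.27 | | Unverified |
| 3 | AdamW | Bit per Character (BPC) | 1.23 | | Unverified |
| 4 | AvaGrad | Bit per Character (BPC) | 1.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Resnet18 | Accuracy (max) | 86.85 | | Unverified |
| 2 | Resnet34 | Accuracy (max) | 86.14 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Resnet18 | Accuracy (max) | 58.48 | | Unverified |
| 2 | Resnet34 | Accuracy (max) | 54.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SGD | Top 5 Accuracy | 92.15 | | Unverified |
| 2 | Lookahead | Top 1 Accuracy | 75.13 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Lookahead | Top 1 Accuracy | 75.49 | | Unverified |
| 2 | SGD | Top 1 Accuracy | 75.15 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Bert | Accuracy (max) | 93.99 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Bert | Accuracy (max) | 86.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MLP | NLL | 0.05 | | Unverified |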