SOTAVerified

Stochastic Optimization

Stochastic Optimization is the task of optimizing certain objective functional by generating and using stochastic random variables. Usually the Stochastic Optimization is an iterative process of generating random variables that progressively finds out the minima or the maxima of the objective functional. Stochastic Optimization is usually applied in the non-convex functional spaces where the usual deterministic optimization such as linear or quadratic programming or their variants cannot be used.

Source: ASOC: An Adaptive Parameter-free Stochastic Optimization Techinique for Continuous Variables

Papers

Showing 13511387 of 1387 papers

TitleStatusHype
Natural Gradient Hybrid Variational Inference with Application to Deep Mixed ModelsCode0
Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage PlanningCode0
Clipping Improves Adam-Norm and AdaGrad-Norm when the Noise Is Heavy-TailedCode0
Adversarial Subspace Generation for Outlier Detection in High-Dimensional DataCode0
Near-Optimal Algorithms for Making the Gradient Small in Stochastic Minimax OptimizationCode0
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language ModelCode0
Towards Faster Decentralized Stochastic Optimization with Communication CompressionCode0
High Probability Complexity Bounds for Non-Smooth Stochastic Optimization with Heavy-Tailed NoiseCode0
Stochastic Optimization of Sorting Networks via Continuous RelaxationsCode0
Binary Search and First Order Gradient Based Method for Stochastic OptimizationCode0
Inexact bilevel stochastic gradient methods for constrained and unconstrained lower-level problemsCode0
Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networksCode0
Stochastic Gradient Descent with Biased but Consistent Gradient EstimatorsCode0
Graph Pattern Mining and Learning through User-defined Relations (Extended Version)Code0
Neural Characteristic Activation Analysis and Geometric Parameterization for ReLU NetworksCode0
Greedy Step Averaging: A parameter-free stochastic optimization methodCode0
Practical Precoding via Asynchronous Stochastic Successive Convex ApproximationCode0
Byzantine-Robust Decentralized Stochastic Optimization over Static and Time-Varying NetworksCode0
Zeroth-Order Stochastic Variance Reduction for Nonconvex OptimizationCode0
Stochastic Optimization under Distributional DriftCode0
Neural Operator Variational Inference based on Regularized Stein Discrepancy for Deep Gaussian ProcessesCode0
AmCLR: Unified Augmented Learning for Cross-Modal RepresentationsCode0
A Kronecker-factored approximate Fisher matrix for convolution layersCode0
Balancing Forecast Accuracy and Switching Costs in Online Optimization of Energy Management SystemsCode0
Noise Stability Optimization for Finding Flat Minima: A Hessian-based Regularization ApproachCode0
Predictor-corrector algorithms for stochastic optimization under gradual distribution shiftCode0
Adaptive Consensus Gradients Aggregation for Scaled Distributed TrainingCode0
Non-asymptotic convergence analysis of the stochastic gradient Hamiltonian Monte Carlo algorithm with discontinuous stochastic gradient with applications to training of ReLU neural networksCode0
Non-asymptotic estimates for TUSLA algorithm for non-convex learning with applications to neural networks with ReLU activation functionCode0
Joint control variate for faster black-box variational inferenceCode0
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-ScaleCode0
Stochastic optimization with arbitrary recurrent data samplingCode0
Prior Specification for Bayesian Matrix Factorization via Prior Predictive MatchingCode0
Universal Boosting Variational InferenceCode0
A Kernel Mean Embedding Approach to Reducing Conservativeness in Stochastic Programming and ControlCode0
Non-cooperative Aerial Base Station Placement via Stochastic OptimizationCode0
SpectralNet: Spectral Clustering using Deep Neural NetworksCode0
Show:102550
← PrevPage 28 of 28Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AvaGradAccuracy81.24Unverified
2AdaShiftAccuracy81.12Unverified
3Adam (eps-adjusted)Accuracy81.04Unverified
4SGDAccuracy80.95Unverified
5AdamWAccuracy79.87Unverified
6AdaBoundAccuracy77.24Unverified
#ModelMetricClaimedVerifiedStatus
1Adam (eps-adjusted)Accuracy96.36Unverified
2AvaGradAccuracy96.2Unverified
3SGDAccuracy96.14Unverified
4AdaShiftAccuracy95.92Unverified
5AdamWAccuracy95.89Unverified
6AdaBoundAccuracy94.6Unverified
#ModelMetricClaimedVerifiedStatus
1SGD - cosine LR scheduleAccuracy95.55Unverified
2LookaheadAccuracy95.27Unverified
3SGDAccuracy95.23Unverified
4ADAMAccuracy94.84Unverified
#ModelMetricClaimedVerifiedStatus
1AvaGradTop 1 Accuracy76.51Unverified
2SGDTop 1 Accuracy75.99Unverified
3AdamWTop 1 Accuracy72.9Unverified
4AdaBoundTop 1 Accuracy72.01Unverified
#ModelMetricClaimedVerifiedStatus
1AdaBoundBit per Character (BPC)2.86Unverified
2AdaShiftBit per Character (BPC)1.27Unverified
3AdamWBit per Character (BPC)1.23Unverified
4AvaGradBit per Character (BPC)1.18Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet18Accuracy (max)86.85Unverified
2Resnet34Accuracy (max)86.14Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet18Accuracy (max)58.48Unverified
2Resnet34Accuracy (max)54.5Unverified
#ModelMetricClaimedVerifiedStatus
1SGDTop 5 Accuracy92.15Unverified
2LookaheadTop 1 Accuracy75.13Unverified
#ModelMetricClaimedVerifiedStatus
1LookaheadTop 1 Accuracy75.49Unverified
2SGDTop 1 Accuracy75.15Unverified
#ModelMetricClaimedVerifiedStatus
1BertAccuracy (max)93.99Unverified
#ModelMetricClaimedVerifiedStatus
1BertAccuracy (max)86.34Unverified
#ModelMetricClaimedVerifiedStatus
1MLPNLL0.05Unverified