SOTAVerified

Stochastic Optimization

Stochastic Optimization is the task of optimizing certain objective functional by generating and using stochastic random variables. Usually the Stochastic Optimization is an iterative process of generating random variables that progressively finds out the minima or the maxima of the objective functional. Stochastic Optimization is usually applied in the non-convex functional spaces where the usual deterministic optimization such as linear or quadratic programming or their variants cannot be used.

Source: ASOC: An Adaptive Parameter-free Stochastic Optimization Techinique for Continuous Variables

Papers

Showing 12511300 of 1387 papers

TitleStatusHype
SEBOOST - Boosting Stochastic Learning Using Subspace Optimization TechniquesCode0
Data Dependent Convergence for Distributed Stochastic Optimization0
SGDR: Stochastic Gradient Descent with Warm RestartsCode1
Learning Structured Predictors from Bandit Feedback for Interactive NLP0
Stochastic Quasi-Newton Methods for Nonconvex Stochastic Optimization0
Accelerate Stochastic Subgradient Method by Leveraging Local Growth Condition0
Convolutional Sketch InversionCode0
Distributed stochastic optimization via matrix exponential learning0
Learning Weight Uncertainty With Stochastic Gradient MCMC for Shape Classification0
Simultaneous Clustering and Model Selection for Tensor Affinities0
CYCLADES: Conflict-free Asynchronous Machine LearningCode0
Adaptive Learning Rate via Covariance Matrix Based Preconditioning for Deep Neural Networks0
Stochastic Optimization for Large-scale Optimal Transport0
Kronecker Determinantal Point ProcessesCode0
Faster Eigenvector Computation via Shift-and-Invert Preconditioning0
Effective Blind Source Separation Based on the Adam Algorithm0
NESTT: A Nonconvex Primal-Dual Splitting Method for Distributed and Stochastic Optimization0
Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural NetworksCode0
Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds0
The Information-Collecting Vehicle Routing Problem: Stochastic Optimization for Emergency Storm Response0
Reducing the Model Order of Deep Neural Networks Using Information Theory0
Barzilai-Borwein Step Size for Stochastic Gradient DescentCode0
Nonconvex Sparse Learning via Stochastic Optimization with Progressive Variance Reduction0
Distributed stochastic optimization for deep learning (thesis)0
Algorithms for stochastic optimization with functional or expectation constraints0
Efficient Globally Convergent Stochastic Optimization for Canonical Correlation Analysis0
Revisiting Distributed Synchronous SGDCode1
Katyusha: The First Direct Acceleration of Stochastic Gradient Methods0
Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization0
Scalable Metric Learning via Weighted Approximate Rank Component Analysis0
Second-Order Stochastic Optimization for Machine Learning in Linear TimeCode1
Improved Dropout for Shallow and Deep Learning0
Variance-Reduced and Projection-Free Stochastic Optimization0
A Kronecker-factored approximate Fisher matrix for convolution layersCode0
On Deep Multi-View Representation Learning: Objectives and OptimizationCode0
SCOPE: Scalable Composite Optimization for Learning on SparkCode0
Variational Inference: A Review for StatisticiansCode1
Bridging the Gap between Stochastic Gradient MCMC and Stochastic OptimizationCode0
Efficient Distributed SGD with Variance Reduction0
Variance Reduction for Distributed Stochastic Gradient Descent0
A Projection Free Method for Generalized Eigenvalue Problem With a Nonsmooth Regularizer0
Adaptive Stochastic Optimization: From Sets to Paths0
Local Smoothness in Variance Reduced Optimization0
Fast Rates for Exp-concave Empirical Risk Minimization0
Asynchronous stochastic convex optimization: the noise is in the noise and SGD don't care0
Constant Time EXPected Similarity Estimation using Stochastic Optimization0
Joint Training of Generic CNN-CRF Models with Stochastic Optimization0
Large-Scale Approximate Kernel Canonical Correlation Analysis0
Seeing the Unseen Network: Inferring Hidden Social Ties from Respondent-Driven Sampling0
Online Principal Component Analysis in High Dimension: Which Algorithm to Choose?0
Show:102550
← PrevPage 26 of 28Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AvaGradAccuracy81.24Unverified
2AdaShiftAccuracy81.12Unverified
3Adam (eps-adjusted)Accuracy81.04Unverified
4SGDAccuracy80.95Unverified
5AdamWAccuracy79.87Unverified
6AdaBoundAccuracy77.24Unverified
#ModelMetricClaimedVerifiedStatus
1Adam (eps-adjusted)Accuracy96.36Unverified
2AvaGradAccuracy96.2Unverified
3SGDAccuracy96.14Unverified
4AdaShiftAccuracy95.92Unverified
5AdamWAccuracy95.89Unverified
6AdaBoundAccuracy94.6Unverified
#ModelMetricClaimedVerifiedStatus
1SGD - cosine LR scheduleAccuracy95.55Unverified
2LookaheadAccuracy95.27Unverified
3SGDAccuracy95.23Unverified
4ADAMAccuracy94.84Unverified
#ModelMetricClaimedVerifiedStatus
1AvaGradTop 1 Accuracy76.51Unverified
2SGDTop 1 Accuracy75.99Unverified
3AdamWTop 1 Accuracy72.9Unverified
4AdaBoundTop 1 Accuracy72.01Unverified
#ModelMetricClaimedVerifiedStatus
1AdaBoundBit per Character (BPC)2.86Unverified
2AdaShiftBit per Character (BPC)1.27Unverified
3AdamWBit per Character (BPC)1.23Unverified
4AvaGradBit per Character (BPC)1.18Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet18Accuracy (max)86.85Unverified
2Resnet34Accuracy (max)86.14Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet18Accuracy (max)58.48Unverified
2Resnet34Accuracy (max)54.5Unverified
#ModelMetricClaimedVerifiedStatus
1SGDTop 5 Accuracy92.15Unverified
2LookaheadTop 1 Accuracy75.13Unverified
#ModelMetricClaimedVerifiedStatus
1LookaheadTop 1 Accuracy75.49Unverified
2SGDTop 1 Accuracy75.15Unverified
#ModelMetricClaimedVerifiedStatus
1BertAccuracy (max)93.99Unverified
#ModelMetricClaimedVerifiedStatus
1BertAccuracy (max)86.34Unverified
#ModelMetricClaimedVerifiedStatus
1MLPNLL0.05Unverified