Related papers: Adaptive system optimization using random directio…

Random directions stochastic approximation with deterministic perturbations

We introduce deterministic perturbation schemes for the recently proposed random directions stochastic approximation (RDSA) [17], and propose new first-order and second-order algorithms. In the latter case, these are the first second-order…

Optimization and Control · Mathematics 2019-03-29 Prashanth L A , Shalabh Bhatnagar , Nirav Bhavsar , Michael Fu , Steven I. Marcus

Generalized Random Direction Newton Algorithms for Stochastic Optimization

We present a family of generalized Hessian estimators of the objective using random direction stochastic approximation (RDSA) by utilizing only noisy function measurements. The form of each estimator and the order of the bias depend on the…

Machine Learning · Computer Science 2026-02-24 Soumen Pachal , Prashanth L. A. , Shalabh Bhatnagar , Avinash Achar

Sub-Sampled Newton Methods I: Globally Convergent Algorithms

Large scale optimization problems are ubiquitous in machine learning and data analysis and there is a plethora of algorithms for solving such problems. Many of these algorithms employ sub-sampling, as a way to either speed up the…

Optimization and Control · Mathematics 2016-02-29 Farbod Roosta-Khorasani , Michael W. Mahoney

Stochastic quasi-Newton with line-search regularization

In this paper we present a novel quasi-Newton algorithm for use in stochastic optimisation. Quasi-Newton methods have had an enormous impact on deterministic optimisation problems because they afford rapid convergence and computationally…

Systems and Control · Electrical Eng. & Systems 2019-09-04 Adrian Wills , Thomas Schön

Theoretical Study and Comparison of SPSA and RDSA Algorithms

Stochastic approximation (SA) algorithms are widely used in system optimization problems when only noisy measurements of the system are available. This paper studies two types of SA algorithms in a multivariate Kiefer-Wolfowitz setting:…

Optimization and Control · Mathematics 2021-07-28 Yiwen Chen

Stochastic Analysis of an Adaptive Cubic Regularisation Method under Inexact Gradient Evaluations and Dynamic Hessian Accuracy

We here adapt an extended version of the adaptive cubic regularisation method with dynamic inexact Hessian information for nonconvex optimisation in [3] to the stochastic optimisation setting. While exact function evaluations are still…

Numerical Analysis · Mathematics 2020-09-15 Stefania Bellavia , Gianmarco Gurioli

Optimization with Discrete Simultaneous Perturbation Stochastic Approximation Using Noisy Loss Function Measurements

Discrete stochastic optimization considers the problem of minimizing (or maximizing) loss functions defined on discrete sets, where only noisy measurements of the loss functions are available. The discrete stochastic optimization problem is…

Optimization and Control · Mathematics 2013-11-04 Qi Wang

Stochastic Variance-Reduced Cubic Regularized Newton Method

We propose a stochastic variance-reduced cubic regularized Newton method for non-convex optimization. At the core of our algorithm is a novel semi-stochastic gradient along with a semi-stochastic Hessian, which are specifically designed for…

Machine Learning · Computer Science 2018-02-14 Dongruo Zhou , Pan Xu , Quanquan Gu

Robust First and Second-Order Differentiation for Regularized Optimal Transport

Applications such as unbalanced and fully shuffled regression can be approached by optimizing regularized optimal transport (OT) distances, such as the entropic OT and Sinkhorn distances. A common approach for this optimization is to use a…

Numerical Analysis · Mathematics 2024-10-22 Xingjie Li , Fei Lu , Molei Tao , Felix X. -F. Ye

Adaptive Regularized Newton Method with Inexact Hessian

Newton's method is the most widespread high-order method, demanding the gradient and the Hessian of the objective function. However, one of the main disadvantages of Newtons method is its lack of global convergence and high iteration cost.…

Optimization and Control · Mathematics 2025-12-10 Aleksandr Shestakov , Nail Bashirov , Andrei Semenov , Alexander Gasnikov , Martin Takáč , Aleksandr Beznosikov , Dmitry Kamzolov

On Adaptive Stochastic Optimization for Streaming Data: A Newton's Method with O(dN) Operations

Stochastic optimization methods encounter new challenges in the realm of streaming, characterized by a continuous flow of large, high-dimensional data. While first-order methods, like stochastic gradient descent, are the natural choice,…

Statistics Theory · Mathematics 2024-02-02 Antoine Godichon-Baggioni , Nicklas Werge

A Stochastic Semismooth Newton Method for Nonsmooth Nonconvex Optimization

In this work, we present a globalized stochastic semismooth Newton method for solving stochastic optimization problems involving smooth nonconvex and nonsmooth convex terms in the objective function. We assume that only noisy gradient and…

Optimization and Control · Mathematics 2018-03-12 Andre Milzarek , Xiantao Xiao , Shicong Cen , Zaiwen Wen , Michael Ulbrich

Policy Newton methods for Distortion Riskmetrics

We consider the problem of risk-sensitive control in a reinforcement learning (RL) framework. In particular, we aim to find a risk-optimal policy by maximizing the distortion riskmetric (DRM) of the discounted reward in a finite horizon…

Machine Learning · Computer Science 2025-08-12 Soumen Pachal , Mizhaan Prajit Maniyar , Prashanth L. A

Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods

We present an algorithm for minimizing a sum of functions that combines the computational efficiency of stochastic gradient descent (SGD) with the second order curvature information leveraged by quasi-Newton methods. We unify these…

Machine Learning · Computer Science 2014-12-02 Jascha Sohl-Dickstein , Ben Poole , Surya Ganguli

SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization

We propose a new algorithm for minimizing regularized empirical loss: Stochastic Dual Newton Ascent (SDNA). Our method is dual in nature: in each iteration we update a random subset of the dual variables. However, unlike existing methods…

Machine Learning · Computer Science 2015-02-10 Zheng Qu , Peter Richtárik , Martin Takáč , Olivier Fercoq

Finite-time analysis of Multi-timescale Stochastic Optimization Algorithms

We present a finite-time analysis of two smoothed functional stochastic approximation algorithms for simulation-based optimization. The first is a two time-scale gradient-based method, while the second is a three time-scale Newton-based…

Machine Learning · Computer Science 2026-04-01 Kaustubh Kartikey , Shalabh Bhatnagar

A stochastic Gauss-Newton algorithm for regularized semi-discrete optimal transport

We introduce a new second order stochastic algorithm to estimate the entropically regularized optimal transport cost between two probability measures. The source measure can be arbitrary chosen, either absolutely continuous or discrete,…

Statistics Theory · Mathematics 2022-03-03 Bernard Bercu , Jérémie Bigot , Sébastien Gadat , Emilia Siviero

Adaptive Stochastic Variance Reduction for Subsampled Newton Method with Cubic Regularization

The cubic regularized Newton method of Nesterov and Polyak has become increasingly popular for non-convex optimization because of its capability of finding an approximate local solution with second-order guarantee. Several recent works…

Optimization and Control · Mathematics 2018-11-29 Junyu Zhang , Lin Xiao , Shuzhong Zhang

An Approximate Newton Method for Markov Decision Processes

Gradient-based algorithms are one of the methods of choice for the optimisation of Markov Decision Processes. In this article we will present a novel approximate Newton algorithm for the optimisation of such models. The algorithm has…

Optimization and Control · Mathematics 2015-08-05 Thomas Furmston , David Barber

Stochastic Newton and Cubic Newton Methods with Simple Local Linear-Quadratic Rates

We present two new remarkably simple stochastic second-order methods for minimizing the average of a very large number of sufficiently smooth and strongly convex functions. The first is a stochastic variant of Newton's method (SN), and the…

Machine Learning · Computer Science 2019-12-04 Dmitry Kovalev , Konstantin Mishchenko , Peter Richtárik