Related papers: Stochastic Optimization with Bandit Sampling

Online Variance Reduction for Stochastic Optimization

Modern stochastic optimization methods often rely on uniform sampling, which is agnostic to the underlying characteristics of the data. This might degrade convergence by yielding estimates that suffer from high variance. A possible…

Machine Learning · Statistics 2018-06-07 Zalán Borsos, Andreas Krause, Kfir Y. Levy

Greedy methods, randomization approaches and multi-arm bandit algorithms for efficient sparsity-constrained optimization

Several sparsity-constrained algorithms such as Orthogonal Matching Pursuit or the Frank-Wolfe algorithm with sparsity constraints work by iteratively selecting a novel atom to add to the current non-zero set of variables. This selection…

Machine Learning · Computer Science 2016-08-23 A Rakotomamonjy, S Koço, Liva Ralaivola

Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization

Stochastic gradient methods for machine learning and optimization problems are usually analyzed assuming data points are sampled with replacement. In practice, however, sampling without replacement is very common, easier to…

Machine Learning · Computer Science 2016-10-18 Ohad Shamir

From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits

The stochastic multi-arm bandit problem has been extensively studied under standard assumptions on the arm's distribution (e.g., bounded with known support, exponential family, etc.). These assumptions are suitable for many real-world problems…

Machine Learning · Statistics 2021-11-19 Dorian Baudry, Patrick Saux, Odalric-Ambrym Maillard

Coordinate Descent with Bandit Sampling

Coordinate descent methods usually minimize a cost function by updating a random decision variable (corresponding to one coordinate) at a time. Ideally, we would update the decision variable that yields the largest decrease in the cost…

Machine Learning · Computer Science 2018-12-05 Farnood Salehi, Patrick Thiran, L. Elisa Celis

Combinatorial Stochastic-Greedy Bandit

We propose a novel combinatorial stochastic-greedy bandit (SGB) algorithm for combinatorial multi-armed bandit problems when no extra information other than the joint reward of the selected set of $n$ arms at each time step $t\in [T]$ is…

Machine Learning · Computer Science 2023-12-14 Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini, Vaneet Aggarwal

Taming Non-stationary Bandits: A Bayesian Approach

We consider the multi-armed bandit problem in non-stationary environments. Based on the Bayesian method, we propose a variant of Thompson Sampling which can be used in both rested and restless bandit scenarios. Applying discounting to the…

Machine Learning · Statistics 2017-08-01 Vishnu Raj, Sheetal Kalyani

Adaptive Sampling Strategies for Stochastic Optimization

In this paper, we propose a stochastic optimization method that adaptively controls the sample size used in the computation of gradient approximations. Unlike other variance reduction techniques that either require additional storage or the…

Optimization and Control · Mathematics 2017-11-01 Raghu Bollapragada, Richard Byrd, Jorge Nocedal

An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization

The evaluation of hyperparameters, neural architectures, or data augmentation policies becomes a critical model selection problem in advanced deep learning with a large hyperparameter search space. In this paper, we propose an efficient and…

Machine Learning · Statistics 2020-12-17 Yimin Huang, Yujun Li, Hanrong Ye, Zhenguo Li, Zhihua Zhang

Improving Portfolio Optimization Results with Bandit Networks

In Reinforcement Learning (RL), multi-armed bandit (MAB) problems have found applications across diverse domains such as recommender systems, healthcare, and finance. Traditional MAB algorithms typically assume stationary reward…

Artificial Intelligence · Computer Science 2024-10-10 Gustavo de Freitas Fonseca, Lucas Coelho e Silva, Paulo André Lima de Castro

Importance Sampling in Expensive Finite-Sum Optimization via Contextual Bandit Methods

In computational science workflows, it is often the case that 1) objective functions for optimization involve multiple simulation outputs, and 2) those simulations can be performed (at least partially) in parallel. In this work, we…

Optimization and Control · Mathematics 2026-05-28 Matt Menickelly

An Asymptotically Optimal Strategy for Constrained Multi-armed Bandit Problems

For the stochastic multi-armed bandit (MAB) problem from a constrained model that generalizes the classical one, we show that asymptotic optimality is achievable by a simple strategy extended from the $\epsilon_t$-greedy strategy. We…

Optimization and Control · Mathematics 2018-05-04 Hyeong Soo Chang

Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite-Sum Structure

Stochastic optimization algorithms with variance reduction have proven successful for minimizing large finite sums of functions. Unfortunately, these techniques are unable to deal with stochastic perturbations of input data, induced for…

Machine Learning · Statistics 2017-11-16 Alberto Bietti, Julien Mairal

Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets

We study distributed optimization algorithms for minimizing the average of heterogeneous functions distributed across several machines, with a focus on communication efficiency. In such settings, naively using the classical stochastic…

Machine Learning · Computer Science 2020-11-18 Ilqar Ramazanli, Han Nguyen, Hai Pham, Sashank J. Reddi, Barnabas Poczos

Parallel optimized sampling for stochastic equations

Stochastic equations play an important role in computational science, due to their ability to treat a wide variety of complex statistical problems. However, current algorithms are strongly limited by their sampling variance, which scales…

Numerical Analysis · Mathematics 2017-01-04 Bogdan Opanchuk, Simon Kiesewetter, Peter D. Drummond

A Stochastic Gradient Method with Biased Estimation for Faster Nonconvex Optimization

A number of optimization approaches have been proposed for optimizing nonconvex objectives (e.g. deep learning models), such as batch gradient descent, stochastic gradient descent and stochastic variance reduced gradient descent. Theory…

Machine Learning · Computer Science 2019-05-15 Jia Bi, Steve R. Gunn

Bayesian Optimization for Categorical and Category-Specific Continuous Inputs

Many real-world functions are defined over both categorical and category-specific continuous variables and thus cannot be optimized by traditional Bayesian optimization (BO) methods. To optimize such functions, we propose a new method that…

Machine Learning · Computer Science 2019-12-02 Dang Nguyen, Sunil Gupta, Santu Rana, Alistair Shilton, Svetha Venkatesh

Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates

We provide a new understanding of the stochastic gradient bandit algorithm by showing that it converges to a globally optimal policy almost surely using any constant learning rate. This result demonstrates that the stochastic…

Machine Learning · Computer Science 2025-02-12 Jincheng Mei, Bo Dai, Alekh Agarwal, Sharan Vaswani, Anant Raj, Csaba Szepesvari, Dale Schuurmans

Bayesian Unification of Gradient and Bandit-based Learning for Accelerated Global Optimisation

Bandit-based optimisation has a remarkable advantage over gradient-based approaches due to its global perspective, which eliminates the danger of getting stuck at local optima. However, for continuous optimisation problems or problems…

Artificial Intelligence · Computer Science 2017-05-30 Ole-Christoffer Granmo

Randomized Smoothing for Stochastic Optimization

We analyze convergence rates of stochastic optimization procedures for non-smooth convex optimization problems. By combining randomized smoothing techniques with accelerated gradient methods, we obtain convergence rates of stochastic…

Optimization and Control · Mathematics 2012-04-10 John C. Duchi, Peter L. Bartlett, Martin J. Wainwright