Related papers: Doubly Random Parallel Stochastic Methods for Larg…

A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning

We consider learning problems over training sets in which both, the number of training examples and the dimension of the feature vectors, are large. To solve these problems we propose the random parallel stochastic algorithm (RAPSA). We…

Machine Learning · Computer Science 2016-06-17 Aryan Mokhtari , Alec Koppel , Alejandro Ribeiro

Parallel optimized sampling for stochastic equations

Stochastic equations play an important role in computational science, due to their ability to treat a wide variety of complex statistical problems. However, current algorithms are strongly limited by their sampling variance, which scales…

Numerical Analysis · Mathematics 2017-01-04 Bogdan Opanchuk , Simon Kiesewetter , Peter D. Drummond

Parallel Randomized Algorithm for Chance Constrained Program

Chance constrained program is computationally intractable due to the existence of chance constraints, which are randomly disturbed and should be satisfied with a probability. This paper proposes a two-layer randomized algorithm to address…

Optimization and Control · Mathematics 2019-11-11 Xun Shen , Jiancang Zhuang , Xingguo Zhang

Parallel Stochastic Optimization Framework for Large-Scale Non-Convex Stochastic Problems

In this paper, we consider the problem of stochastic optimization, where the objective function is in terms of the expectation of a (possibly non-convex) cost function that is parametrized by a random variable. While the convergence speed…

Information Theory · Computer Science 2019-10-23 Naeimeh Omidvar , An Liu , Vincent Lau , Danny H. K. Tsang , Mohammad Reza Pakravan

Hybrid Approach to Parallel Stochastic Gradient Descent

Stochastic Gradient Descent is used for large datasets to train models to reduce the training time. On top of that data parallelism is widely used as a method to efficiently train neural networks using multiple worker nodes in parallel.…

Machine Learning · Computer Science 2024-07-02 Aakash Sudhirbhai Vora , Dhrumil Chetankumar Joshi , Aksh Kantibhai Patel

On the rates of convergence of Parallelized Averaged Stochastic Gradient Algorithms

The growing interest for high dimensional and functional data analysis led in the last decade to an important research developing a consequent amount of techniques. Parallelized algorithms, which consist in distributing and treat the data…

Statistics Theory · Mathematics 2017-10-24 Antoine Godichon-Baggioni , Sofiane Saadane

Randomized Block Proximal Methods for Distributed Stochastic Big-Data Optimization

In this paper we introduce a class of novel distributed algorithms for solving stochastic big-data convex optimization problems over directed graphs. In the addressed set-up, the dimension of the decision variable can be extremely high and…

Optimization and Control · Mathematics 2020-10-06 Francesco Farina , Giuseppe Notarstefano

Parallel algorithms and probability of large deviation for stochastic optimization problems

We consider convex stochastic optimization problems under different assumptions on the properties of available stochastic subgradient. It is known that, if the value of the objective function is available, one can obtain, in parallel,…

Optimization and Control · Mathematics 2017-01-19 Pavel Dvurechensky , Alexander Gasnikov , Anastasia Lagunovskaya

Distributed Stochastic Optimization of the Regularized Risk

Many machine learning algorithms minimize a regularized risk, and stochastic optimization is widely used for this task. When working with massive data, it is desirable to perform stochastic optimization in parallel. Unfortunately, many…

Machine Learning · Statistics 2023-11-27 Shin Matsushima , Hyokun Yun , Xinhua Zhang , S. V. N. Vishwanathan

Stochastic Training of Neural Networks via Successive Convex Approximations

This paper proposes a new family of algorithms for training neural networks (NNs). These are based on recent developments in the field of non-convex optimization, going under the general name of successive convex approximation (SCA)…

Machine Learning · Statistics 2017-06-16 Simone Scardapane , Paolo Di Lorenzo

Parallel Coordinate Descent Methods for Big Data Optimization

In this work we show that randomized (block) coordinate descent methods can be accelerated by parallelization when applied to the problem of minimizing the sum of a partially separable smooth convex function and a simple separable convex…

Optimization and Control · Mathematics 2013-11-27 Peter Richtárik , Martin Takáč

Comparison-Based Algorithms for One-Dimensional Stochastic Convex Optimization

Stochastic optimization finds a wide range of applications in operations research and management science. However, existing stochastic optimization techniques usually require the information of random samples (e.g., demands in the…

Optimization and Control · Mathematics 2019-04-18 Xi Chen , Qihang Lin , Zizhuo Wang

DSA: Decentralized Double Stochastic Averaging Gradient Algorithm

This paper considers convex optimization problems where nodes of a network have access to summands of a global objective. Each of these local objectives is further assumed to be an average of a finite set of functions. The motivation for…

Optimization and Control · Mathematics 2015-06-16 Aryan Mokhtari , Alejandro Ribeiro

Fast Parallel Algorithms for Statistical Subset Selection Problems

In this paper, we propose a new framework for designing fast parallel algorithms for fundamental statistical subset selection tasks that include feature selection and experimental design. Such tasks are known to be weakly submodular and are…

Machine Learning · Computer Science 2021-04-02 Sharon Qian , Yaron Singer

Large-Scale Stochastic Learning using GPUs

In this work we propose an accelerated stochastic learning system for very large-scale applications. Acceleration is achieved by mapping the training algorithm onto massively parallel processors: we demonstrate a parallel, asynchronous GPU…

Machine Learning · Computer Science 2017-02-24 Thomas Parnell , Celestine Dünner , Kubilay Atasu , Manolis Sifalakis , Haris Pozidis

Balancing Rates and Variance via Adaptive Batch-Size for Stochastic Optimization Problems

Stochastic gradient descent is a canonical tool for addressing stochastic optimization problems, and forms the bedrock of modern machine learning and statistics. In this work, we seek to balance the fact that attenuating step-size is…

Signal Processing · Electrical Eng. & Systems 2020-07-10 Zhan Gao , Alec Koppel , Alejandro Ribeiro

Random directions stochastic approximation with deterministic perturbations

We introduce deterministic perturbation schemes for the recently proposed random directions stochastic approximation (RDSA) [17], and propose new first-order and second-order algorithms. In the latter case, these are the first second-order…

Optimization and Control · Mathematics 2019-03-29 Prashanth L A , Shalabh Bhatnagar , Nirav Bhavsar , Michael Fu , Steven I. Marcus

Probabilistic Synchronous Parallel

Most machine learning and deep neural network algorithms rely on certain iterative algorithms to optimise their utility/cost functions, e.g. Stochastic Gradient Descent. In distributed learning, the networked nodes have to work…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-06 Liang Wang , Ben Catterall , Richard Mortier

HAMSI: A Parallel Incremental Optimization Algorithm Using Quadratic Approximations for Solving Partially Separable Problems

We propose HAMSI (Hessian Approximated Multiple Subsets Iteration), which is a provably convergent, second order incremental algorithm for solving large-scale partially separable optimization problems. The algorithm is based on a local…

Machine Learning · Statistics 2017-08-07 Kamer Kaya , Figen Öztoprak , Ş. İlker Birbil , A. Taylan Cemgil , Umut Şimşekli , Nurdan Kuru , Hazal Koptagel , M. Kaan Öztürk

Projecting "better than randomly": How to reduce the dimensionality of very large datasets in a way that outperforms random projections

For very large datasets, random projections (RP) have become the tool of choice for dimensionality reduction. This is due to the computational complexity of principal component analysis. However, the recent development of randomized…

Machine Learning · Statistics 2019-01-04 Michael Wojnowicz , Di Zhang , Glenn Chisholm , Xuan Zhao , Matt Wolff