Related papers: HAMSI: A Parallel Incremental Optimization Algorit…

Sub-Sampled Newton Methods I: Globally Convergent Algorithms

Large scale optimization problems are ubiquitous in machine learning and data analysis and there is a plethora of algorithms for solving such problems. Many of these algorithms employ sub-sampling, as a way to either speed up the…

Optimization and Control · Mathematics 2016-02-29 Farbod Roosta-Khorasani, Michael W. Mahoney

Parallel splitting method for large-scale quadratic programs

Current algorithms for large-scale industrial optimization problems typically face a trade-off: they either require exponential time to reach optimal solutions, or employ problem-specific heuristics. To overcome these limitations, we…

Quantum Physics · Physics 2025-10-16 Matteo Vandelli, Francesco Ferrari, Daniele Dragoni

Multi-Resolution Online Deterministic Annealing: A Hierarchical and Progressive Learning Architecture

Hierarchical learning algorithms that gradually approximate a solution to a data-driven optimization problem are essential to decision-making systems, especially under limitations on time and computational resources. In this study, we…

Machine Learning · Computer Science 2023-03-22 Christos Mavridis, John Baras

Massively Parallel Probabilistic Computing with Sparse Ising Machines

Inspired by the developments in quantum computing, building domain-specific classical hardware to solve computationally hard problems has received increasing attention. Here, by introducing systematic sparsification techniques, we…

Emerging Technologies · Computer Science 2022-06-03 Navid Anjum Aadit, Andrea Grimaldi, Mario Carpentieri, Luke Theogarajan, John M. Martinis, Giovanni Finocchio, Kerem Y. Camsari

A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning

We consider learning problems over training sets in which both the number of training examples and the dimension of the feature vectors are large. To solve these problems we propose the random parallel stochastic algorithm (RAPSA). We…

Machine Learning · Computer Science 2016-06-17 Aryan Mokhtari, Alec Koppel, Alejandro Ribeiro

Practical Inexact Proximal Quasi-Newton Method with Global Complexity Analysis

Recently several methods were proposed for sparse optimization which make careful use of second-order information [10, 28, 16, 3] to improve local convergence rates. These methods construct a composite quadratic approximation using Hessian…

Machine Learning · Computer Science 2015-07-15 Katya Scheinberg, Xiaocheng Tang

Stochastic halfspace approximation method for convex optimization with nonsmooth functional constraints

In this work, we consider convex optimization problems with smooth objective function and nonsmooth functional constraints. We propose a new stochastic gradient algorithm, called Stochastic Halfspace Approximation Method (SHAM), to solve…

Optimization and Control · Mathematics 2024-12-04 Nitesh Kumar Singh, Ion Necoara

Computing High-Degree Polynomial Gradients in Memory

Specialized function gradient computing hardware could greatly improve the performance of state-of-the-art optimization algorithms, e.g., based on gradient descent or conjugate gradient methods that are at the core of control, machine…

Emerging Technologies · Computer Science 2024-01-30 T. Bhattacharya, G. H. Hutchinson, G. Pedretti, X. Sheng, J. Ignowski, T. Van Vaerenbergh, R. Beausoleil, J. P. Strachan, D. B. Strukov

Low-Order Explicit Hessian Imitation Method for Large-Scale Supervised Machine Learning

An algorithm is proposed for solving optimization problems arising in neural network training for supervised learning. The unique feature of the algorithm is the use of an auxiliary loss, in addition to the original loss employed for model…

Optimization and Control · Mathematics 2026-05-11 Yunlang Zhu, Lingjun Guo, Zahra Khatti, Xiaoyi Qu, Chia-Yuan Wu, Lara Zebiane, Frank E. Curtis

An iterative algorithm for high-dimensional linear models with both sparse and non-sparse structures

Numerous practical medical problems often involve data that possess a combination of both sparse and non-sparse structures. Traditional penalized regularization techniques, primarily designed for promoting sparsity, are inadequate to…

Methodology · Statistics 2023-11-10 Shun Yu, Yuehan Yang

On the rates of convergence of Parallelized Averaged Stochastic Gradient Algorithms

The growing interest in high-dimensional and functional data analysis has led, over the last decade, to substantial research and a considerable number of techniques. Parallelized algorithms, which consist in distributing and treating the data…

Statistics Theory · Mathematics 2017-10-24 Antoine Godichon-Baggioni, Sofiane Saadane

Closing the Computational-Query Depth Gap in Parallel Stochastic Convex Optimization

We develop a new parallel algorithm for minimizing Lipschitz, convex functions with a stochastic subgradient oracle. The total number of queries made and the query depth, i.e., the number of parallel rounds of queries, match the prior…

Optimization and Control · Mathematics 2024-06-12 Arun Jambulapati, Aaron Sidford, Kevin Tian

Optimization Methods for Joint Eigendecomposition

Joint diagonalization, the process of finding a shared set of approximate eigenvectors for a collection of matrices, arises in diverse applications such as multidimensional harmonic analysis or quantum information theory. This task is…

Optimization and Control · Mathematics 2025-02-12 Erik Troedsson, Marcus Carlsson, Herwig Wendt

A Proximal Stochastic Quasi-Newton Algorithm

In this paper, we discuss the problem of minimizing the sum of two convex functions: a smooth function plus a non-smooth function. Further, the smooth part can be expressed by the average of a large number of smooth component functions, and…

Machine Learning · Computer Science 2016-11-17 Luo Luo, Zihao Chen, Zhihua Zhang, Wu-Jun Li

An Inexact Variable Metric Proximal Point Algorithm for Generic Quasi-Newton Acceleration

We propose an inexact variable-metric proximal point algorithm to accelerate gradient-based optimization algorithms. The proposed scheme, called QNing, can notably be applied to incremental first-order methods such as the stochastic…

Machine Learning · Statistics 2019-01-30 Hongzhou Lin, Julien Mairal, Zaid Harchaoui

Engineering a Scalable High Quality Graph Partitioner

We describe an approach to parallel graph partitioning that scales to hundreds of processors and produces a high solution quality. For example, for many instances from Walshaw's benchmark collection we improve the best known partitioning.…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-04-08 Manuel Holtgrewe, Peter Sanders, Christian Schulz

Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods

We present an algorithm for minimizing a sum of functions that combines the computational efficiency of stochastic gradient descent (SGD) with the second order curvature information leveraged by quasi-Newton methods. We unify these…

Machine Learning · Computer Science 2014-12-02 Jascha Sohl-Dickstein, Ben Poole, Surya Ganguli

Parallel Stochastic Optimization Framework for Large-Scale Non-Convex Stochastic Problems

In this paper, we consider the problem of stochastic optimization, where the objective function is in terms of the expectation of a (possibly non-convex) cost function that is parametrized by a random variable. While the convergence speed…

Information Theory · Computer Science 2019-10-23 Naeimeh Omidvar, An Liu, Vincent Lau, Danny H. K. Tsang, Mohammad Reza Pakravan

A Split-Client Approach to Second-Order Optimization

Second-order optimization methods offer superior convergence rates but are often bottlenecked by the wall-clock cost of Hessian computation and factorization. In the moderate-dimensional regime where the full Hessian fits in memory,…

Optimization and Control · Mathematics 2026-05-18 El Mahdi Chayti, Martin Jaggi

Accelerated projected gradient algorithms for sparsity constrained optimization problems

We consider the projected gradient algorithm for the nonconvex best subset selection problem that minimizes a given empirical loss function under an $\ell_0$-norm constraint. Through decomposing the feasible set of the given sparsity…

Optimization and Control · Mathematics 2026-02-13 Jan Harold Alcantara, Ching-pei Lee