Related papers: Newton-like method with diagonal correction for di…

Distributed Newton Methods for Deep Neural Networks

Deep learning involves a difficult non-convex optimization problem with a large number of weights between any two adjacent layers of a deep structure. To handle large data sets or complicated networks, distributed training is needed, but…

Machine Learning · Statistics 2018-02-02 Chien-Chih Wang , Kent Loong Tan , Chun-Ting Chen , Yu-Hsiang Lin , S. Sathiya Keerthi , Dhruv Mahajan , S. Sundararajan , Chih-Jen Lin

Network Newton-Part II: Convergence Rate and Implementation

The use of network Newton methods for the decentralized optimization of a sum cost distributed through agents of a network is considered. Network Newton methods reinterpret distributed gradient descent as a penalty method, observe that the…

Optimization and Control · Mathematics 2015-04-24 Aryan Mokhtari , Qing Ling , Alejandro Ribeiro

Distributed Quasi-Newton Method for Multi-Agent Optimization

We present a distributed quasi-Newton (DQN) method, which enables a group of agents to compute an optimal solution of a separable multi-agent optimization problem locally using an approximation of the curvature of the aggregate objective…

Optimization and Control · Mathematics 2024-09-30 Ola Shorinwa , Mac Schwager

A distributed semismooth Newton based augmented Lagrangian method for distributed optimization

This paper proposes a novel distributed semismooth Newton based augmented Lagrangian method for solving a class of optimization problems over networks, where the global objective is defined as the sum of locally held cost functions, and…

Optimization and Control · Mathematics 2026-03-02 Qihao Ma , Chengjing Wang , Peipei Tang , Dunbiao Niu , Aimin Xu

Quasi-Newton Updating for Large-Scale Distributed Learning

Distributed computing is critically important for modern statistical analysis. Herein, we develop a distributed quasi-Newton (DQN) framework with excellent statistical, computation, and communication efficiency. In the DQN method, no…

Machine Learning · Computer Science 2023-06-13 Shuyuan Wu , Danyang Huang , Hansheng Wang

A Distributed Newton Method for Large Scale Consensus Optimization

In this paper, we propose a distributed Newton method for consensus optimization. Our approach outperforms state-of-the-art methods, including ADMM. The key idea is to exploit the sparsity of the dual Hessian and recast the computation of…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-06-22 Rasul Tutunov , Haitham Bou Ammar , Ali Jadbabaie

Accelerated Dual Descent for Network Optimization

Dual descent methods are commonly used to solve network optimization problems because their implementation can be distributed through the network. However, their convergence rates are typically very slow. This paper introduces a family of…

Optimization and Control · Mathematics 2011-04-07 M. Zargham , A. Ribeiro , A. Jadbabaie , A. Ozdaglar

Distributed Cross-Layer Optimization in Wireless Networks: A Second-Order Approach

Due to the rapidly growing scale and heterogeneity of wireless networks, the design of distributed cross-layer optimization algorithms have received significant interest from the networking research community. So far, the standard…

Networking and Internet Architecture · Computer Science 2016-11-18 Jia Liu , Cathy H. Xia , Ness B. Shroff , Hanif D. Sherali

Fast, Accurate Second Order Methods for Network Optimization

Dual descent methods are commonly used to solve network flow optimization problems, since their implementation can be distributed over the network. These algorithms, however, often exhibit slow convergence rates. Approximate Newton methods…

Optimization and Control · Mathematics 2015-03-25 Rasul Tutunov , Haitham Bou Ammar , Ali Jadbabaie

Network Newton-Part I: Algorithm and Convergence

We study the problem of minimizing a sum of convex objective functions where the components of the objective are available at different nodes of a network and nodes are allowed to only communicate with their neighbors. The use of…

Optimization and Control · Mathematics 2015-04-24 Aryan Mokhtari , Qing Ling , Alejandro Ribeiro

Network Newton

We consider minimization of a sum of convex objective functions where the components of the objective are available at different nodes of a network and nodes are allowed to only communicate with their neighbors. The use of distributed…

Optimization and Control · Mathematics 2014-12-12 Aryan Mokhtari , Qing Ling , Alejandro Ribeiro

Newton Method over Networks is Fast up to the Statistical Precision

We propose a distributed cubic regularization of the Newton method for solving (constrained) empirical risk minimization problems over a network of agents, modeled as undirected graph. The algorithm employs an inexact, preconditioned Newton…

Optimization and Control · Mathematics 2021-06-21 Amir Daneshmand , Gesualdo Scutari , Pavel Dvurechensky , Alexander Gasnikov

Adaptive Regularized Newton Method with Inexact Hessian

Newton's method is the most widespread high-order method, demanding the gradient and the Hessian of the objective function. However, one of the main disadvantages of Newtons method is its lack of global convergence and high iteration cost.…

Optimization and Control · Mathematics 2025-12-10 Aleksandr Shestakov , Nail Bashirov , Andrei Semenov , Alexander Gasnikov , Martin Takáč , Aleksandr Beznosikov , Dmitry Kamzolov

Distributed Adaptive Greedy Quasi-Newton Methods with Explicit Non-asymptotic Convergence Bounds

Though quasi-Newton methods have been extensively studied in the literature, they either suffer from local convergence or use a series of line searches for global convergence which is not acceptable in the distributed setting. In this work,…

Optimization and Control · Mathematics 2023-12-01 Yubo Du , Keyou You

A Distributed Newton Approach for Joint Multi-Hop Routing and Flow Control: Theory and Algorithm

The fast growing scale and heterogeneity of current communication networks necessitate the design of distributed cross-layer optimization algorithms. So far, the standard approach of distributed cross-layer design is based on dual…

Networking and Internet Architecture · Computer Science 2011-08-11 Jia Liu , Hanif D. Sherali

Distributed Newton-like Algorithms and Learning for Optimized Power Dispatch

This thesis explores a particular class of distributed optimization methods for various separable resource allocation problems, which are of high interest in a wide array of multi-agent settings. A distinctly motivating application for this…

Systems and Control · Electrical Eng. & Systems 2021-03-26 Tor Anderson

Accelerating SGD for Distributed Deep-Learning Using Approximated Hessian Matrix

We introduce a novel method to compute a rank $m$ approximation of the inverse of the Hessian matrix in the distributed regime. By leveraging the differences in gradients and parameters of multiple Workers, we are able to efficiently…

Machine Learning · Computer Science 2017-09-18 Sébastien M. R. Arnold , Chunming Wang

Distributed second order methods with increasing number of working nodes

Recently, an idling mechanism has been introduced in the context of distributed \emph{first order} methods for minimization of a sum of nodes' local convex costs over a generic, connected network. With the idling mechanism, each node $i$,…

Information Theory · Computer Science 2018-09-21 Natasa Krklec Jerinkic , Dusan Jakovetic , Natasa Krejic , Dragana Bajovic

A Distributed Cubic-Regularized Newton Method for Smooth Convex Optimization over Networks

We propose a distributed, cubic-regularized Newton method for large-scale convex optimization over networks. The proposed method requires only local computations and communications and is suitable for federated learning applications over…

Optimization and Control · Mathematics 2020-07-08 César A. Uribe , Ali Jadbabaie

Practical Newton-Type Distributed Learning using Gradient Based Approximations

We study distributed algorithms for expected loss minimization where the datasets are large and have to be stored on different machines. Often we deal with minimizing the average of a set of convex functions where each function is the…

Machine Learning · Computer Science 2019-07-24 Samira Sheikhi