Related papers: Quantization Avoids Saddle Points in Distributed O…

Distributed Optimization via Gradient Descent with Event-Triggered Zooming over Quantized Communication

In this paper, we study unconstrained, strongly convex distributed optimization problems, in which the exchange of information in the network is captured by a directed graph topology over digital channels that have limited capacity (and…

Systems and Control · Electrical Eng. & Systems 2023-09-12 Apostolos I. Rikos, Wei Jiang, Themistoklis Charalambous, Karl H. Johansson

Escaping Saddle Points in Distributed Newton's Method with Communication Efficiency and Byzantine Resilience

The problem of saddle-point avoidance for non-convex optimization is quite challenging in large scale distributed learning frameworks, such as Federated Learning, especially in the presence of Byzantine workers. The celebrated…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-12-30 Avishek Ghosh, Raj Kumar Maity, Arya Mazumdar, Kannan Ramchandran

Distributed Saddle-Point Problems: Lower Bounds, Near-Optimal and Robust Algorithms

This paper focuses on the distributed optimization of stochastic saddle point problems. The first part of the paper is devoted to lower bounds for the centralized and decentralized distributed methods for smooth (strongly) convex-(strongly)…

Machine Learning · Computer Science 2025-04-28 Aleksandr Beznosikov, Valentin Samokhin, Alexander Gasnikov

Decentralized Nonconvex Optimization with Guaranteed Privacy and Accuracy

Privacy protection and nonconvexity are two challenging problems in decentralized optimization and learning involving sensitive data. Despite some recent advances addressing each of the two problems separately, no results have been reported…

Optimization and Control · Mathematics 2022-12-16 Yongqiang Wang, Tamer Basar

Review Non-convex Optimization Method for Machine Learning

Non-convex optimization is a critical tool in advancing machine learning, especially for complex models like deep neural networks and support vector machines. Despite challenges such as multiple local minima and saddle points, non-convex…

Machine Learning · Computer Science 2024-10-04 Greg B Fotopoulos, Paul Popovich, Nicholas Hall Papadopoulos

A Regularized Saddle-Point Algorithm for Networked Optimization with Resource Allocation Constraints

We propose a regularized saddle-point algorithm for convex networked optimization problems with resource allocation constraints. Standard distributed gradient methods suffer from slow convergence and require excessive communication when…

Systems and Control · Computer Science 2012-08-16 Andrea Simonetto, Tamas Keviczky, Mikael Johansson

Distributed Stochastic Approximation for Solving Network Optimization Problems Under Random Quantization

We study distributed optimization problems over a network when the communication between the nodes is constrained, and so information that is exchanged between the nodes must be quantized. This imperfect communication poses a fundamental…

Optimization and Control · Mathematics 2018-10-30 Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

Charged Point Normalization: An Efficient Solution to the Saddle Point Problem

Recently, the problem of local minima in very high dimensional non-convex optimization has been challenged and the problem of saddle points has been introduced. This paper introduces a dynamic type of normalization that forces the system to…

Machine Learning · Computer Science 2017-02-08 Armen Aghajanyan

Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks

We study non-smooth stochastic decentralized optimization problems over time-varying networks, where objective functions are distributed across nodes and network connections may intermittently appear or break. Specifically, we consider two…

Optimization and Control · Mathematics 2026-04-28 Maxim Divilkovskiy, Alexander Gasnikov

On the saddle point problem for non-convex optimization

A central challenge to many fields of science and engineering involves minimizing non-convex error functions over continuous, high dimensional spaces. Gradient descent or quasi-Newton methods are almost ubiquitously used to perform such…

Machine Learning · Computer Science 2014-05-29 Razvan Pascanu, Yann N. Dauphin, Surya Ganguli, Yoshua Bengio

Quantization enabled Privacy Protection in Decentralized Stochastic Optimization

By enabling multiple agents to cooperatively solve a global optimization problem in the absence of a central coordinator, decentralized stochastic optimization is gaining increasing attention in areas as diverse as machine learning,…

Optimization and Control · Mathematics 2022-08-10 Yongqiang Wang, Tamer Basar

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

A central challenge to many fields of science and engineering involves minimizing non-convex error functions over continuous, high dimensional spaces. Gradient descent or quasi-Newton methods are almost ubiquitously used to perform such…

Machine Learning · Computer Science 2014-06-11 Yann Dauphin, Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, Surya Ganguli, Yoshua Bengio

Saddle-free Hessian-free Optimization

Nonconvex optimization problems such as the ones in training deep neural networks suffer from a phenomenon called saddle point proliferation. This means that there are a vast number of high error saddle points present in the loss function.…

Numerical Analysis · Computer Science 2016-11-08 Martin Arjovsky

Distributed Learning in Non-Convex Environments -- Part II: Polynomial Escape from Saddle-Points

The diffusion strategy for distributed learning from streaming data employs local stochastic gradient updates along with exchange of iterates over neighborhoods. In Part I [2] of this work we established that agents cluster around a network…

Multiagent Systems · Computer Science 2019-07-04 Stefan Vlaski, Ali H. Sayed

Distributed Stochastic Gradient Descent: Nonconvexity, Nonsmoothness, and Convergence to Local Minima

In centralized settings, it is well known that stochastic gradient descent (SGD) avoids saddle points and converges to local minima in nonconvex problems. However, similar guarantees are lacking for distributed first-order algorithms. The…

Optimization and Control · Mathematics 2022-03-07 Brian Swenson, Ryan Murray, H. Vincent Poor, Soummya Kar

Decentralized Distributed Optimization for Saddle Point Problems

We consider distributed convex-concave saddle point problems over arbitrary connected undirected networks and propose a decentralized distributed algorithm for their solution. The local functions distributed across the nodes are assumed to…

Optimization and Control · Mathematics 2024-04-10 Alexander Rogozin, Aleksandr Beznosikov, Darina Dvinskikh, Dmitry Kovalev, Pavel Dvurechensky, Alexander Gasnikov

Stochastic, Distributed and Federated Optimization for Machine Learning

We study optimization algorithms for the finite sum problems frequently arising in machine learning applications. First, we propose novel variants of stochastic gradient descent with a variance reduction property that enables linear…

Machine Learning · Computer Science 2017-07-06 Jakub Konečný

On Nonconvex Optimization for Machine Learning: Gradients, Stochasticity, and Saddle Points

Gradient descent (GD) and stochastic gradient descent (SGD) are the workhorses of large-scale machine learning. While classical theory focused on analyzing the performance of these methods in convex optimization problems, the most notable…

Machine Learning · Computer Science 2019-09-05 Chi Jin, Praneeth Netrapalli, Rong Ge, Sham M. Kakade, Michael I. Jordan

Escaping Saddle Points with the Successive Convex Approximation Algorithm

Optimizing non-convex functions is of primary importance in the vast majority of machine learning algorithms. Even though many gradient descent based algorithms have been studied, successive convex approximation based algorithms have been…

Optimization and Control · Mathematics 2019-03-06 Amrit Singh Bedi, Ketan Rajawat, Vaneet Aggarwal

Input-Output Performance of Linear-Quadratic Saddle-Point Algorithms with Application to Distributed Resource Allocation Problems

Saddle-point or primal-dual methods have recently attracted renewed interest as a systematic technique to design distributed algorithms which solve convex optimization problems. When implemented online for streaming data or as dynamic…

Optimization and Control · Mathematics 2021-04-22 John W. Simpson-Porco, Bala Kameshwar Poolla, Nima Monshizadeh, Florian Dorfler