Related papers: Distributed Block-diagonal Approximation Methods f…

Distributed Empirical Risk Minimization With Differential Privacy

This work studies the distributed empirical risk minimization (ERM) problem under differential privacy (DP) constraint. Standard distributed algorithms achieve DP typically by perturbing all local subgradients with noise, leading to…

Optimization and Control · Mathematics 2023-07-04 Changxin Liu, Karl H. Johansson, Yang Shi

A Distributed Quasi-Newton Algorithm for Primal and Dual Regularized Empirical Risk Minimization

We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving empirical risk minimization (ERM) problems with a nonsmooth regularization term. Our algorithm is applicable…

Machine Learning · Computer Science 2019-12-16 Ching-pei Lee, Cong Han Lim, Stephen J. Wright

A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization

In modern large-scale machine learning applications, the training data are often partitioned and stored on multiple machines. It is customary to employ the "data parallelism" approach, where the aggregated training loss is minimized without…

Machine Learning · Computer Science 2017-08-28 Shun Zheng, Jialei Wang, Fen Xia, Wei Xu, Tong Zhang

Computational Complexity of Sub-Linear Convergent Algorithms

Optimizing machine learning algorithms that are used to solve the objective function has been of great interest. Several approaches to optimize common algorithms, such as gradient descent and stochastic gradient descent, were explored. One…

Machine Learning · Computer Science 2022-10-06 Hilal AlQuabeh, Farha AlBreiki, Dilshod Azizov

Hedging Complexity in Generalization via a Parametric Distributionally Robust Optimization Framework

Empirical risk minimization (ERM) and distributionally robust optimization (DRO) are popular approaches for solving stochastic optimization problems that appear in operations management and machine learning. Existing generalization error…

Optimization and Control · Mathematics 2023-09-26 Garud Iyengar, Henry Lam, Tianyu Wang

A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization

We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving ERM problems with a nonsmooth regularization term. Current second-order and quasi-Newton methods for this…

Optimization and Control · Mathematics 2018-05-29 Ching-pei Lee, Cong Han Lim, Stephen J. Wright

Meta-Learned Invariant Risk Minimization

Empirical Risk Minimization (ERM) based machine learning algorithms have suffered from weak generalization performance on data obtained from out-of-distribution (OOD). To address this problem, Invariant Risk Minimization (IRM) objective was…

Machine Learning · Computer Science 2021-03-25 Jun-Hyun Bae, Inchul Choi, Minho Lee

Distributed Personalized Empirical Risk Minimization

This paper advocates a new paradigm Personalized Empirical Risk Minimization (PERM) to facilitate learning from heterogeneous data sources without imposing stringent constraints on computational resources shared by participating devices. In…

Machine Learning · Computer Science 2023-10-30 Yuyang Deng, Mohammad Mahdi Kamani, Pouria Mahdavinia, Mehrdad Mahdavi

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a…

Optimization and Control · Mathematics 2024-01-04 Daokuan Zhu, Tianqi Xu, Jie Lu

Frustratingly Easy Model Generalization by Dummy Risk Minimization

Empirical risk minimization (ERM) is a fundamental machine learning paradigm. However, its generalization ability is limited in various tasks. In this paper, we devise Dummy Risk Minimization (DuRM), a frustratingly easy and general…

Machine Learning · Computer Science 2023-10-10 Juncheng Wang, Jindong Wang, Xixu Hu, Shujun Wang, Xing Xie

Randomized Block-Diagonal Preconditioning for Parallel Learning

We study preconditioned gradient-based optimization methods where the preconditioning matrix has block-diagonal form. Such a structural constraint comes with the advantage that the update computation is block-separable and can be…

Machine Learning · Computer Science 2020-12-08 Celestine Mendler-Dünner, Aurelien Lucchi

Accelerated Doubly Stochastic Gradient Algorithm for Large-scale Empirical Risk Minimization

Nowadays, algorithms with fast convergence, small memory footprints, and low per-iteration complexity are particularly favorable for artificial intelligence applications. In this paper, we propose a doubly stochastic algorithm with a novel…

Machine Learning · Computer Science 2023-04-25 Zebang Shen, Hui Qian, Tongzhou Mu, Chao Zhang

Distributed Attack-Robust Submodular Maximization for Multi-Robot Planning

In this paper, we design algorithms to protect swarm-robotics applications against sensor denial-of-service (DoS) attacks on robots. We focus on applications requiring the robots to jointly select actions, e.g., which trajectory to follow,…

Robotics · Computer Science 2022-03-21 Lifeng Zhou, Vasileios Tzoumas, George J. Pappas, Pratap Tokekar

Max-Diversity Distributed Learning: Theory and Algorithms

We study the risk performance of distributed learning for the regularization empirical risk minimization with fast convergence rate, substantially improving the error analysis of the existing divide-and-conquer based distributed learning.…

Machine Learning · Computer Science 2019-01-21 Yong Liu, Jian Li, Weiping Wang

Federated Empirical Risk Minimization via Second-Order Method

Many convex optimization problems with important applications in machine learning are formulated as empirical risk minimization (ERM). There are several examples: linear and logistic regression, LASSO, kernel regression, quantile…

Machine Learning · Computer Science 2023-05-30 Song Bian, Zhao Song, Junze Yin

Data optimization for large batch distributed training of deep neural networks

Distributed training in deep learning (DL) is common practice as data and models grow. The current practice for distributed training of deep neural networks faces the challenges of communication bottlenecks when operating at scale, and…

Machine Learning · Computer Science 2020-12-21 Shubhankar Gahlot, Junqi Yin, Mallikarjun Shankar

Diametrical Risk Minimization: Theory and Computations

The theoretical and empirical performance of Empirical Risk Minimization (ERM) often suffers when loss functions are poorly behaved with large Lipschitz moduli and spurious sharp minimizers. We propose and analyze a counterpart to ERM…

Optimization and Control · Mathematics 2021-07-08 Matthew Norton, Johannes O. Royset

Doubly Accelerated Stochastic Variance Reduced Dual Averaging Method for Regularized Empirical Risk Minimization

In this paper, we develop a new accelerated stochastic gradient method for efficiently solving the convex regularized empirical risk minimization problem in mini-batch settings. The use of mini-batches is becoming a golden standard in the…

Optimization and Control · Mathematics 2017-09-20 Tomoya Murata, Taiji Suzuki

Distributed Stochastic Optimization of the Regularized Risk

Many machine learning algorithms minimize a regularized risk, and stochastic optimization is widely used for this task. When working with massive data, it is desirable to perform stochastic optimization in parallel. Unfortunately, many…

Machine Learning · Statistics 2023-11-27 Shin Matsushima, Hyokun Yun, Xinhua Zhang, S. V. N. Vishwanathan

A Constrained Optimization Perspective of Unrolled Transformers

We introduce a constrained optimization framework for training transformers that behave like optimization descent algorithms. Specifically, we enforce layerwise descent constraints on the objective function and replace standard empirical…

Machine Learning · Computer Science 2026-01-27 Javier Porras-Valenzuela, Samar Hadou, Alejandro Ribeiro