Related papers: Algorithms for solving optimization problems arisi…

Algorithms for solving optimization problems arising from deep neural net models: smooth problems

Machine Learning models incorporating multiple layered learning networks have been seen to provide effective models for various classification problems. The resulting optimization problem to solve for the optimal vector minimizing the…

Optimization and Control · Mathematics 2018-07-03 Vyacheslav Kungurtsev , Tomas Pevny

Non-Convex Optimizations for Machine Learning with Theoretical Guarantee: Robust Matrix Completion and Neural Network Learning

Despite the recent development in machine learning, most learning systems are still under the concept of "black box", where the performance cannot be understood and derived. With the rise of safety and privacy concerns in public, designing…

Machine Learning · Computer Science 2023-06-30 Shuai Zhang

Non-convex Optimization for Machine Learning

A vast majority of machine learning algorithms train their models and perform inference by solving optimization problems. In order to capture the learning and prediction problems accurately, structural constraints such as sparsity or low…

Machine Learning · Statistics 2017-12-22 Prateek Jain , Purushottam Kar

Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks

The presence of non-convexity in smooth optimization problems arising from deep learning have sparked new smoothness conditions in the literature and corresponding convergence analyses. We discuss these smoothness conditions, order them,…

Machine Learning · Computer Science 2024-09-23 Vivak Patel , Christian Varner

Data-driven nonsmooth optimization

In this work, we consider methods for solving large-scale optimization problems with a possibly nonsmooth objective function. The key idea is to first specify a class of optimization algorithms using a generic iterative scheme involving…

Optimization and Control · Mathematics 2020-02-19 Sebastian Banert , Axel Ringh , Jonas Adler , Johan Karlsson , Ozan Öktem

Optimization Methods for Supervised Machine Learning: From Linear Models to Deep Learning

The goal of this tutorial is to introduce key models, algorithms, and open questions related to the use of optimization methods for solving problems arising in machine learning. It is written with an INFORMS audience in mind, specifically…

Machine Learning · Statistics 2017-07-03 Frank E. Curtis , Katya Scheinberg

Randomized Smoothing SVRG for Large-scale Nonsmooth Convex Optimization

In this paper, we consider the problem of minimizing the average of a large number of nonsmooth and convex functions. Such problems often arise in typical machine learning problems as empirical risk minimization, but are computationally…

Machine Learning · Statistics 2018-05-21 Wenjie Huang

Differentiable Convex Optimization Layers in Neural Architectures: Foundations and Perspectives

The integration of optimization problems within neural network architectures represents a fundamental shift from traditional approaches to handling constraints in deep learning. While it is long known that neural networks can incorporate…

Machine Learning · Computer Science 2024-12-31 Calder Katyal

Learning Hard Optimization Problems: A Data Generation Perspective

Optimization problems are ubiquitous in our societies and are present in almost every segment of the economy. Most of these optimization problems are NP-hard and computationally demanding, often requiring approximate solutions for…

Optimization and Control · Mathematics 2021-06-23 James Kotary , Ferdinando Fioretto , Pascal Van Hentenryck

Qualitatively characterizing neural network optimization problems

Training neural networks involves solving large-scale non-convex optimization problems. This task has long been believed to be extremely difficult, with fear of local minima and other obstacles motivating a variety of schemes to improve…

Neural and Evolutionary Computing · Computer Science 2015-05-25 Ian J. Goodfellow , Oriol Vinyals , Andrew M. Saxe

Generalized-Smooth Nonconvex Optimization is As Efficient As Smooth Nonconvex Optimization

Various optimal gradient-based algorithms have been developed for smooth nonconvex optimization. However, many nonconvex machine learning problems do not belong to the class of smooth functions and therefore the existing algorithms are…

Optimization and Control · Mathematics 2023-06-27 Ziyi Chen , Yi Zhou , Yingbin Liang , Zhaosong Lu

Mollifying Networks

The optimization of deep neural networks can be more challenging than traditional convex optimization problems due to the highly non-convex nature of the loss function, e.g. it can involve pathological landscapes such as saddle-surfaces…

Machine Learning · Computer Science 2016-08-18 Caglar Gulcehre , Marcin Moczulski , Francesco Visin , Yoshua Bengio

Efficient random coordinate descent algorithms for large-scale structured nonconvex optimization

In this paper we analyze several new methods for solving nonconvex optimization problems with the objective function formed as a sum of two terms: one is nonconvex and smooth, and another is convex but simple and its structure is known.…

Optimization and Control · Mathematics 2014-06-25 A. Patrascu , I. Necoara

Gradient Sampling Methods for Nonsmooth Optimization

This paper reviews the gradient sampling methodology for solving nonsmooth, nonconvex optimization problems. An intuitively straightforward gradient sampling algorithm is stated and its convergence properties are summarized. Throughout this…

Optimization and Control · Mathematics 2018-05-01 James V. Burke , Frank E. Curtis , Adrian S. Lewis , Michael L. Overton , Lucas E. A. Simões

Why Do Local Methods Solve Nonconvex Problems?

Non-convex optimization is ubiquitous in modern machine learning. Researchers devise non-convex objective functions and optimize them using off-the-shelf optimizers such as stochastic gradient descent and its variants, which leverage the…

Machine Learning · Computer Science 2021-03-26 Tengyu Ma

A Robust Algorithm for Non-IID Machine Learning Problems with Convergence Analysis

In this paper, we propose an improved numerical algorithm for solving minimax problems based on nonsmooth optimization, quadratic programming and iterative process. We also provide a rigorous proof of convergence for our algorithm under…

Artificial Intelligence · Computer Science 2025-07-02 Qing Xu , Xiaohua Xuan

Optimization in Machine Learning: A Distribution Space Approach

We present the viewpoint that optimization problems encountered in machine learning can often be interpreted as minimizing a convex functional over a function space, but with a non-convex constraint set introduced by model parameterization.…

Machine Learning · Computer Science 2020-04-21 Yongqiang Cai , Qianxiao Li , Zuowei Shen

Dual Smoothing for Decentralized Optimization

Decentralized optimization is widely used in different fields of study such as distributed learning, signal processing, and various distributed control problems. In these types of problems, nodes of the network are connected to each other…

Optimization and Control · Mathematics 2025-12-10 Alexander Rogozin , Nhat Trung Nguyen , Hamed Azami Zenuzagh , Alexander Gasnikov

A SMART Stochastic Algorithm for Nonconvex Optimization with Applications to Robust Machine Learning

In this paper, we show how to transform any optimization problem that arises from fitting a machine learning model into one that (1) detects and removes contaminated data from the training set while (2) simultaneously fitting the trimmed…

Machine Learning · Statistics 2017-02-07 Aleksandr Aravkin , Damek Davis

Extracting Optimal Solution Manifolds using Constrained Neural Optimization

Constrained Optimization solution algorithms are restricted to point based solutions. In practice, single or multiple objectives must be satisfied, wherein both the objective function and constraints can be non-convex resulting in multiple…

Neural and Evolutionary Computing · Computer Science 2021-01-05 Gurpreet Singh , Soumyajit Gupta , Matthew Lease