Related papers: Identifying global optimality for dictionary learn…

Global Optimality in Tensor Factorization, Deep Learning, and Beyond

Techniques involving factorization are found in a wide range of applications and have enjoyed significant empirical success in many fields. However, common to a vast majority of these problems is the significant disadvantage that the…

Numerical Analysis · Computer Science 2015-06-26 Benjamin D. Haeffele , Rene Vidal

Alternating minimization for dictionary learning: Local Convergence Guarantees

We present theoretical guarantees for an alternating minimization algorithm for the dictionary learning/sparse coding problem. The dictionary learning problem is to factorize vector samples $y^{1},y^{2},\ldots, y^{n}$ into an appropriate…

Machine Learning · Statistics 2019-08-01 Niladri S. Chatterji , Peter L. Bartlett

Meta-learning based Alternating Minimization Algorithm for Non-convex Optimization

In this paper, we propose a novel solution for non-convex problems of multiple variables, especially for those typically solved by an alternating minimization (AM) strategy that splits the original optimization problem into a set of…

Machine Learning · Computer Science 2022-06-28 Jingyuan Xia , Shengxi Li , Jun-Jie Huang , Imad Jaimoukha , Deniz Gunduz

Analysis of Fast Alternating Minimization for Structured Dictionary Learning

Methods exploiting sparsity have been popular in imaging and signal processing applications including compression, denoising, and imaging inverse problems. Data-driven approaches such as dictionary learning and transform learning enable one…

Machine Learning · Computer Science 2018-02-05 Saiprasad Ravishankar , Anna Ma , Deanna Needell

Global Optimality in Separable Dictionary Learning with Applications to the Analysis of Diffusion MRI

Sparse dictionary learning is a popular method for representing signals as linear combinations of a few elements from a dictionary that is learned from the data. In the classical setting, signals are represented as vectors and the…

Optimization and Control · Mathematics 2019-09-20 Evan Schwab , Benjamin D. Haeffele , René Vidal , Nicolas Charon

Inertial Proximal Deep Learning Alternating Minimization for Efficient Neutral Network Training

In recent years, the Deep Learning Alternating Minimization (DLAM), which is actually the alternating minimization applied to the penalty form of the deep neutral networks training, has been developed as an alternative algorithm to overcome…

Machine Learning · Computer Science 2021-02-02 Linbo Qiao , Tao Sun , Hengyue Pan , Dongsheng Li

Learning Sparsely Used Overcomplete Dictionaries via Alternating Minimization

We consider the problem of sparse coding, where each sample consists of a sparse linear combination of a set of dictionary atoms, and the task is to learn both the dictionary elements and the mixing coefficients. Alternating minimization is…

Machine Learning · Computer Science 2014-07-30 Alekh Agarwal , Animashree Anandkumar , Prateek Jain , Praneeth Netrapalli

ADMM for Efficient Deep Learning with Global Convergence

Alternating Direction Method of Multipliers (ADMM) has been used successfully in many conventional machine learning applications and is considered to be a useful alternative to Stochastic Gradient Descent (SGD) as a deep learning optimizer.…

Optimization and Control · Mathematics 2021-07-07 Junxiang Wang , Fuxun Yu , Xiang Chen , Liang Zhao

Simple Alternating Minimization Provably Solves Complete Dictionary Learning

This paper focuses on the noiseless complete dictionary learning problem, where the goal is to represent a set of given signals as linear combinations of a small number of atoms from a learned dictionary. There are two main challenges faced…

Machine Learning · Computer Science 2025-03-06 Geyu Liang , Gavin Zhang , Salar Fattahi , Richard Y. Zhang

DQM: Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

This paper considers decentralized consensus optimization problems where nodes of a network have access to different summands of a global objective function. Nodes cooperate to minimize the global objective by exchanging information with…

Optimization and Control · Mathematics 2016-09-21 Aryan Mokhtari , Wei Shi , Qing Ling , Alejandro Ribeiro

Distributed Global Optimization by Annealing

The paper considers a distributed algorithm for global minimization of a nonconvex function. The algorithm is a first-order consensus + innovations type algorithm that incorporates decaying additive Gaussian noise for annealing, converging…

Optimization and Control · Mathematics 2019-07-23 Brian Swenson , Soummya Kar , H. Vincent Poor , José M. F. Moura

A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization

In modern large-scale machine learning applications, the training data are often partitioned and stored on multiple machines. It is customary to employ the "data parallelism" approach, where the aggregated training loss is minimized without…

Machine Learning · Computer Science 2017-08-28 Shun Zheng , Jialei Wang , Fen Xia , Wei Xu , Tong Zhang

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization

Numerous empirical evidences have corroborated the importance of noise in nonconvex optimization problems. The theory behind such empirical observations, however, is still largely unknown. This paper studies this fundamental problem through…

Machine Learning · Computer Science 2021-02-25 Tianyi Liu , Yan Li , Song Wei , Enlu Zhou , Tuo Zhao

Optimization with delay-induced bifurcations

Optimization is finding the best solution, which mathematically amounts to locating the global minimum of some cost function. Optimization is traditionally automated with digital or quantum computers, each having their limitations and none…

Statistical Mechanics · Physics 2021-11-16 Natalia B. Janson , Christopher J. Marsden

Finding Global Optima in Nonconvex Stochastic Semidefinite Optimization with Variance Reduction

There is a recent surge of interest in nonconvex reformulations via low-rank factorization for stochastic convex semidefinite optimization problem in the purpose of efficiency and scalability. Compared with the original convex formulations,…

Optimization and Control · Mathematics 2018-02-27 Jinshan Zeng , Ke Ma , Yuan Yao

Evolution of Optimization Algorithms for Global Placement via Large Language Models

Optimization algorithms are widely employed to tackle complex problems, but designing them manually is often labor-intensive and requires significant expertise. Global placement is a fundamental step in electronic design automation (EDA).…

Neural and Evolutionary Computing · Computer Science 2025-04-28 Xufeng Yao , Jiaxi Jiang , Yuxuan Zhao , Peiyu Liao , Yibo Lin , Bei Yu

An Efficient Approximate Method for Online Convolutional Dictionary Learning

Most existing convolutional dictionary learning (CDL) algorithms are based on batch learning, where the dictionary filters and the convolutional sparse representations are optimized in an alternating manner using a training dataset. When…

Computer Vision and Pattern Recognition · Computer Science 2023-01-26 Farshad G. Veshki , Sergiy A. Vorobyov

LLM as a Complementary Optimizer to Gradient Descent: A Case Study in Prompt Tuning

Mastering a skill generally relies on both hands-on experience from doers and insightful, high-level guidance by mentors. Will this strategy also work well for solving complex non-convex optimization problems? Here, a common gradient-based…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Zixian Guo , Ming Liu , Zhilong Ji , Jinfeng Bai , Yiwen Guo , Wangmeng Zuo

Fast and Provable ADMM for Learning with Generative Priors

In this work, we propose a (linearized) Alternating Direction Method-of-Multipliers (ADMM) algorithm for minimizing a convex function subject to a nonconvex constraint. We focus on the special case where such constraint arises from the…

Machine Learning · Computer Science 2019-07-09 Fabian Latorre Gómez , Armin Eftekhari , Volkan Cevher

Global Optimality in Distributed Low-rank Matrix Factorization

We study the convergence of a variant of distributed gradient descent (DGD) on a distributed low-rank matrix approximation problem wherein some optimization variables are used for consensus (as in classical DGD) and some optimization…

Optimization and Control · Mathematics 2018-12-27 Zhihui Zhu , Qiuwei Li , Xinshuo Yang , Gongguo Tang , Michael B. Wakin