Related papers: An adaptive gradient method for computing generali…

An Adaptive Shifted Power Method for Computing Generalized Tensor Eigenpairs

Several tensor eigenpair definitions have been put forth in the past decade, but these can all be unified under generalized tensor eigenpair framework, introduced by Chang, Pearson, and Zhang (2009). Given mth-order, n-dimensional…

Numerical Analysis · Mathematics 2014-12-22 Tamara G. Kolda , Jackson R. Mayo

Training Generative Adversarial Networks with Adaptive Composite Gradient

The wide applications of Generative adversarial networks benefit from the successful training methods, guaranteeing that an object function converges to the local minima. Nevertheless, designing an efficient and competitive training method…

Machine Learning · Computer Science 2021-11-11 Huiqing Qi , Fang Li , Shengli Tan , Xiangyun Zhang

Accelerated Gradient Methods for Sparse Statistical Learning with Nonconvex Penalties

Nesterov's accelerated gradient (AG) is a popular technique to optimize objective functions comprising two components: a convex loss and a penalty function. While AG methods perform well for convex penalties, such as the LASSO, convergence…

Optimization and Control · Mathematics 2024-01-04 Kai Yang , Masoud Asgharian , Sahir Bhatnagar

Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks

Adaptive gradient methods, which adopt historical gradient information to automatically adjust the learning rate, despite the nice property of fast convergence, have been observed to generalize worse than stochastic gradient descent (SGD)…

Machine Learning · Computer Science 2020-06-24 Jinghui Chen , Dongruo Zhou , Yiqi Tang , Ziyan Yang , Yuan Cao , Quanquan Gu

Accelerated Gradient Methods for Nonconvex Nonlinear and Stochastic Programming

In this paper, we generalize the well-known Nesterov's accelerated gradient (AG) method, originally designed for convex smooth optimization, to solve nonconvex and possibly stochastic optimization problems. We demonstrate that by properly…

Optimization and Control · Mathematics 2013-10-15 Saeed Ghadimi , Guanghui Lan

Shifted and extrapolated power methods for tensor $\ell^p$-eigenpairs

This work is concerned with the computation of $\ell^p$-eigenvalues and eigenvectors of square tensors with $d$ modes. In the first part we propose two possible shifted variants of the popular (higher-order) power method %for the…

Numerical Analysis · Mathematics 2019-12-05 Stefano Cipolla , Michela Redivo-Zaglia , Francesco Tudisco

Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent

The problem of low-tubal-rank tensor estimation is a fundamental task with wide applications across high-dimensional signal processing, machine learning, and image science. Traditional approaches tackle such a problem by performing tensor…

Machine Learning · Computer Science 2025-12-24 Zhiyu Liu , Zhi Han , Yandong Tang , Jun Fan , Yao Wang

Adjusting the Output of Decision Transformer with Action Gradient

Decision Transformer (DT), which integrates reinforcement learning (RL) with the transformer model, introduces a novel approach to offline RL. Unlike classical algorithms that take maximizing cumulative discounted rewards as objective, DT…

Machine Learning · Computer Science 2025-10-08 Rui Lin , Yiwen Zhang , Zhicheng Peng , Minghao Lyu

Computing tensor eigenvalues via homotopy methods

We introduce the concept of mode-k generalized eigenvalues and eigenvectors of a tensor and prove some properties of such eigenpairs. In particular, we derive an upper bound for the number of equivalence classes of generalized tensor…

Numerical Analysis · Mathematics 2016-01-15 Liping Chen , Lixing Han , Liangmin Zhou

Adaptive Consensus Optimization Method for GANs

We propose a second order gradient based method with ADAM and RMSprop for the training of generative adversarial networks. The proposed method is fastest to obtain similar accuracy when compared to prominent second order methods. Unlike…

Machine Learning · Computer Science 2023-04-21 Sachin Kumar Danisetty , Santhosh Reddy Mylaram , Pawan Kumar

Global Adaptive Generative Adjustment

Many traditional signal recovery approaches can behave well basing on the penalized likelihood. However, they have to meet with the difficulty in the selection of hyperparameters or tuning parameters in the penalties. In this article, we…

Machine Learning · Statistics 2022-11-17 Bin Wang , Xiaofei Wang , Jianhua Guo

A Feasible Conjugate Gradient Method for Calculating $\mathcal B$-Eigenpairs of Symmetric Tensors

In this paper, we propose a feasible conjugate gradient (FCG) method for calculating ${\mathcal B}$-eigenpairs of a symmetric tensor ${\mathcal A}$. The method is an extension of the well-known conjugate gradient method for unconstrained…

Optimization and Control · Mathematics 2023-11-15 Jiefeng Xu , Can Li , Dong-Hui Li

Tensor Generalized Approximate Message Passing

We propose a tensor generalized approximate message passing (TeG-AMP) algorithm for low-rank tensor inference, which can be used to solve tensor completion and decomposition problems. We derive TeG-AMP algorithm as an approximation of the…

Machine Learning · Computer Science 2025-04-02 Yinchuan Li , Guangchen Lan , Xiaodong Wang

Convergence of the Generalized Alternating Projection Algorithm for Compressive Sensing

The convergence of the generalized alternating projection (GAP) algorithm is studied in this paper to solve the compressive sensing problem $\yv = \Amat \xv + \epsilonv$. By assuming that $\Amat\Amat\ts$ is invertible, we prove that GAP…

Information Theory · Computer Science 2015-09-22 Xin Yuan , Hong Jiang , Paul Wilford

Anderson acceleration of gradient methods with energy for optimization problems

Anderson acceleration (AA) as an efficient technique for speeding up the convergence of fixed-point iterations may be designed for accelerating an optimization method. We propose a novel optimization algorithm by adapting Anderson…

Optimization and Control · Mathematics 2022-11-17 Hailiang Liu , Jia-Hao He , Xuping Tian

On Distributed Adaptive Optimization with Gradient Compression

We study COMP-AMS, a distributed optimization framework based on gradient averaging and adaptive AMSGrad algorithm. Gradient compression with error feedback is applied to reduce the communication cost in the gradient transmission process.…

Machine Learning · Statistics 2022-05-12 Xiaoyun Li , Belhal Karimi , Ping Li

On the Convergence of Decentralized Adaptive Gradient Methods

Adaptive gradient methods including Adam, AdaGrad, and their variants have been very successful for training deep learning models, such as neural networks. Meanwhile, given the need for distributed computing, distributed optimization…

Machine Learning · Computer Science 2021-09-08 Xiangyi Chen , Belhal Karimi , Weijie Zhao , Ping Li

Adaptive Accelerated Gradient Converging Methods under Holderian Error Bound Condition

Recent studies have shown that proximal gradient (PG) method and accelerated gradient method (APG) with restarting can enjoy a linear convergence under a weaker condition than strong convexity, namely a quadratic growth condition (QGC).…

Optimization and Control · Mathematics 2017-05-16 Mingrui Liu , Tianbao Yang

Accelerating the Computation of Tensor $Z$-eigenvalues

Efficient solvers for tensor eigenvalue problems are important tools for the analysis of higher-order data sets. Here we introduce, analyze and demonstrate an extrapolation method to accelerate the widely used shifted symmetric higher order…

Numerical Analysis · Mathematics 2023-07-25 Sara Pollock , Rhea Shroff

HT-AWGM: A Hierarchical Tucker-Adaptive Wavelet Galerkin Method for High Dimensional Elliptic Problems

This paper is concerned with the construction, analysis and realization of a numerical method to approximate the solution of high dimensional elliptic partial differential equations. We propose a new combination of an Adaptive Wavelet…

Numerical Analysis · Mathematics 2018-05-31 Mazen Ali , Karsten Urban