Related papers: Accelerated Forward-Backward Optimization using De…

Accelerated Backward Forward Method for Convex Optimization

We analyze the convergence rate of an accelerated backward forward method for solving convex composite optimization problems. The method was developed by Taylor, Hendrickx and Glineur, and is different from the FISTA algorithm in its…

Optimization and Control · Mathematics 2026-04-30 Zepeng Wang , Juan Peypouquet

A Field Guide to Forward-Backward Splitting with a FASTA Implementation

Non-differentiable and constrained optimization play a key role in machine learning, signal and image processing, communications, and beyond. For high-dimensional minimization problems involving large datasets or many unknowns, the…

Numerical Analysis · Computer Science 2016-12-30 Tom Goldstein , Christoph Studer , Richard Baraniuk

Accelerating Deep Learning by Focusing on the Biggest Losers

This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward…

Machine Learning · Computer Science 2019-10-03 Angela H. Jiang , Daniel L. -K. Wong , Giulio Zhou , David G. Andersen , Jeffrey Dean , Gregory R. Ganger , Gauri Joshi , Michael Kaminksy , Michael Kozuch , Zachary C. Lipton , Padmanabhan Pillai

Reinforced Continual Learning

Most artificial intelligence models have limiting ability to solve new tasks faster, without forgetting previously acquired knowledge. The recently emerging paradigm of continual learning aims to solve this issue, in which the model learns…

Machine Learning · Computer Science 2018-06-01 Ju Xu , Zhanxing Zhu

Descent-Net: Learning Descent Directions for Constrained Optimization

Deep learning approaches, known for their ability to model complex relationships and fast execution, are increasingly being applied to solve large optimization problems. However, existing methods often face challenges in simultaneously…

Optimization and Control · Mathematics 2025-12-16 Zisheng Zhou , Dengyu Zheng , Zirui Chen , Shixiang Chen

Narrowing the Focus: Learned Optimizers for Pretrained Models

In modern deep learning, the models are learned by applying gradient updates using an optimizer, which transforms the updates based on various statistics. Optimizers are often hand-designed and tuning their hyperparameters is a big part of…

Machine Learning · Computer Science 2024-10-08 Gus Kristiansen , Mark Sandler , Andrey Zhmoginov , Nolan Miller , Anirudh Goyal , Jihwan Lee , Max Vladymyrov

Deep Learning Approximation: Zero-Shot Neural Network Speedup

Neural networks offer high-accuracy solutions to a range of problems, but are costly to run in production systems because of computational and memory requirements during a forward pass. Given a trained network, we propose a techique called…

Computer Vision and Pattern Recognition · Computer Science 2018-06-18 Michele Pratusevich

Large-Scale Deep Learning Optimizations: A Comprehensive Survey

Deep learning have achieved promising results on a wide spectrum of AI applications. Larger datasets and models consistently yield better performance. However, we generally spend longer training time on more computation and communication.…

Machine Learning · Computer Science 2021-11-03 Xiaoxin He , Fuzhao Xue , Xiaozhe Ren , Yang You

Deep Feedback Inverse Problem Solver

We present an efficient, effective, and generic approach towards solving inverse problems. The key idea is to leverage the feedback signal provided by the forward process and learn an iterative update model. Specifically, at each iteration,…

Computer Vision and Pattern Recognition · Computer Science 2021-01-20 Wei-Chiu Ma , Shenlong Wang , Jiayuan Gu , Sivabalan Manivasagam , Antonio Torralba , Raquel Urtasun

Accelerated Almost-Sure Convergence Rates for Nonconvex Stochastic Gradient Descent using Stochastic Learning Rates

Large-scale optimization problems require algorithms both effective and efficient. One such popular and proven algorithm is Stochastic Gradient Descent which uses first-order gradient information to solve these problems. This paper studies…

Optimization and Control · Mathematics 2021-11-11 Theodoros Mamalis , Dusan Stipanovic , Petros Voulgaris

Fixed-point optimization of deep neural networks with adaptive step size retraining

Fixed-point optimization of deep neural networks plays an important role in hardware based design and low-power implementations. Many deep neural networks show fairly good performance even with 2- or 3-bit precision when quantized weights…

Machine Learning · Computer Science 2017-02-28 Sungho Shin , Yoonho Boo , Wonyong Sung

Accelerated Proximal Gradient Method with Backtracking for Multiobjective Optimization

This paper proposes a new backtracking strategy based on the FISTA accelerated algorithm for multiobjective optimization problems. The strategy focuses on solving the problem of Lipschitz constant being unknown. It allows estimate parameter…

Optimization and Control · Mathematics 2024-12-31 Chengzhi Huang , Jian Chen , Liping Tang

Forward Thinking: Building and Training Neural Networks One Layer at a Time

We present a general framework for training deep neural networks without backpropagation. This substantially decreases training time and also allows for construction of deep networks with many sorts of learners, including networks whose…

Machine Learning · Statistics 2017-06-09 Chris Hettinger , Tanner Christensen , Ben Ehlert , Jeffrey Humpherys , Tyler Jarvis , Sean Wade

Decoupled Parallel Backpropagation with Convergence Guarantee

Backpropagation algorithm is indispensable for the training of feedforward neural networks. It requires propagating error gradients sequentially from the output layer all the way back to the input layer. The backward locking in…

Machine Learning · Computer Science 2018-07-24 Zhouyuan Huo , Bin Gu , Qian Yang , Heng Huang

Asynchronous Federated Optimization

Federated learning enables training on a massive number of edge devices. To improve flexibility and scalability, we propose a new asynchronous federated optimization algorithm. We prove that the proposed approach has near-linear convergence…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-12-08 Cong Xie , Sanmi Koyejo , Indranil Gupta

Accelerated Target Updates for Q-learning

This paper studies accelerations in Q-learning algorithms. We propose an accelerated target update scheme by incorporating the historical iterates of Q functions. The idea is conceptually inspired by the momentum-based accelerated methods…

Machine Learning · Computer Science 2019-05-14 Bowen Weng , Huaqing Xiong , Wei Zhang

The Concept of Forward-Forward Learning Applied to a Multi Output Perceptron

The concept of a recently proposed Forward-Forward learning algorithm for fully connected artificial neural networks is applied to a single multi output perceptron for classification. The parameters of the system are trained with respect to…

Machine Learning · Computer Science 2023-04-07 K. Fredrik Karlsson

Automatic Differentiation of Optimization Algorithms with Time-Varying Updates

Numerous Optimization Algorithms have a time-varying update rule thanks to, for instance, a changing step size, momentum parameter or, Hessian approximation. In this paper, we apply unrolled or automatic differentiation to a time-varying…

Optimization and Control · Mathematics 2024-10-28 Sheheryar Mehmood , Peter Ochs

Faster Convergence & Generalization in DNNs

Deep neural networks have gained tremendous popularity in last few years. They have been applied for the task of classification in almost every domain. Despite the success, deep networks can be incredibly slow to train for even moderate…

Machine Learning · Computer Science 2018-10-11 Gaurav Singh , John Shawe-Taylor

Proximal Quasi-Newton Methods for Regularized Convex Optimization with Linear and Accelerated Sublinear Convergence Rates

In [19], a general, inexact, efficient proximal quasi-Newton algorithm for composite optimization problems has been proposed and a sublinear global convergence rate has been established. In this paper, we analyze the convergence properties…

Numerical Analysis · Computer Science 2017-10-18 Hiva Ghanbari , Katya Scheinberg