Related papers: Direct Nonlinear Acceleration

Nonlinear Acceleration of CNNs

The Regularized Nonlinear Acceleration (RNA) algorithm is an acceleration method capable of improving the rate of convergence of many optimization schemes such as gradient descent, SAGA or SVRG. Until now, its analysis is limited to convex…

Optimization and Control · Mathematics 2018-06-04 Damien Scieur, Edouard Oyallon, Alexandre d'Aspremont, Francis Bach

Online Regularized Nonlinear Acceleration

Regularized nonlinear acceleration (RNA) estimates the minimum of a function by post-processing iterates from an algorithm such as the gradient method. It can be seen as a regularized version of Anderson acceleration, a classical…

Optimization and Control · Mathematics 2019-06-25 Damien Scieur, Edouard Oyallon, Alexandre d'Aspremont, Francis Bach

Interpolatron: Interpolation or Extrapolation Schemes to Accelerate Optimization for Deep Neural Networks

In this paper we explore acceleration techniques for large scale nonconvex optimization problems with special focuses on deep neural networks. The extrapolation scheme is a classical approach for accelerating stochastic gradient descent for…

Machine Learning · Statistics 2018-05-18 Guangzeng Xie, Yitan Wang, Shuchang Zhou, Zhihua Zhang

Efficient and Accurate Approximations of Nonlinear Convolutional Networks

This paper aims to accelerate the test-time computation of deep convolutional neural networks (CNNs). Unlike existing methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into…

Computer Vision and Pattern Recognition · Computer Science 2014-11-18 Xiangyu Zhang, Jianhua Zou, Xiang Ming, Kaiming He, Jian Sun

DNA: Deeply-supervised Nonlinear Aggregation for Salient Object Detection

Recent progress on salient object detection mainly aims at exploiting how to effectively integrate multi-scale convolutional features in convolutional neural networks (CNNs). Many popular methods impose deep supervision to perform…

Computer Vision and Pattern Recognition · Computer Science 2021-01-21 Yun Liu, Ming-Ming Cheng, Xinyu Zhang, Guang-Yu Nie, Meng Wang

Neural Network Training Techniques Regularize Optimization Trajectory: An Empirical Study

Modern deep neural network (DNN) trainings utilize various training techniques, e.g., nonlinear activation functions, batch normalization, skip-connections, etc. Despite their effectiveness, it is still mysterious how they help accelerate…

Machine Learning · Computer Science 2024-03-05 Cheng Chen, Junjie Yang, Yi Zhou

Accelerating Very Deep Convolutional Networks for Classification and Detection

This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs that have substantially impacted the computer vision community. Unlike previous methods that are designed for…

Computer Vision and Pattern Recognition · Computer Science 2015-11-19 Xiangyu Zhang, Jianhua Zou, Kaiming He, Jian Sun

Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning

Model-free deep reinforcement learning (RL) algorithms have been widely used for a range of complex control tasks. However, slow convergence and sample inefficiency remain challenging problems in RL, especially when handling continuous and…

Machine Learning · Computer Science 2021-12-07 Wenjie Shi, Shiji Song, Hui Wu, Ya-Chu Hsu, Cheng Wu, Gao Huang

Recent Advances in Convolutional Neural Network Acceleration

In recent years, convolutional neural networks (CNNs) have shown great performance in various fields such as image classification, pattern recognition, and multi-media compression. Two of the feature properties, local connectivity and…

Machine Learning · Computer Science 2018-07-24 Qianru Zhang, Meng Zhang, Tinghuan Chen, Zhifei Sun, Yuzhe Ma, Bei Yu

Learning Regularization Parameters of Inverse Problems via Deep Neural Networks

In this work, we describe a new approach that uses deep neural networks (DNN) to obtain regularization parameters for solving inverse problems. We consider a supervised learning approach, where a network is trained to approximate the…

Numerical Analysis · Mathematics 2021-04-15 Babak Maboudi Afkham, Julianne Chung, Matthias Chung

Multidimensional extrapolated global proximal gradient and applications for image processing

The proximal gradient method is a generic technique introduced to tackle the non-smoothness in optimization problems, wherein the objective function is expressed as the sum of a differentiable convex part and a non-differentiable…

Numerical Analysis · Mathematics 2024-01-19 Abdeslem Hafid Bentbib, Khalide Jbilou, Ridwane Tahiri

Deep Learning Approximation: Zero-Shot Neural Network Speedup

Neural networks offer high-accuracy solutions to a range of problems, but are costly to run in production systems because of computational and memory requirements during a forward pass. Given a trained network, we propose a technique called…

Computer Vision and Pattern Recognition · Computer Science 2018-06-18 Michele Pratusevich

Training Deep Neural Networks via Direct Loss Minimization

Supervised training of deep neural nets typically relies on minimizing cross-entropy. However, in many domains, we are interested in performing well on metrics specific to the application. In this paper we propose a direct loss minimization…

Machine Learning · Computer Science 2016-06-03 Yang Song, Alexander G. Schwing, Richard S. Zemel, Raquel Urtasun

Delta Networks for Optimized Recurrent Network Computation

Many neural networks exhibit stability in their activation patterns over time in response to inputs from sensors operating under real-world conditions. By capitalizing on this property of natural signals, we propose a Recurrent Neural…

Neural and Evolutionary Computing · Computer Science 2016-12-19 Daniel Neil, Jun Haeng Lee, Tobi Delbruck, Shih-Chii Liu

DNA: Differentiable Network-Accelerator Co-Search

Powerful yet complex deep neural networks (DNNs) have fueled a booming demand for efficient DNN solutions to bring DNN-powered intelligence into numerous applications. Jointly optimizing the networks and their accelerators is promising in…

Machine Learning · Computer Science 2025-01-07 Yongan Zhang, Yonggan Fu, Weiwen Jiang, Chaojian Li, Haoran You, Meng Li, Vikas Chandra, Yingyan Celine Lin

Nonlinear Acceleration of Stochastic Algorithms

Extrapolation methods use the last few iterates of an optimization algorithm to produce a better estimate of the optimum. They were shown to achieve optimal convergence rates in a deterministic setting using simple gradient iterates. Here,…

Optimization and Control · Mathematics 2017-08-04 Damien Scieur, Alexandre d'Aspremont, Francis Bach

Improving RNA Secondary Structure Design using Deep Reinforcement Learning

Rising costs in recent years of developing new drugs and treatments have led to extensive research in optimization techniques in biomolecular design. Currently, the most widely used approach in biomolecular design is directed evolution,…

Machine Learning · Computer Science 2021-11-09 Alexander Whatley, Zhekun Luo, Xiangru Tang

Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going

Deep neural networks have proven to be particularly effective in visual and audio recognition tasks. Existing models tend to be computationally expensive and memory intensive, however, and so methods for hardware-oriented approximation have…

Computer Vision and Pattern Recognition · Computer Science 2019-07-09 Erwei Wang, James J. Davis, Ruizhe Zhao, Ho-Cheung Ng, Xinyu Niu, Wayne Luk, Peter Y. K. Cheung, George A. Constantinides

Data augmentation instead of explicit regularization

Contrary to most machine learning models, modern deep artificial neural networks typically include multiple components that contribute to regularization. Despite the fact that some (explicit) regularization techniques, such as weight decay…

Computer Vision and Pattern Recognition · Computer Science 2020-11-13 Alex Hernández-García, Peter König

Empirically Accelerating Scaled Gradient Projection Using Deep Neural Network For Inverse Problems In Image Processing

Recently, deep neural networks (DNNs) have shown advantages in accelerating optimization algorithms. One approach is to unfold a finite number of iterations of conventional optimization algorithms and to learn parameters in the algorithms.…

Machine Learning · Computer Science 2021-04-23 Byung Hyun Lee, Se Young Chun