Related papers: Deep Learning-Aided Trainable Projected Gradient D…

Gradient Descent based Optimization Algorithms for Deep Learning Models Training

In this paper, we aim at providing an introduction to the gradient descent based optimization algorithms for learning deep neural network models. Deep learning models involving multiple nonlinear projection layers are very challenging to…

Machine Learning · Computer Science 2019-03-12 Jiawei Zhang

A recipe of training neural network-based LDPC decoders

It is known belief propagation decoding variants of LDPC codes can be unrolled easily as neural networks after assigning differed weights to message passing edges flexibly. In this paper we focus on how to determine these weights, in the…

Information Theory · Computer Science 2022-08-10 Guangwen Li , Xiao Yu

Learning to Decode Protograph LDPC Codes

The recent development of deep learning methods provides a new approach to optimize the belief propagation (BP) decoding of linear codes. However, the limitation of existing works is that the scale of neural networks increases rapidly with…

Information Theory · Computer Science 2021-02-11 Jincheng Dai , Kailin Tan , Zhongwei Si , Kai Niu , Mingzhe Chen , H. Vincent Poor , Shuguang Cui

LPGD: A General Framework for Backpropagation through Embedded Optimization Layers

Embedding parameterized optimization problems as layers into machine learning architectures serves as a powerful inductive bias. Training such architectures with stochastic gradient descent requires care, as degenerate derivatives of the…

Machine Learning · Computer Science 2024-12-16 Anselm Paulus , Georg Martius , Vít Musil

Locally-Optimized Reweighted Belief Propagation for Decoding LDPC Codes with Finite-Length

In practice, LDPC codes are decoded using message passing methods. These methods offer good performance but tend to converge slowly and sometimes fail to converge and to decode the desired codewords correctly. Recently, tree-reweighted…

Information Theory · Computer Science 2014-03-05 J. Li , R. C. de Lamare , H. Wymeersch

A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training

Training deep neural networks (DNNs) efficiently is a challenge due to the associated highly nonconvex optimization. The backpropagation (backprop) algorithm has long been the most widely used algorithm for gradient computation of…

Machine Learning · Statistics 2018-03-28 Tim Tsz-Kit Lau , Jinshan Zeng , Baoyuan Wu , Yuan Yao

Robust Gradient Descent via Moment Encoding with LDPC Codes

This paper considers the problem of implementing large-scale gradient descent algorithms in a distributed computing setting in the presence of {\em straggling} processors. To mitigate the effect of the stragglers, it has been previously…

Machine Learning · Statistics 2019-01-04 Raj Kumar Maity , Ankit Singh Rawat , Arya Mazumdar

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Deep learning based enhancement of ordered statistics decoding of short LDPC codes

In the search for highly efficient decoders for short LDPC codes approaching maximum likelihood performance, a relayed decoding strategy, specifically activating the ordered statistics decoding process upon failure of a neural min-sum…

Information Theory · Computer Science 2024-03-26 Guangwen Li , Xiao Yu

Decoupled Parallel Backpropagation with Convergence Guarantee

Backpropagation algorithm is indispensable for the training of feedforward neural networks. It requires propagating error gradients sequentially from the output layer all the way back to the input layer. The backward locking in…

Machine Learning · Computer Science 2018-07-24 Zhouyuan Huo , Bin Gu , Qian Yang , Heng Huang

Proximal Decoding for LDPC-coded Massive MIMO Channels

We propose a novel optimization-based decoding algorithm for LDPC-coded massive MIMO channels. The proposed decoding algorithm is based on a proximal gradient method for solving an approximate maximum a posteriori (MAP) decoding problem.…

Information Theory · Computer Science 2021-02-11 Tadashi Wadayama , Satoshi Takabe

Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models: Extension

We consider distributed optimization under communication constraints for training deep learning models. We propose a new algorithm, whose parameter updates rely on two forces: a regular gradient step, and a corrective direction dictated by…

Machine Learning · Computer Science 2022-04-29 Yunfei Teng , Wenbo Gao , Francois Chalus , Anna Choromanska , Donald Goldfarb , Adrian Weller

Masked Training of Neural Networks with Partial Gradients

State-of-the-art training algorithms for deep learning models are based on stochastic gradient descent (SGD). Recently, many variations have been explored: perturbing parameters for better accuracy (such as in Extragradient), limiting SGD…

Machine Learning · Computer Science 2022-03-23 Amirkeivan Mohtashami , Martin Jaggi , Sebastian U. Stich

Lightweight Projective Derivative Codes for Compressed Asynchronous Gradient Descent

Coded distributed computation has become common practice for performing gradient descent on large datasets to mitigate stragglers and other faults. This paper proposes a novel algorithm that encodes the partial derivatives themselves and…

Machine Learning · Computer Science 2022-06-22 Pedro Soto , Ilia Ilmer , Haibin Guan , Jun Li

The ADMM penalized decoder for LDPC codes

Linear programming (LP) decoding for low-density parity-check (LDPC) codes proposed by Feldman et al. is shown to have theoretical guarantees in several regimes and empirically is not observed to suffer from an error floor. However at low…

Information Theory · Computer Science 2016-11-18 Xishuo Liu , Stark C. Draper

Train Feedfoward Neural Network with Layer-wise Adaptive Rate via Approximating Back-matching Propagation

Stochastic gradient descent (SGD) has achieved great success in training deep neural network, where the gradient is computed through back-propagation. However, the back-propagated values of different layers vary dramatically. This…

Machine Learning · Statistics 2018-02-28 Huishuai Zhang , Wei Chen , Tie-Yan Liu

Reweighted LP Decoding for LDPC Codes

We introduce a novel algorithm for decoding binary linear codes by linear programming. We build on the LP decoding algorithm of Feldman et al. and introduce a post-processing step that solves a second linear program that reweights the…

Information Theory · Computer Science 2011-03-16 Amin Khajehnejad , Alexandros G. Dimakis , Babak Hassibi , Benjamin Vigoda , William Bradley

Gradient Descent Bit Flipping Algorithms for Decoding LDPC Codes

A novel class of bit-flipping (BF) algorithms for decoding low-density parity-check (LDPC) codes is presented. The proposed algorithms, which are called gradient descent bit flipping (GDBF) algorithms, can be regarded as simplified gradient…

Information Theory · Computer Science 2008-04-08 Tadashi Wadayama , Keisuke Nakamura , Masayuki Yagita , Yuuki Funahashi , Shogo Usami , Ichi Takumi

Deep Learning-Aided Projected Gradient Detector for Massive Overloaded MIMO Channels

The paper presents a deep learning-aided iterative detection algorithm for massive overloaded MIMO systems. Since the proposed algorithm is based on the projected gradient descent method with trainable parameters, it is named as trainable…

Information Theory · Computer Science 2018-12-27 Satoshi Takabe , Masayuki Imanishi , Tadashi Wadayama , Kazunori Hayashi

Factor Graph Optimization of Error-Correcting Codes for Belief Propagation Decoding

The design of optimal linear block codes capable of being efficiently decoded is of major concern, especially for short block lengths. As near capacity-approaching codes, Low-Density Parity-Check (LDPC) codes possess several advantages over…

Information Theory · Computer Science 2024-10-11 Yoni Choukroun , Lior Wolf