Related papers: Message Passing Descent for Efficient Machine Lear…

Message Passing Stein Variational Gradient Descent

Stein variational gradient descent (SVGD) is a recently proposed particle-based Bayesian inference method, which has attracted a lot of interest due to its remarkable approximation ability and particle efficiency compared to traditional…

Machine Learning · Statistics 2018-06-11 Jingwei Zhuo , Chang Liu , Jiaxin Shi , Jun Zhu , Ning Chen , Bo Zhang

Rethinking Message Passing Neural Networks with Diffusion Distance-guided Stress Majorization

Message passing neural networks (MPNNs) have emerged as go-to models for learning on graph-structured data in the past decade. Despite their effectiveness, most of such models still incur severe issues such as over-smoothing and…

Machine Learning · Computer Science 2025-11-26 Haoran Zheng , Renchi Yang , Yubo Zhou , Jianliang Xu

Deep learning via message passing algorithms based on belief propagation

Message-passing algorithms based on the Belief Propagation (BP) equations constitute a well-known distributed computational scheme. It is exact on tree-like graphical models and has also proven to be effective in many problems defined on…

Machine Learning · Computer Science 2022-07-20 Carlo Lucibello , Fabrizio Pittorino , Gabriele Perugini , Riccardo Zecchina

Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models: Extension

We consider distributed optimization under communication constraints for training deep learning models. We propose a new algorithm, whose parameter updates rely on two forces: a regular gradient step, and a corrective direction dictated by…

Machine Learning · Computer Science 2022-04-29 Yunfei Teng , Wenbo Gao , Francois Chalus , Anna Choromanska , Donald Goldfarb , Adrian Weller

Accelerating Federated Learning via Momentum Gradient Descent

Federated learning (FL) provides a communication-efficient approach to solve machine learning problems concerning distributed data, without sending raw data to a central server. However, existing works on FL only utilize first-order…

Machine Learning · Computer Science 2019-10-10 Wei Liu , Li Chen , Yunfei Chen , Wenyi Zhang

A Diffusion Approximation Theory of Momentum SGD in Nonconvex Optimization

Momentum Stochastic Gradient Descent (MSGD) algorithm has been widely applied to many nonconvex optimization problems in machine learning, e.g., training deep neural networks, variational Bayesian inference, and etc. Despite its empirical…

Machine Learning · Computer Science 2021-03-09 Tianyi Liu , Zhehui Chen , Enlu Zhou , Tuo Zhao

Distributed Cooperative Positioning in Dense Wireless Networks: A Neural Network Enhanced Fast Convergent Parametric Message Passing Method

Parametric message passing (MP) is a promising technique that provides reliable marginal probability distributions for distributed cooperative positioning (DCP) based on factor graphs (FG), while maintaining minimal computational…

Signal Processing · Electrical Eng. & Systems 2025-05-20 Yue Cao , Shaoshi Yang , Zhiyong Feng

DP-FP: Differentially Private Forward Propagation for Large Models

When applied to large-scale learning problems, the conventional wisdom on privacy-preserving deep learning, known as Differential Private Stochastic Gradient Descent (DP-SGD), has met with limited success due to significant performance…

Machine Learning · Computer Science 2021-12-30 Jian Du , Haitao Mi

A Masked Pruning Approach for Dimensionality Reduction in Communication-Efficient Federated Learning Systems

Federated Learning (FL) represents a growing machine learning (ML) paradigm designed for training models across numerous nodes that retain local datasets, all without directly exchanging the underlying private data with the parameter server…

Machine Learning · Computer Science 2023-12-08 Tamir L. S. Gez , Kobi Cohen

Stochastic Gradient Variance Reduction by Solving a Filtering Problem

Deep neural networks (DNN) are typically optimized using stochastic gradient descent (SGD). However, the estimation of the gradient using stochastic samples tends to be noisy and unreliable, resulting in large gradient variance and bad…

Machine Learning · Computer Science 2021-05-18 Xingyi Yang

Dynamic Momentum Recalibration in Online Gradient Learning

Stochastic Gradient Descent (SGD) and its momentum variants form the backbone of deep learning optimization, yet the underlying dynamics of their gradient behavior remain insufficiently understood. In this work, we reinterpret gradient…

Machine Learning · Computer Science 2026-03-09 Zhipeng Yao , Rui Yu , Guisong Chang , Ying Li , Yu Zhang , Dazhou Li

GD-FPS: Growth-Driven Feedforward Parameter Selection for Efficient Fine-Tuning

Parameter-Efficient Fine-Tuning (PEFT) has emerged as a key strategy for adapting large-scale pre-trained models to downstream tasks, but existing approaches face notable limitations. Addition-based methods, such as Adapters, introduce…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Kenneth Yang , Wen-Li Wei , Jen-Chun Lin

Bayesian Deep Learning Via Expectation Maximization and Turbo Deep Approximate Message Passing

Efficient learning and model compression algorithm for deep neural network (DNN) is a key workhorse behind the rise of deep learning (DL). In this work, we propose a message passing based Bayesian deep learning algorithm called EM-TDAMP to…

Machine Learning · Computer Science 2024-06-11 Wei Xu , An Liu , Yiting Zhang , Vincent Lau

Proactive Message Passing on Memory Factor Networks

We introduce a new type of graphical model that we call a "memory factor network" (MFN). We show how to use MFNs to model the structure inherent in many types of data sets. We also introduce an associated message-passing style algorithm…

Artificial Intelligence · Computer Science 2016-01-19 Patrick Eschenfeldt , Dan Schmidt , Stark Draper , Jonathan Yedidia

Partitioning Message Passing for Graph Fraud Detection

Label imbalance and homophily-heterophily mixture are the fundamental problems encountered when applying Graph Neural Networks (GNNs) to Graph Fraud Detection (GFD) tasks. Existing GNN-based GFD models are designed to augment graph…

Artificial Intelligence · Computer Science 2024-12-03 Wei Zhuo , Zemin Liu , Bryan Hooi , Bingsheng He , Guang Tan , Rizal Fathony , Jia Chen

Message Passing and Combinatorial Optimization

Graphical models use the intuitive and well-studied methods of graph theory to implicitly represent dependencies between variables in large systems. They can model the global behaviour of a complex system by specifying only local factors.…

Artificial Intelligence · Computer Science 2015-08-21 Siamak Ravanbakhsh

A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks

In distributed training of deep neural networks, people usually run Stochastic Gradient Descent (SGD) or its variants on each machine and communicate with other machines periodically. However, SGD might converge slowly in training some deep…

Machine Learning · Computer Science 2022-10-14 Mingrui Liu , Zhenxun Zhuang , Yunwei Lei , Chunyang Liao

Network Gradient Descent Algorithm for Decentralized Federated Learning

We study a fully decentralized federated learning algorithm, which is a novel gradient descent algorithm executed on a communication-based network. For convenience, we refer to it as a network gradient descent (NGD) method. In the NGD…

Machine Learning · Computer Science 2022-05-18 Shuyuan Wu , Danyang Huang , Hansheng Wang

Distributed Machine Learning via Sufficient Factor Broadcasting

Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in machine learning (ML) applications ranging from computer vision to computational biology. When these models are applied to large-scale ML…

Machine Learning · Computer Science 2015-11-30 Pengtao Xie , Jin Kyu Kim , Yi Zhou , Qirong Ho , Abhimanu Kumar , Yaoliang Yu , Eric Xing

Distributed Machine Learning via Sufficient Factor Broadcasting

Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in machine learning (ML) applications ranging from computer vision to computational biology. When these models are applied to large-scale ML…

Machine Learning · Computer Science 2015-09-08 Pengtao Xie , Jin Kyu Kim , Yi Zhou , Qirong Ho , Abhimanu Kumar , Yaoliang Yu , Eric Xing