Related papers: Stochastic Learning for Sparse Discrete Markov Ran…

Stochastic Proximal Gradient Algorithms for Penalized Mixed Models

Motivated by penalized likelihood maximization in complex models, we study optimization problems where neither the function to optimize nor its gradient have an explicit expression, but its gradient can be approximated by a Monte Carlo…

Computation · Statistics 2017-09-28 Gersende Fort , Edouard Ollier , Adeline Samson

Scalable Inference of Sparsely-changing Markov Random Fields with Strong Statistical Guarantees

In this paper, we study the problem of inferring time-varying Markov random fields (MRF), where the underlying graphical model is both sparse and changes sparsely over time. Most of the existing methods for the inference of time-varying…

Machine Learning · Computer Science 2021-02-09 Salar Fattahi , Andres Gomez

Solution Path of Time-varying Markov Random Fields with Discrete Regularization

We study the problem of inferring sparse time-varying Markov random fields (MRFs) with different discrete and temporal regularizations on the parameters. Due to the intractability of discrete regularization, most approaches for solving this…

Optimization and Control · Mathematics 2023-07-27 Salar Fattahi , Andres Gomez

A New Stochastic Approximation Method for Gradient-based Simulated Parameter Estimation

This paper tackles the challenge of parameter calibration in stochastic models, particularly in scenarios where the likelihood function is unavailable in an analytical form. We introduce a gradient-based simulated parameter estimation…

Machine Learning · Statistics 2025-03-25 Zehao Li , Yijie Peng

Efficient Graph Laplacian Estimation by Proximal Newton

The Laplacian-constrained Gaussian Markov Random Field (LGMRF) is a common multivariate statistical model for learning a weighted sparse dependency graph from given data. This graph learning problem can be formulated as a maximum likelihood…

Machine Learning · Computer Science 2024-04-15 Yakov Medvedovsky , Eran Treister , Tirza Routtenberg

On Learning Continuous Pairwise Markov Random Fields

We consider learning a sparse pairwise Markov Random Field (MRF) with continuous-valued variables from i.i.d samples. We adapt the algorithm of Vuffray et al. (2019) to this setting and provide finite-sample analysis revealing sample…

Machine Learning · Computer Science 2020-10-29 Abhin Shah , Devavrat Shah , Gregory W. Wornell

Effective Proximal Methods for Non-convex Non-smooth Regularized Learning

Sparse learning is a very important tool for mining useful information and patterns from high dimensional data. Non-convex non-smooth regularized learning problems play essential roles in sparse learning, and have drawn extensive attentions…

Machine Learning · Computer Science 2020-10-22 Guannan Liang , Qianqian Tong , Jiahao Ding , Miao Pan , Jinbo Bi

Stochastic Variance-Reduced Policy Gradient

In this paper, we propose a novel reinforcement- learning algorithm consisting in a stochastic variance-reduced version of policy gradient for solving Markov Decision Processes (MDPs). Stochastic variance-reduced gradient (SVRG) methods…

Machine Learning · Computer Science 2018-06-15 Matteo Papini , Damiano Binaghi , Giuseppe Canonaco , Matteo Pirotta , Marcello Restelli

Relational Neural Markov Random Fields

Statistical Relational Learning (SRL) models have attracted significant attention due to their ability to model complex data while handling uncertainty. However, most of these models have been limited to discrete domains due to their…

Machine Learning · Computer Science 2021-10-20 Yuqiao Chen , Sriraam Natarajan , Nicholas Ruozzi

Maximum likelihood estimation of regularisation parameters in high-dimensional inverse problems: an empirical Bayesian approach. Part II: Theoretical Analysis

This paper presents a detailed theoretical analysis of the three stochastic approximation proximal gradient algorithms proposed in our companion paper [49] to set regularization parameters by marginal maximum likelihood estimation. We prove…

Statistics Theory · Mathematics 2020-08-14 Valentin De Bortoli , Alain Durmus , Ana F. Vidal , Marcelo Pereyra

Constrained Approximate Maximum Entropy Learning of Markov Random Fields

Parameter estimation in Markov random fields (MRFs) is a difficult task, in which inference over the network is run in the inner loop of a gradient descent procedure. Replacing exact inference with approximate methods such as loopy belief…

Machine Learning · Computer Science 2012-06-18 Varun Ganapathi , David Vickrey , John Duchi , Daphne Koller

On the Stochastic (Variance-Reduced) Proximal Gradient Method for Regularized Expected Reward Optimization

We consider a regularized expected reward optimization problem in the non-oblivious setting that covers many existing problems in reinforcement learning (RL). In order to solve such an optimization problem, we apply and analyze the…

Machine Learning · Computer Science 2024-08-21 Ling Liang , Haizhao Yang

Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations

We develop the mathematical foundations of the stochastic modified equations (SME) framework for analyzing the dynamics of stochastic gradient algorithms, where the latter is approximated by a class of stochastic differential equations with…

Machine Learning · Computer Science 2018-11-06 Qianxiao Li , Cheng Tai , Weinan E

Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning

Stochastic gradient methods for minimizing nonconvex composite objective functions typically rely on the Lipschitz smoothness of the differentiable part, but this assumption fails in many important problem classes like quadratic inverse…

Optimization and Control · Mathematics 2025-01-22 Kuangyu Ding , Jingyang Li , Kim-Chuan Toh

Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming

In this paper, we introduce a new stochastic approximation (SA) type algorithm, namely the randomized stochastic gradient (RSG) method, for solving an important class of nonlinear (possibly nonconvex) stochastic programming (SP) problems.…

Optimization and Control · Mathematics 2015-10-27 Saeed Ghadimi , Guanghui Lan

Stochastic Markov Gradient Descent and Training Low-Bit Neural Networks

The massive size of modern neural networks has motivated substantial recent interest in neural network quantization. We introduce Stochastic Markov Gradient Descent (SMGD), a discrete optimization method applicable to training quantized…

Machine Learning · Computer Science 2020-12-23 Jonathan Ashbrock , Alexander M. Powell

Stochastic Proximal Gradient Algorithm with Minibatches. Application to Large Scale Learning Models

Stochastic optimization lies at the core of most statistical learning models. The recent great development of stochastic algorithmic tools focused significantly onto proximal gradient iterations, in order to find an efficient approach for…

Machine Learning · Computer Science 2020-03-31 Andrei Patrascu , Ciprian Paduraru , Paul Irofti

Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels

Selecting hyperparameters in deep learning greatly impacts its effectiveness but requires manual effort and expertise. Recent works show that Bayesian model selection with Laplace approximations can allow to optimize such hyperparameters…

Machine Learning · Statistics 2023-06-08 Alexander Immer , Tycho F. A. van der Ouderaa , Mark van der Wilk , Gunnar Rätsch , Bernhard Schölkopf

A Stochastic Proximal Method for Nonsmooth Regularized Finite Sum Optimization

We consider the problem of training a deep neural network with nonsmooth regularization to retrieve a sparse and efficient sub-structure. Our regularizer is only assumed to be lower semi-continuous and prox-bounded. We combine an adaptive…

Machine Learning · Statistics 2022-06-20 Dounia Lakhmiri , Dominique Orban , Andrea Lodi

Projected Subgradient Methods for Learning Sparse Gaussians

Gaussian Markov random fields (GMRFs) are useful in a broad range of applications. In this paper we tackle the problem of learning a sparse GMRF in a high-dimensional space. Our approach uses the l1-norm as a regularization on the inverse…

Machine Learning · Computer Science 2012-06-18 John Duchi , Stephen Gould , Daphne Koller