Related papers: Gradient-based Discrete Sampling with Automatic Cy…

Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning

The posteriors over neural network weights are high dimensional and multimodal. Each mode typically characterizes a meaningfully different representation of the data. We develop Cyclical Stochastic Gradient MCMC (SG-MCMC) to automatically…

Machine Learning · Computer Science 2020-05-13 Ruqi Zhang , Chunyuan Li , Jianyi Zhang , Changyou Chen , Andrew Gordon Wilson

Discrete Sampling using Semigradient-based Product Mixtures

We consider the problem of inference in discrete probabilistic models, that is, distributions over subsets of a finite ground set. These encompass a range of well-known models in machine learning, such as determinantal point processes and…

Machine Learning · Computer Science 2018-07-10 Alkis Gotovos , Hamed Hassani , Andreas Krause , Stefanie Jegelka

Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

We propose a general and scalable approximate sampling strategy for probabilistic models with discrete variables. Our approach uses gradients of the likelihood function with respect to its discrete inputs to propose updates in a…

Machine Learning · Computer Science 2021-06-08 Will Grathwohl , Kevin Swersky , Milad Hashemi , David Duvenaud , Chris J. Maddison

Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation

Gradients have been exploited in proposal distributions to accelerate the convergence of Markov chain Monte Carlo algorithms on discrete distributions. However, these methods require a natural differentiable extension of the target discrete…

Machine Learning · Computer Science 2023-02-28 Yue Xiang , Dongyao Zhu , Bowen Lei , Dongkuan Xu , Ruqi Zhang

Distributed Variable Sample-size Stochastic Optimization with Fixed Step-sizes

The paper considers distributed stochastic optimization over randomly switching networks, where agents collaboratively minimize the average of all agents' local expectation-valued convex cost functions. Due to the stochasticity in gradient…

Optimization and Control · Mathematics 2022-04-07 Jinlong Lei , Peng Yi , Jie Chen , Yiguang Hong

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Align Your Steps: Optimizing Sampling Schedules in Diffusion Models

Diffusion models (DMs) have established themselves as the state-of-the-art generative modeling approach in the visual domain and beyond. A crucial drawback of DMs is their slow sampling speed, relying on many sequential function evaluations…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Amirmojtaba Sabour , Sanja Fidler , Karsten Kreis

Adaptive Sampling Scheduler

Consistent distillation methods have evolved into effective techniques that significantly accelerate the sampling process of diffusion models. Although existing methods have achieved remarkable results, the selection of target timesteps…

Computer Vision and Pattern Recognition · Computer Science 2025-09-17 Qi Wang , Shuliang Zhu , Jinjia Zhou

A Distributed Flexible Delay-tolerant Proximal Gradient Algorithm

We develop and analyze an asynchronous algorithm for distributed convex optimization when the objective writes a sum of smooth functions, local to each worker, and a non-smooth function. Unlike many existing methods, our distributed…

Optimization and Control · Mathematics 2019-12-13 Konstantin Mishchenko , Franck Iutzeler , Jérôme Malick

Decentralized Markov Chain Gradient Descent

Decentralized stochastic gradient method emerges as a promising solution for solving large-scale machine learning problems. This paper studies the decentralized Markov chain gradient descent (DMGD) algorithm - a variant of the decentralized…

Optimization and Control · Mathematics 2021-04-14 Tao Sun , Dongsheng Li

Accelerated Markov Chain Monte Carlo Algorithms on Discrete States

We propose a class of discrete state sampling algorithms based on Nesterov's accelerated gradient method, which extends the classical Metropolis-Hastings (MH) algorithm. The evolution of the discrete states probability distribution governed…

Optimization and Control · Mathematics 2026-02-10 Bohan Zhou , Shu Liu , Xinzhe Zuo , Wuchen Li

Adjusted Scores for Discrete Langevin Algorithms

Sampling from discrete distributions is a ubiquitous task in machine learning, recently revisited by the emergence of discrete diffusion models. While Langevin algorithms constitute the state of the art for continuous spaces, discrete…

Statistics Theory · Mathematics 2026-02-18 Armand Gissler , Saeed Saremi , Francis Bach

Annealed Stein Variational Gradient Descent

Particle based optimization algorithms have recently been developed as sampling methods that iteratively update a set of particles to approximate a target distribution. In particular Stein variational gradient descent has gained attention…

Machine Learning · Computer Science 2021-03-19 Francesco D'Angelo , Vincent Fortuin

Stochastic Gradient Piecewise Deterministic Monte Carlo Samplers

Recent work has suggested using Monte Carlo methods based on piecewise deterministic Markov processes (PDMPs) to sample from target distributions of interest. PDMPs are non-reversible continuous-time processes endowed with momentum, and…

Machine Learning · Statistics 2024-06-28 Paul Fearnhead , Sebastiano Grazzi , Chris Nemeth , Gareth O. Roberts

Safe Adaptive Importance Sampling

Importance sampling has become an indispensable strategy to speed up optimization algorithms for large-scale applications. Improved adaptive variants - using importance values defined by the complete gradient information which changes…

Machine Learning · Computer Science 2017-11-08 Sebastian U. Stich , Anant Raj , Martin Jaggi

A Langevin-like Sampler for Discrete Distributions

We propose discrete Langevin proposal (DLP), a simple and scalable gradient-based proposal for sampling complex high-dimensional discrete distributions. In contrast to Gibbs sampling-based methods, DLP is able to update all coordinates in…

Machine Learning · Computer Science 2022-06-22 Ruqi Zhang , Xingchao Liu , Qiang Liu

Machine learning approach to chance-constrained problems: An algorithm based on the stochastic gradient descent

We consider chance-constrained problems with discrete random distribution. We aim for problems with a large number of scenarios. We propose a novel method based on the stochastic gradient descent method which performs updates of the…

Optimization and Control · Mathematics 2019-05-28 Lukáš Adam , Martin Branda

AutoSampling: Search for Effective Data Sampling Schedules

Data sampling acts as a pivotal role in training deep learning models. However, an effective sampling schedule is difficult to learn due to the inherently high dimension of parameters in learning the sampling schedule. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2021-05-31 Ming Sun , Haoxuan Dou , Baopu Li , Lei Cui , Junjie Yan , Wanli Ouyang

Enhancing Gradient-based Discrete Sampling via Parallel Tempering

While gradient-based discrete samplers are effective in sampling from complex distributions, they are susceptible to getting trapped in local minima, particularly in high-dimensional, multimodal discrete distributions, owing to the…

Machine Learning · Statistics 2025-05-21 Luxu Liang , Yuhang Jia , Feng Zhou

On Cyclical MCMC Sampling

Cyclical MCMC is a novel MCMC framework recently proposed by Zhang et al. (2019) to address the challenge posed by high-dimensional multimodal posterior distributions like those arising in deep learning. The algorithm works by generating a…

Computation · Statistics 2024-03-04 Liwei Wang , Xinru Liu , Aaron Smith , Yves Atchade