English
Related papers

Related papers: Monte Carlo Policy Gradient Method for Binary Opti…

200 papers

Variable selection is a key issue when analyzing high-dimensional data. The explosion of data with large sample sizes and dimensionality brings new challenges to this problem in both inference accuracy and computational complexity. To…

Methodology · Statistics 2016-11-30 Xu Chen , Shaan Qamar , Surya T. Tokdar

Optimal decision-making under partial observability requires agents to balance reducing uncertainty (exploration) against pursuing immediate objectives (exploitation). In this paper, we introduce a novel policy optimization framework for…

Machine Learning · Computer Science 2025-12-05 Hany Abdulsamad , Sahel Iqbal , Simo Särkkä

Markov chain Monte Carlo algorithms are used to simulate from complex statistical distributions by way of a local exploration of these distributions. This local feature avoids heavy requests on understanding the nature of the target, but it…

Computation · Statistics 2018-04-12 Christian P. Robert , Victor Elvira , Nick Tawn , Changye Wu

Gradient-based methods are often used for policy optimization in deep reinforcement learning, despite being vulnerable to local optima and saddle points. Although gradient-free methods (e.g., genetic algorithms or evolution strategies) help…

Machine Learning · Computer Science 2019-12-24 Xiaobai Ma , Katherine Driggs-Campbell , Zongzhang Zhang , Mykel J. Kochenderfer

The discrete nature of transmitted symbols poses challenges for achieving optimal detection in multiple-input multiple-output (MIMO) systems associated with a large number of antennas. Recently, the combination of two powerful machine…

Signal Processing · Electrical Eng. & Systems 2024-12-11 Xingyu Zhou , Le Liang , Jing Zhang , Chao-Kai Wen , Shi Jin

A Monte Carlo algorithm is said to be adaptive if it automatically calibrates its current proposal distribution using past simulations. The choice of the parametric family that defines the set of proposal distributions is critical for good…

Statistics Theory · Mathematics 2011-11-11 Christian Schäfer , Nicolas Chopin

In this paper, we present a Model-Based Reinforcement Learning (MBRL) algorithm named \emph{Monte Carlo Probabilistic Inference for Learning COntrol} (MC-PILCO). The algorithm relies on Gaussian Processes (GPs) to model the system dynamics…

Machine Learning · Computer Science 2022-11-29 Fabio Amadio , Alberto Dalla Libera , Riccardo Antonello , Daniel Nikovski , Ruggero Carli , Diego Romeres

Policy gradient (PG) is a reinforcement learning (RL) approach that optimizes a parameterized policy model for an expected return using gradient ascent. While PG can work well even in non-Markovian environments, it may encounter plateaus or…

Machine Learning · Computer Science 2024-07-08 Tetsuro Morimura , Kazuhiro Ota , Kenshi Abe , Peinan Zhang

Can Monte Carlo (MC) solvers be directly used in gradient-based methods for PDE-constrained optimization problems? In these problems, a gradient of the loss function is typically presented as a product of two PDE solutions, one for the…

Numerical Analysis · Mathematics 2022-09-27 Qin Li , Li Wang , Yunan Yang

We introduce a gradient-based learning method to automatically adapt Markov chain Monte Carlo (MCMC) proposal distributions to intractable targets. We define a maximum entropy regularised objective function, referred to as generalised speed…

Machine Learning · Statistics 2020-01-07 Michalis K. Titsias , Petros Dellaportas

Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are…

Optimization and Control · Mathematics 2021-05-18 Huan Xiong , Mengyang Yu , Li Liu , Fan Zhu , Fumin Shen , Ling Shao

While standard reinforcement learning optimizes a single reward signal, many applications require optimizing a nonlinear utility $f(J_1^\pi,\dots,J_M^\pi)$ over multiple objectives, where each $J_m^\pi$ denotes the expected discounted…

Machine Learning · Computer Science 2026-03-10 Swetha Ganesh , Vaneet Aggarwal

We propose a Markov Chain Monte Carlo (MCMC) algorithm based on Gibbs sampling with parallel tempering to solve nonlinear optimal control problems. The algorithm is applicable to nonlinear systems with dynamics that can be approximately…

Optimization and Control · Mathematics 2024-07-10 João Hespanha , Kerem Camsari

Binary optimization is a central problem in mathematical optimization and its applications are abundant. To solve this problem, we propose a new class of continuous optimization techniques which is based on Mathematical Programming with…

Optimization and Control · Mathematics 2017-12-07 Ganzhao Yuan , Bernard Ghanem

This paper deals with a complete bipartite matching problem with the objective of finding an optimal matching that maximizes a certain generic predefined utility function on the set of all matchings. After proving the NP-hardness of the…

Discrete Mathematics · Computer Science 2017-10-30 Shana Moothedath , Prasanna Chaporkar , Madhu N. Belur

Markov Chain Monte Carlo (MCMC) is a well-established family of algorithms primarily used in Bayesian statistics to sample from a target distribution when direct sampling is challenging. Existing work on Bayesian decision trees uses MCMC.…

Computation · Statistics 2023-01-24 Efthyvoulos Drousiotis , Paul G. Spirakis , Simon Maskell

In many hierarchical inverse problems, not only do we want to estimate high- or infinite-dimensional model parameters in the parameter-to-observable maps, but we also have to estimate hyperparameters that represent critical assumptions in…

Computation · Statistics 2020-02-18 Johnathan Bardsley , Tiangang Cui

Stochastic gradient Markov chain Monte Carlo (MCMC) algorithms have received much attention in Bayesian computing for big data problems, but they are only applicable to a small class of problems for which the parameter space has a fixed…

Computation · Statistics 2020-02-10 Qifan Song , Yan Sun , Mao Ye , Faming Liang

The problem of optimally scaling the proposal distribution in a Markov chain Monte Carlo algorithm is critical to the quality of the generated samples. Much work has gone into obtaining such results for various Metropolis-Hastings (MH)…

Computation · Statistics 2022-02-07 Sanket Agrawal , Dootika Vats , Krzysztof Łatuszyński , Gareth O. Roberts

We consider stochastic optimization when one only has access to biased stochastic oracles of the objective and the gradient, and obtaining stochastic gradients with low biases comes at high costs. This setting captures various optimization…

Optimization and Control · Mathematics 2024-08-22 Yifan Hu , Jie Wang , Xin Chen , Niao He
‹ Prev 1 2 3 10 Next ›