English
Related papers

Related papers: Trajectory-wise Control Variates for Variance Redu…

200 papers

The control variates (CV) method is widely used in policy gradient estimation to reduce the variance of the gradient estimators in practice. A control variate is applied by subtracting a baseline function from the state-action value…

Machine Learning · Computer Science 2021-08-12 Yuanyi Zhong , Yuan Zhou , Jian Peng

Control variates can be a powerful tool to reduce the variance of Monte Carlo estimators, but constructing effective control variates can be challenging when the number of samples is small. In this paper, we show that when a large number of…

Methodology · Statistics 2023-06-08 Zhuo Sun , Chris J. Oates , François-Xavier Briol

In statistics and machine learning, approximation of an intractable integration is often achieved by using the unbiased Monte Carlo estimator, but the variances of the estimation are generally high in many applications. Control variates…

Machine Learning · Statistics 2019-10-16 Ruosi Wan , Mingjun Zhong , Haoyi Xiong , Zhanxing Zhu

Variational inference in Bayesian deep learning often involves computing the gradient of an expectation that lacks a closed-form solution. In these cases, pathwise and score-function gradient estimators are the most common approaches. The…

Machine Learning · Statistics 2024-10-10 Kenyon Ng , Susan Wei

Policy-gradient methods in Reinforcement Learning(RL) are very universal and widely applied in practice but their performance suffers from the high variance of the gradient estimate. Several procedures were proposed to reduce it including…

Machine Learning · Computer Science 2022-06-16 Maxim Kaledin , Alexander Golubev , Denis Belomestny

Policy gradient methods have achieved remarkable successes in solving challenging reinforcement learning problems. However, it still often suffers from the large variance issue on policy gradient estimation, which leads to poor sample…

Machine Learning · Statistics 2018-02-26 Hao Liu , Yihao Feng , Yi Mao , Dengyong Zhou , Jian Peng , Qiang Liu

Neural control variates (NCVs) have emerged as a powerful tool for variance reduction in Monte Carlo (MC) simulations, particularly in high-dimensional problems where traditional control variates are difficult to construct analytically. By…

High Energy Physics - Lattice · Physics 2025-08-22 Hyunwoo Oh

In this paper we present a new approach to control variates for improving computational efficiency of Ensemble Monte Carlo. We present the approach using simulation of paths of a time-dependent nonlinear stochastic equation. The core idea…

Computational Engineering, Finance, and Science · Computer Science 2008-09-25 T. Borogovac , F. J. Alexander , P. Vakili

Policy gradient methods are reinforcement learning algorithms that adapt a parameterized policy by following a performance gradient estimate. Conventional policy gradient methods use Monte-Carlo techniques to estimate the gradient, which…

Machine Learning · Computer Science 2026-05-01 Mohammad Ghavamzadeh , Yaakov Engel , Michal Valko

Zero-variance control variates (ZV-CV) are a post-processing method to reduce the variance of Monte Carlo estimators of expectations using the derivatives of the log target. Once the derivatives are available, the only additional…

Computation · Statistics 2022-08-17 Leah F. South , Chris J. Oates , Antonietta Mira , Christopher Drovandi

Control variates are variance reduction tools for Monte Carlo estimators. They can provide significant variance reduction, but usually require a large number of samples, which can be prohibitive when sampling or evaluating the integrand is…

Methodology · Statistics 2023-06-08 Zhuo Sun , Alessandro Barp , François-Xavier Briol

Optimizing Conditional Value-at-risk (CVaR) using policy gradient (a.k.a CVaR-PG) faces significant challenges of sample inefficiency. This inefficiency stems from the fact that it focuses on tail-end performance and overlooks many sampled…

Machine Learning · Computer Science 2026-02-06 Yudong Luo , Erick Delage

Non-prehensile manipulation in high-dimensional systems is challenging for a variety of reasons. One of the main reasons is the computationally long planning times that come with a large state space. Trajectory optimisation algorithms have…

Robotics · Computer Science 2024-09-13 David Russell , Rafael Papallas , Mehmet Dogar

Policy gradient methods are powerful reinforcement learning algorithms and have been demonstrated to solve many complex tasks. However, these methods are also data-inefficient, afflicted with high variance gradient estimates, and frequently…

Machine Learning · Computer Science 2019-05-15 Andreas Doerr , Michael Volpp , Marc Toussaint , Sebastian Trimpe , Christian Daniel

The control variates method is a classical variance reduction technique for Monte Carlo estimators that exploits correlated auxiliary variables without introducing bias. In many applications, the quantity of interest can be expressed as a…

Statistics Theory · Mathematics 2025-11-10 Louison Bocquet-Nouaille , Jérôme Morio , Benjamin Bobbia

Control variates are a well-established tool to reduce the variance of Monte Carlo estimators. However, for large-scale problems including high-dimensional and large-sample settings, their advantages can be outweighed by a substantial…

Machine Learning · Statistics 2021-07-22 Shijing Si , Chris. J. Oates , Andrew B. Duncan , Lawrence Carin , François-Xavier Briol

We propose neural control variates (NCV) for unbiased variance reduction in parametric Monte Carlo integration. So far, the core challenge of applying the method of control variates has been finding a good approximation of the integrand…

Machine Learning · Computer Science 2020-09-07 Thomas Müller , Fabrice Rousselle , Jan Novák , Alexander Keller

When optimising for conditional value at risk (CVaR) using policy gradients (PG), current methods rely on discarding a large proportion of trajectories, resulting in poor sample efficiency. We propose a reformulation of the CVaR…

Machine Learning · Computer Science 2025-07-22 Harry Mead , Clarissa Costen , Bruno Lacerda , Nick Hawes

Common cross-validation (CV) methods like k-fold cross-validation or Monte-Carlo cross-validation estimate the predictive performance of a learner by repeatedly training it on a large portion of the given data and testing on the remaining…

Machine Learning · Computer Science 2021-11-30 Felix Mohr , Jan N. van Rijn

Bayesian inference for inverse problems involves computing expectations under posterior distributions -- e.g., posterior means, variances, or predictive quantities -- typically via Monte Carlo (MC) estimation. When the quantity of interest…

Machine Learning · Statistics 2026-02-26 Ali Siahkoohi , Hyunwoo Oh
‹ Prev 1 2 3 10 Next ›