Related papers: Trajectory-wise Control Variates for Variance Redu…

Coordinate-wise Control Variates for Deep Policy Gradients

The control variates (CV) method is widely used in policy gradient estimation to reduce the variance of the gradient estimators in practice. A control variate is applied by subtracting a baseline function from the state-action value…

Machine Learning · Computer Science 2021-08-12 Yuanyi Zhong , Yuan Zhou , Jian Peng

Meta-learning Control Variates: Variance Reduction with Limited Data

Control variates can be a powerful tool to reduce the variance of Monte Carlo estimators, but constructing effective control variates can be challenging when the number of samples is small. In this paper, we show that when a large number of…

Methodology · Statistics 2023-06-08 Zhuo Sun , Chris J. Oates , François-Xavier Briol

Neural Control Variates for Variance Reduction

In statistics and machine learning, approximation of an intractable integration is often achieved by using the unbiased Monte Carlo estimator, but the variances of the estimation are generally high in many applications. Control variates…

Machine Learning · Statistics 2019-10-16 Ruosi Wan , Mingjun Zhong , Haoyi Xiong , Zhanxing Zhu

Pathwise Gradient Variance Reduction with Control Variates in Variational Inference

Variational inference in Bayesian deep learning often involves computing the gradient of an expectation that lacks a closed-form solution. In these cases, pathwise and score-function gradient estimators are the most common approaches. The…

Machine Learning · Statistics 2024-10-10 Kenyon Ng , Susan Wei

Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization

Policy-gradient methods in Reinforcement Learning(RL) are very universal and widely applied in practice but their performance suffers from the high variance of the gradient estimate. Several procedures were proposed to reduce it including…

Machine Learning · Computer Science 2022-06-16 Maxim Kaledin , Alexander Golubev , Denis Belomestny

Action-depedent Control Variates for Policy Optimization via Stein's Identity

Policy gradient methods have achieved remarkable successes in solving challenging reinforcement learning problems. However, it still often suffers from the large variance issue on policy gradient estimation, which leads to poor sample…

Machine Learning · Statistics 2018-02-26 Hao Liu , Yihao Feng , Yi Mao , Dengyong Zhou , Jian Peng , Qiang Liu

Training neural control variates using correlated configurations

Neural control variates (NCVs) have emerged as a powerful tool for variance reduction in Monte Carlo (MC) simulations, particularly in high-dimensional problems where traditional control variates are difficult to construct analytically. By…

High Energy Physics - Lattice · Physics 2025-08-22 Hyunwoo Oh

A Control Variate Approach for Improving Efficiency of Ensemble Monte Carlo

In this paper we present a new approach to control variates for improving computational efficiency of Ensemble Monte Carlo. We present the approach using simulation of paths of a time-dependent nonlinear stochastic equation. The core idea…

Computational Engineering, Finance, and Science · Computer Science 2008-09-25 T. Borogovac , F. J. Alexander , P. Vakili

Bayesian policy gradient and actor-critic algorithms

Policy gradient methods are reinforcement learning algorithms that adapt a parameterized policy by following a performance gradient estimate. Conventional policy gradient methods use Monte-Carlo techniques to estimate the gradient, which…

Machine Learning · Computer Science 2026-05-01 Mohammad Ghavamzadeh , Yaakov Engel , Michal Valko

Regularized Zero-Variance Control Variates

Zero-variance control variates (ZV-CV) are a post-processing method to reduce the variance of Monte Carlo estimators of expectations using the derivatives of the log target. Once the derivatives are available, the only additional…

Computation · Statistics 2022-08-17 Leah F. South , Chris J. Oates , Antonietta Mira , Christopher Drovandi

Vector-Valued Control Variates

Control variates are variance reduction tools for Monte Carlo estimators. They can provide significant variance reduction, but usually require a large number of samples, which can be prohibitive when sampling or evaluating the integrand is…

Methodology · Statistics 2023-06-08 Zhuo Sun , Alessandro Barp , François-Xavier Briol

Boosting CVaR Policy Optimization with Quantile Gradients

Optimizing Conditional Value-at-risk (CVaR) using policy gradient (a.k.a CVaR-PG) faces significant challenges of sample inefficiency. This inefficiency stems from the fact that it focuses on tail-end performance and overlooks many sampled…

Machine Learning · Computer Science 2026-02-06 Yudong Luo , Erick Delage

Online state vector reduction during model predictive control with gradient-based trajectory optimisation

Non-prehensile manipulation in high-dimensional systems is challenging for a variety of reasons. One of the main reasons is the computationally long planning times that come with a large state space. Trajectory optimisation algorithms have…

Robotics · Computer Science 2024-09-13 David Russell , Rafael Papallas , Mehmet Dogar

Trajectory-Based Off-Policy Deep Reinforcement Learning

Policy gradient methods are powerful reinforcement learning algorithms and have been demonstrated to solve many complex tasks. However, these methods are also data-inefficient, afflicted with high variance gradient estimates, and frequently…

Machine Learning · Computer Science 2019-05-15 Andreas Doerr , Michael Volpp , Marc Toussaint , Sebastian Trimpe , Christian Daniel

Control variates for variance-reduced ratio of means estimators

The control variates method is a classical variance reduction technique for Monte Carlo estimators that exploits correlated auxiliary variables without introducing bias. In many applications, the quantity of interest can be expressed as a…

Statistics Theory · Mathematics 2025-11-10 Louison Bocquet-Nouaille , Jérôme Morio , Benjamin Bobbia

Scalable Control Variates for Monte Carlo Methods via Stochastic Optimization

Control variates are a well-established tool to reduce the variance of Monte Carlo estimators. However, for large-scale problems including high-dimensional and large-sample settings, their advantages can be outweighed by a substantial…

Machine Learning · Statistics 2021-07-22 Shijing Si , Chris. J. Oates , Andrew B. Duncan , Lawrence Carin , François-Xavier Briol

Neural Control Variates

We propose neural control variates (NCV) for unbiased variance reduction in parametric Monte Carlo integration. So far, the core challenge of applying the method of control variates has been finding a good approximation of the integrand…

Machine Learning · Computer Science 2020-09-07 Thomas Müller , Fabrice Rousselle , Jan Novák , Alexander Keller

Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation

When optimising for conditional value at risk (CVaR) using policy gradients (PG), current methods rely on discarding a large proportion of trajectories, resulting in poor sample efficiency. We propose a reformulation of the CVaR…

Machine Learning · Computer Science 2025-07-22 Harry Mead , Clarissa Costen , Bruno Lacerda , Nick Hawes

Fast and Informative Model Selection using Learning Curve Cross-Validation

Common cross-validation (CV) methods like k-fold cross-validation or Monte-Carlo cross-validation estimate the predictive performance of a learner by repeatedly training it on a large portion of the given data and testing on the remaining…

Machine Learning · Computer Science 2021-11-30 Felix Mohr , Jan N. van Rijn

Conditional neural control variates for variance reduction in Bayesian inverse problems

Bayesian inference for inverse problems involves computing expectations under posterior distributions -- e.g., posterior means, variances, or predictive quantities -- typically via Monte Carlo (MC) estimation. When the quantity of interest…

Machine Learning · Statistics 2026-02-26 Ali Siahkoohi , Hyunwoo Oh