Related papers: Gaussian Process Policy Optimization

Fast Model-based Policy Search for Universal Policy Networks

Adapting an agent's behaviour to new environments has been one of the primary focus areas of physics based reinforcement learning. Although recent approaches such as universal policy networks partially address this issue by enabling the…

Machine Learning · Computer Science 2022-02-15 Buddhika Laknath Semage , Thommen George Karimpanal , Santu Rana , Svetha Venkatesh

Safe Policy Search with Gaussian Process Models

We propose a method to optimise the parameters of a policy which will be used to safely perform a given task in a data-efficient manner. We train a Gaussian process model to capture the system dynamics, based on the PILCO framework. Our…

Machine Learning · Statistics 2019-12-03 Kyriakos Polymenakos , Alessandro Abate , Stephen Roberts

Learning Policies through Quantile Regression

Policy gradient based reinforcement learning algorithms coupled with neural networks have shown success in learning complex policies in the model free continuous action space control setting. However, explicitly parameterized policies are…

Machine Learning · Computer Science 2019-09-30 Oliver Richter , Roger Wattenhofer

Variational Policy Search using Sparse Gaussian Process Priors for Learning Multimodal Optimal Actions

Policy search reinforcement learning has been drawing much attention as a method of learning a robot control policy. In particular, policy search using such non-parametric policies as Gaussian process regression can learn optimal actions…

Robotics · Computer Science 2021-06-15 Hikaru Sasaki , Takamitsu Matsubara

Bayesian policy gradient and actor-critic algorithms

Policy gradient methods are reinforcement learning algorithms that adapt a parameterized policy by following a performance gradient estimate. Conventional policy gradient methods use Monte-Carlo techniques to estimate the gradient, which…

Machine Learning · Computer Science 2026-05-01 Mohammad Ghavamzadeh , Yaakov Engel , Michal Valko

GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control

The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research. However, the computational complexity of GPs…

Robotics · Computer Science 2022-03-01 Abdolreza Taheri , Joni Pajarinen , Reza Ghabcheloo

Learning Robust Controllers Via Probabilistic Model-Based Policy Search

Model-based Reinforcement Learning estimates the true environment through a world model in order to approximate the optimal policy. This family of algorithms usually benefits from better sample efficiency than their model-free counterparts.…

Machine Learning · Computer Science 2021-10-27 Valentin Charvet , Bjørn Sand Jensen , Roderick Murray-Smith

Deep Gaussian Process Proximal Policy Optimization

Uncertainty estimation for Reinforcement Learning (RL) is a critical component in control tasks where agents must balance safe exploration and efficient learning. While deep neural networks have enabled breakthroughs in RL, they often lack…

Machine Learning · Computer Science 2025-12-22 Matthijs van der Lende , Juan Cardenas-Cartagena

Adaptive Simulation-based Training of AI Decision-makers using Bayesian Optimization

This work studies how an AI-controlled dog-fighting agent with tunable decision-making parameters can learn to optimize performance against an intelligent adversary, as measured by a stochastic objective function evaluated on simulated…

Machine Learning · Computer Science 2017-08-01 Brett W. Israelsen , Nisar Ahmed , Kenneth Center , Roderick Green , Winston Bennett

Physics makes the difference: Bayesian optimization and active learning via augmented Gaussian process

Both experimental and computational methods for the exploration of structure, functionality, and properties of materials often necessitate the search across broad parameter spaces to discover optimal experimental conditions and regions of…

Computational Physics · Physics 2021-08-31 Maxim Ziatdinov , Ayana Ghosh , Sergei V. Kalinin

Inverse Reinforcement Learning with Gaussian Process

We present new algorithms for inverse reinforcement learning (IRL, or inverse optimal control) in convex optimization settings. We argue that finite-space IRL can be posed as a convex quadratic program under a Bayesian inference framework…

Machine Learning · Computer Science 2013-01-22 Qifeng Qiao , Peter A. Beling

Constrained Bayesian Optimization under Bivariate Gaussian Process with Application to Cure Process Optimization

Bayesian Optimization, leveraging Gaussian process models, has proven to be a powerful tool for minimizing expensive-to-evaluate objective functions by efficiently exploring the search space. Extensions such as constrained Bayesian…

Computation · Statistics 2025-06-03 Yezhuo Li , Qiong Zhang , Madhura Limaye , Gang Li

Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics

Guided policy search algorithms have been proven to work with incredible accuracy for not only controlling a complicated dynamical system, but also learning optimal policies from various unseen instances. One assumes true nature of the…

Systems and Control · Electrical Eng. & Systems 2020-10-02 Prakash Mallick , Zhiyong Chen , Mohsen Zamani

Optimization as Estimation with Gaussian Processes in Bandit Settings

Recently, there has been rising interest in Bayesian optimization -- the optimization of an unknown function with assumptions usually expressed by a Gaussian Process (GP) prior. We study an optimization strategy that directly uses an…

Machine Learning · Statistics 2018-08-14 Zi Wang , Bolei Zhou , Stefanie Jegelka

Computationally Efficient Bayesian Learning of Gaussian Process State Space Models

Gaussian processes allow for flexible specification of prior assumptions of unknown dynamics in state space models. We present a procedure for efficient Bayesian learning in Gaussian process state space models, where the representation is…

Computation · Statistics 2016-04-18 Andreas Svensson , Arno Solin , Simo Särkkä , Thomas B. Schön

Bayesian Optimization using Deep Gaussian Processes

Bayesian Optimization using Gaussian Processes is a popular approach to deal with the optimization of expensive black-box functions. However, because of the a priori on the stationarity of the covariance matrix of classic Gaussian…

Machine Learning · Statistics 2019-05-10 Ali Hebbal , Loic Brevault , Mathieu Balesdent , El-Ghazali Talbi , Nouredine Melab

Gaussian-Process-based Robot Learning from Demonstration

Endowed with higher levels of autonomy, robots are required to perform increasingly complex manipulation tasks. Learning from demonstration is arising as a promising paradigm for transferring skills to robots. It allows to implicitly learn…

Robotics · Computer Science 2023-02-24 Miguel Arduengo , Adrià Colomé , Joan Lobo-Prat , Luis Sentis , Carme Torras

Simulation Based Bayesian Optimization

Bayesian Optimization (BO) is a powerful method for optimizing black-box functions by combining prior knowledge with ongoing function evaluations. BO constructs a probabilistic surrogate model of the objective function given the covariates,…

Machine Learning · Statistics 2025-08-26 Roi Naveiro , Becky Tang

Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution

Reinforcement learning methods for continuous control tasks have evolved in recent years generating a family of policy gradient methods that rely primarily on a Gaussian distribution for modeling a stochastic policy. However, the Gaussian…

Machine Learning · Computer Science 2021-11-04 Irving G. B. Petrazzini , Eric A. Antonelo

Reparameterized Policy Learning for Multimodal Trajectory Optimization

We investigate the challenge of parametrizing policies for reinforcement learning (RL) in high-dimensional continuous action spaces. Our objective is to develop a multimodal policy that overcomes limitations inherent in the commonly-used…

Machine Learning · Computer Science 2023-07-21 Zhiao Huang , Litian Liang , Zhan Ling , Xuanlin Li , Chuang Gan , Hao Su