English
Related papers

Related papers: Deep Radial-Basis Value Functions for Continuous C…

200 papers

Learning continuous control in high-dimensional sparse reward settings, such as robotic manipulation, is a challenging problem due to the number of samples often required to obtain accurate optimal value and policy estimates. While many…

Robotics · Computer Science 2021-07-29 Sreehari Rammohan , Shangqun Yu , Bowen He , Eric Hsiung , Eric Rosen , Stefanie Tellex , George Konidaris

A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative…

Machine Learning · Computer Science 2023-06-14 Bogdan Mazoure , Walter Talbott , Miguel Angel Bautista , Devon Hjelm , Alexander Toshev , Josh Susskind

Traditional off-policy actor-critic Reinforcement Learning (RL) algorithms learn value functions of a single target policy. However, when value functions are updated to track the learned policy, they forget potentially useful information…

Machine Learning · Computer Science 2021-08-16 Francesco Faccio , Louis Kirsch , Jürgen Schmidhuber

From fully connected neural networks to convolutional neural networks, the learned parameters within a neural network have been primarily relegated to the linear parameters (e.g., convolutional filters). The non-linear functions (e.g.,…

Neural and Evolutionary Computing · Computer Science 2019-11-22 Andrew Hryniowski , Alexander Wong

Despite recent successes in Reinforcement Learning, value-based methods often suffer from high variance hindering performance. In this paper, we illustrate this in a continuous control setting where state of the art methods perform poorly…

Machine Learning · Computer Science 2019-05-24 Pierre Thodoroff , Nishanth Anand , Lucas Caccia , Doina Precup , Joelle Pineau

Our main contribution in this work is an empirical finding that random General Value Functions (GVFs), i.e., deep action-conditional predictions -- random both in what feature of observations they predict as well as in the sequence of…

Machine Learning · Computer Science 2021-11-09 Zeyu Zheng , Vivek Veeriah , Risto Vuorio , Richard Lewis , Satinder Singh

Neural networks allow Q-learning reinforcement learning agents such as deep Q-networks (DQN) to approximate complex mappings from state spaces to value functions. However, this also brings drawbacks when compared to other function…

Machine Learning · Computer Science 2018-06-21 Jack Shannon , Marek Grzes

In this work, we study value function approximation in reinforcement learning (RL) problems with high dimensional state or action spaces via a generalized version of representation policy iteration (RPI). We consider the limitations of…

Machine Learning · Computer Science 2019-01-18 Sephora Madjiheurem , Laura Toni

Radial basis function neural networks (RBFs) are prime candidates for pattern classification and regression and have been used extensively in classical machine learning applications. However, RBFs have not been integrated into contemporary…

Computer Vision and Pattern Recognition · Computer Science 2022-08-25 Mohammadreza Amirian , Friedhelm Schwenker

This paper addresses a multi-echelon inventory management problem with a complex network topology where deriving optimal ordering decisions is difficult. Deep reinforcement learning (DRL) has recently shown potential in solving such…

Machine Learning · Computer Science 2024-01-30 Liqiang Cheng , Jun Luo , Weiwei Fan , Yidong Zhang , Yuan Li

Learning a predictive model of the mean return, or value function, plays a critical role in many reinforcement learning algorithms. Distributional reinforcement learning (DRL) has been shown to improve performance by modeling the value…

Machine Learning · Computer Science 2025-07-08 Ju-Seung Byun , Andrew Perrault

A residual deep reinforcement learning (RDRL) approach is proposed by integrating DRL with model-based optimization for inverter-based volt-var control in active distribution networks when the accurate power flow model is unknown. RDRL…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Qiong Liu , Ye Guo , Lirong Deng , Haotian Liu , Dongyu Li , Hongbin Sun

In this paper, we propose a principled deep reinforcement learning (RL) approach that is able to accelerate the convergence rate of general deep neural networks (DNNs). With our approach, a deep RL agent (synonym for optimizer in this work)…

Machine Learning · Computer Science 2017-07-14 Jie Fu

While reinforcement learning (RL) from raw images has been largely investigated in the last decade, existing approaches still suffer from a number of constraints. The high input dimension is often handled using either expert knowledge to…

Computer Vision and Pattern Recognition · Computer Science 2022-06-15 Julien Hautot , Céline Teuliere , Nourddine Azzaoui

In this paper, a novel optimal control-based baseline function is presented for the policy gradient method in deep reinforcement learning (RL). The baseline is obtained by computing the value function of an optimal control problem, which is…

Machine Learning · Computer Science 2024-11-07 Xubo Lyu , Site Li , Seth Siriya , Ye Pu , Mo Chen

Dealing with high variance is a significant challenge in model-free reinforcement learning (RL). Existing methods are unreliable, exhibiting high variance in performance from run to run using different initializations/seeds. Focusing on…

Machine Learning · Computer Science 2019-05-15 Richard Cheng , Abhinav Verma , Gabor Orosz , Swarat Chaudhuri , Yisong Yue , Joel W. Burdick

Considering its advantages in dealing with high-dimensional visual input and learning control policies in discrete domain, Deep Q Network (DQN) could be an alternative method of traditional auto-focus means in the future. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Xiaofan Yu , Runze Yu , Jingsong Yang , Xiaohui Duan

Deep Neural Networks (DNNs) became the standard tool for function approximation with most of the introduced architectures being developed for high-dimensional input data. However, many real-world problems have low-dimensional inputs for…

Neural and Evolutionary Computing · Computer Science 2024-02-06 Daniel Jost , Basavasagar Patil , Xavier Alameda-Pineda , Chris Reinke

We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to…

Machine Learning · Statistics 2019-09-25 Ian Osband , Benjamin Van Roy , Daniel Russo , Zheng Wen

Reinforcement Learning (RL) in environments with complex, history-dependent reward structures poses significant challenges for traditional methods. In this work, we introduce a novel approach that leverages automaton-based feedback to guide…

Machine Learning · Computer Science 2025-10-20 Mahyar Alinejad , Alvaro Velasquez , Yue Wang , George Atia
‹ Prev 1 2 3 10 Next ›