Related papers: Deep Radial-Basis Value Functions for Continuous C…

Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings

Learning continuous control in high-dimensional sparse reward settings, such as robotic manipulation, is a challenging problem due to the number of samples often required to obtain accurate optimal value and policy estimates. While many…

Robotics · Computer Science 2021-07-29 Sreehari Rammohan , Shangqun Yu , Bowen He , Eric Hsiung , Eric Rosen , Stefanie Tellex , George Konidaris

Value function estimation using conditional diffusion models for control

A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative…

Machine Learning · Computer Science 2023-06-14 Bogdan Mazoure , Walter Talbott , Miguel Angel Bautista , Devon Hjelm , Alexander Toshev , Josh Susskind

Parameter-Based Value Functions

Traditional off-policy actor-critic Reinforcement Learning (RL) algorithms learn value functions of a single target policy. However, when value functions are updated to track the learned policy, they forget potentially useful information…

Machine Learning · Computer Science 2021-08-16 Francesco Faccio , Louis Kirsch , Jürgen Schmidhuber

DeepLABNet: End-to-end Learning of Deep Radial Basis Networks with Fully Learnable Basis Functions

From fully connected neural networks to convolutional neural networks, the learned parameters within a neural network have been primarily relegated to the linear parameters (e.g., convolutional filters). The non-linear functions (e.g.,…

Neural and Evolutionary Computing · Computer Science 2019-11-22 Andrew Hryniowski , Alexander Wong

Recurrent Value Functions

Despite recent successes in Reinforcement Learning, value-based methods often suffer from high variance hindering performance. In this paper, we illustrate this in a continuous control setting where state of the art methods perform poorly…

Machine Learning · Computer Science 2019-05-24 Pierre Thodoroff , Nishanth Anand , Lucas Caccia , Doina Precup , Joelle Pineau

Learning State Representations from Random Deep Action-conditional Predictions

Our main contribution in this work is an empirical finding that random General Value Functions (GVFs), i.e., deep action-conditional predictions -- random both in what feature of observations they predict as well as in the sequence of…

Machine Learning · Computer Science 2021-11-09 Zeyu Zheng , Vivek Veeriah , Risto Vuorio , Richard Lewis , Satinder Singh

Reinforcement Learning using Augmented Neural Networks

Neural networks allow Q-learning reinforcement learning agents such as deep Q-networks (DQN) to approximate complex mappings from state spaces to value functions. However, this also brings drawbacks when compared to other function…

Machine Learning · Computer Science 2018-06-21 Jack Shannon , Marek Grzes

Representation Learning on Graphs: A Reinforcement Learning Application

In this work, we study value function approximation in reinforcement learning (RL) problems with high dimensional state or action spaces via a generalized version of representation policy iteration (RPI). We consider the limitations of…

Machine Learning · Computer Science 2019-01-18 Sephora Madjiheurem , Laura Toni

Radial Basis Function Networks for Convolutional Neural Networks to Learn Similarity Distance Metric and Improve Interpretability

Radial basis function neural networks (RBFs) are prime candidates for pattern classification and regression and have been used extensively in classical machine learning applications. However, RBFs have not been integrated into contemporary…

Computer Vision and Pattern Recognition · Computer Science 2022-08-25 Mohammadreza Amirian , Friedhelm Schwenker

A Deep Q-Network Based on Radial Basis Functions for Multi-Echelon Inventory Management

This paper addresses a multi-echelon inventory management problem with a complex network topology where deriving optimal ordering decisions is difficult. Deep reinforcement learning (DRL) has recently shown potential in solving such…

Machine Learning · Computer Science 2024-01-30 Liqiang Cheng , Jun Luo , Weiwei Fan , Yidong Zhang , Yuan Li

Normality-Guided Distributional Reinforcement Learning for Continuous Control

Learning a predictive model of the mean return, or value function, plays a critical role in many reinforcement learning algorithms. Distributional reinforcement learning (DRL) has been shown to improve performance by modeling the value…

Machine Learning · Computer Science 2025-07-08 Ju-Seung Byun , Andrew Perrault

Residual Deep Reinforcement Learning for Inverter-based Volt-Var Control

A residual deep reinforcement learning (RDRL) approach is proposed by integrating DRL with model-based optimization for inverter-based volt-var control in active distribution networks when the accurate power flow model is unknown. RDRL…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Qiong Liu , Ye Guo , Lirong Deng , Haotian Liu , Dongyu Li , Hongbin Sun

Deep Q-Networks for Accelerating the Training of Deep Neural Networks

In this paper, we propose a principled deep reinforcement learning (RL) approach that is able to accelerate the convergence rate of general deep neural networks (DNNs). With our approach, a deep RL agent (synonym for optimizer in this work)…

Machine Learning · Computer Science 2017-07-14 Jie Fu

Visual Radial Basis Q-Network

While reinforcement learning (RL) from raw images has been largely investigated in the last decade, existing approaches still suffer from a number of constraints. The high input dimension is often handled using either expert knowledge to…

Computer Vision and Pattern Recognition · Computer Science 2022-06-15 Julien Hautot , Céline Teuliere , Nourddine Azzaoui

Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods

In this paper, a novel optimal control-based baseline function is presented for the policy gradient method in deep reinforcement learning (RL). The baseline is obtained by computing the value function of an optimal control problem, which is…

Machine Learning · Computer Science 2024-11-07 Xubo Lyu , Site Li , Seth Siriya , Ye Pu , Mo Chen

Control Regularization for Reduced Variance Reinforcement Learning

Dealing with high variance is a significant challenge in model-free reinforcement learning (RL). Existing methods are unreliable, exhibiting high variance in performance from run to run using different initializations/seeds. Focusing on…

Machine Learning · Computer Science 2019-05-15 Richard Cheng , Abhinav Verma , Gabor Orosz , Swarat Chaudhuri , Yisong Yue , Joel W. Burdick

A Robotic Auto-Focus System based on Deep Reinforcement Learning

Considering its advantages in dealing with high-dimensional visual input and learning control policies in discrete domain, Deep Q Network (DQN) could be an alternative method of traditional auto-focus means in the future. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Xiaofan Yu , Runze Yu , Jingsong Yang , Xiaohui Duan

Univariate Radial Basis Function Layers: Brain-inspired Deep Neural Layers for Low-Dimensional Inputs

Deep Neural Networks (DNNs) became the standard tool for function approximation with most of the introduced architectures being developed for high-dimensional input data. However, many real-world problems have low-dimensional inputs for…

Neural and Evolutionary Computing · Computer Science 2024-02-06 Daniel Jost , Basavasagar Patil , Xavier Alameda-Pineda , Chris Reinke

Deep Exploration via Randomized Value Functions

We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to…

Machine Learning · Statistics 2019-09-25 Ian Osband , Benjamin Van Roy , Daniel Russo , Zheng Wen

RLAF: Reinforcement Learning from Automaton Feedback

Reinforcement Learning (RL) in environments with complex, history-dependent reward structures poses significant challenges for traditional methods. In this work, we introduce a novel approach that leverages automaton-based feedback to guide…

Machine Learning · Computer Science 2025-10-20 Mahyar Alinejad , Alvaro Velasquez , Yue Wang , George Atia