Related papers: Value Prediction Network

VPNet: Variable Projection Networks

We introduce VPNet, a novel model-driven neural network architecture based on variable projection (VP). Applying VP operators to neural networks results in learnable features, interpretable parameters, and compact network structures. This…

Machine Learning · Computer Science 2021-10-22 Péter Kovács , Gergő Bognár , Christian Huber , Mario Huemer

Dynamic Planning Networks

We introduce Dynamic Planning Networks (DPN), a novel architecture for deep reinforcement learning, that combines model-based and model-free aspects for online planning. Our architecture learns to dynamically construct plans using a learned…

Machine Learning · Computer Science 2019-02-05 Norman Tasfi , Miriam Capretz

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

Deep reinforcement learning (RL) algorithms suffer severe performance degradation when the interaction data is scarce, which limits their real-world application. Recently, visual representation learning has been shown to be effective and…

Machine Learning · Computer Science 2022-08-17 Yang Yue , Bingyi Kang , Zhongwen Xu , Gao Huang , Shuicheng Yan

Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders

Deep Reinforcement Learning (RL) is unquestionably a robust framework to train autonomous agents in a wide variety of disciplines. However, traditional deep and shallow model-free RL algorithms suffer from low sample efficiency and…

Machine Learning · Computer Science 2022-10-05 Per-Arne Andersen , Ole-Christoffer Granmo , Morten Goodwin

Reinforcement Learning via Recurrent Convolutional Neural Networks

Deep Reinforcement Learning has enabled the learning of policies for complex tasks in partially observable environments, without explicitly learning the underlying model of the tasks. While such model-free methods achieve considerable…

Machine Learning · Computer Science 2017-01-11 Tanmay Shankar , Santosha K. Dwivedy , Prithwijit Guha

Reliable Probabilistic Classification with Neural Networks

Venn Prediction (VP) is a new machine learning framework for producing well-calibrated probabilistic predictions. In particular it provides well-calibrated lower and upper bounds for the conditional probability of an example belonging to…

Machine Learning · Computer Science 2023-12-18 Harris Papadopoulos

Visual Sensor Network Reconfiguration with Deep Reinforcement Learning

We present an approach for reconfiguration of dynamic visual sensor networks with deep reinforcement learning (RL). Our RL agent uses a modified asynchronous advantage actor-critic framework and the recently proposed Relational Network…

Machine Learning · Computer Science 2018-08-14 Paul Jasek , Bernard Abayowa

Deep Quality-Value (DQV) Learning

We introduce a novel Deep Reinforcement Learning (DRL) algorithm called Deep Quality-Value (DQV) Learning. DQV uses temporal-difference learning to train a Value neural network and uses this network for training a second Quality-value…

Machine Learning · Statistics 2018-10-11 Matthia Sabatelli , Gilles Louppe , Pierre Geurts , Marco A. Wiering

General Value Function Networks

State construction is important for learning in partially observable environments. A general purpose strategy for state construction is to learn the state update using a Recurrent Neural Network (RNN), which updates the internal state using…

Machine Learning · Computer Science 2021-02-03 Matthew Schlegel , Andrew Jacobsen , Zaheer Abbas , Andrew Patterson , Adam White , Martha White

Value-driven Hindsight Modelling

Value estimation is a critical component of the reinforcement learning (RL) paradigm. The question of how to effectively learn value predictors from data is one of the major problems studied by the RL community, and different approaches…

Machine Learning · Computer Science 2020-10-22 Arthur Guez , Fabio Viola , Théophane Weber , Lars Buesing , Steven Kapturowski , Doina Precup , David Silver , Nicolas Heess

Uniform State Abstraction For Reinforcement Learning

Potential Based Reward Shaping combined with a potential function based on appropriately defined abstract knowledge has been shown to significantly improve learning speed in Reinforcement Learning. MultiGrid Reinforcement Learning (MRL) has…

Machine Learning · Computer Science 2020-04-08 John Burden , Daniel Kudenko

Dueling Network Architectures for Deep Reinforcement Learning

In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this…

Machine Learning · Computer Science 2016-04-06 Ziyu Wang , Tom Schaul , Matteo Hessel , Hado van Hasselt , Marc Lanctot , Nando de Freitas

Pathfinding in Random Partially Observable Environments with Vision-Informed Deep Reinforcement Learning

Deep reinforcement learning is a technique for solving problems in a variety of environments, ranging from Atari video games to stock trading. This method leverages deep neural network models to make decisions based on observations of a…

Machine Learning · Computer Science 2022-09-13 Anthony Dowling

Random Projection in Deep Neural Networks

This work investigates the ways in which deep learning methods can benefit from random projection (RP), a classic linear dimensionality reduction method. We focus on two areas where, as we have found, employing RP techniques can improve…

Machine Learning · Computer Science 2018-12-27 Piotr Iwo Wójcik

Knowing the Past to Predict the Future: Reinforcement Virtual Learning

Reinforcement Learning (RL)-based control system has received considerable attention in recent decades. However, in many real-world problems, such as Batch Process Control, the environment is uncertain, which requires expensive interaction…

Machine Learning · Computer Science 2022-11-03 Peng Zhang , Yawen Huang , Bingzhang Hu , Shizheng Wang , Haoran Duan , Noura Al Moubayed , Yefeng Zheng , Yang Long

An approach of deep reinforcement learning for maximizing the net present value of stochastic projects

This paper investigates a project with stochastic activity durations and cash flows under discrete scenarios, where activities must satisfy precedence constraints generating cash inflows and outflows. The objective is to maximize expected…

Machine Learning · Computer Science 2025-11-18 Wei Xu , Fan Yang , Qinyuan Cui , Zhi Chen

A Novel Predictive-Coding-Inspired Variational RNN Model for Online Prediction and Recognition

This study introduces PV-RNN, a novel variational RNN inspired by the predictive-coding ideas. The model learns to extract the probabilistic structures hidden in fluctuating temporal patterns by dynamically changing the stochasticity of its…

Machine Learning · Computer Science 2019-06-26 Ahmadreza Ahmadi , Jun Tani

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning

In value-based reinforcement learning (RL), unlike in supervised learning, the agent faces not a single, stationary, approximation problem, but a sequence of value prediction problems. Each time the policy improves, the nature of the…

Machine Learning · Computer Science 2021-01-05 Will Dabney , André Barreto , Mark Rowland , Robert Dadashi , John Quan , Marc G. Bellemare , David Silver

DNA: Proximal Policy Optimization with a Dual Network Architecture

This paper explores the problem of simultaneously learning a value function and policy in deep actor-critic reinforcement learning models. We find that the common practice of learning these functions jointly is sub-optimal, due to an…

Machine Learning · Computer Science 2022-11-15 Matthew Aitchison , Penny Sweetser

VA-learning as a more efficient alternative to Q-learning

In reinforcement learning, the advantage function is critical for policy improvement, but is often extracted from a learned Q-function. A natural question is: Why not learn the advantage function directly? In this work, we introduce…

Machine Learning · Computer Science 2024-09-04 Yunhao Tang , Rémi Munos , Mark Rowland , Michal Valko