Related papers: Distributed Deep Q-Learning

Distributed Q-Learning for Dynamically Decoupled Systems

Control of large-scale networked systems often necessitates the availability of complex models for the interactions amongst the agents. However in many applications, building accurate models of agents or interactions amongst them might be…

Optimization and Control · Mathematics 2019-03-21 Siavash Alemzadeh , Mehran Mesbahi

Playing Atari with Deep Reinforcement Learning

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning,…

Machine Learning · Computer Science 2013-12-20 Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Alex Graves , Ioannis Antonoglou , Daan Wierstra , Martin Riedmiller

A Robotic Auto-Focus System based on Deep Reinforcement Learning

Considering its advantages in dealing with high-dimensional visual input and learning control policies in discrete domain, Deep Q Network (DQN) could be an alternative method of traditional auto-focus means in the future. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Xiaofan Yu , Runze Yu , Jingsong Yang , Xiaohui Duan

Modular Multi-Objective Deep Reinforcement Learning with Decision Values

In this work we present a method for using Deep Q-Networks (DQNs) in multi-objective environments. Deep Q-Networks provide remarkable performance in single objective problems learning from high-level visual state representations. However,…

Artificial Intelligence · Computer Science 2018-02-26 Tomasz Tajmajer

Modular Deep Q Networks for Sim-to-real Transfer of Visuo-motor Policies

While deep learning has had significant successes in computer vision thanks to the abundance of visual data, collecting sufficiently large real-world datasets for robot learning can be costly. To increase the practicality of these…

Robotics · Computer Science 2017-12-20 Fangyi Zhang , Jürgen Leitner , Michael Milford , Peter Corke

Deep Q learning for fooling neural networks

Deep learning models are vulnerable to external attacks. In this paper, we propose a Reinforcement Learning (RL) based approach to generate adversarial examples for the pre-trained (target) models. We assume a semi black-box setting where…

Machine Learning · Computer Science 2018-11-15 Mandar Kulkarni

A Deep Q-Learning Agent for the L-Game with Variable Batch Training

We employ the Deep Q-Learning algorithm with Experience Replay to train an agent capable of achieving a high-level of play in the L-Game while self-learning from low-dimensional states. We also employ variable batch size for training in…

Machine Learning · Computer Science 2018-02-20 Petros Giannakopoulos , Yannis Cotronis

Automatic Reward Shaping from Confounded Offline Data

A key task in Artificial Intelligence is learning effective policies for controlling agents in unknown environments to optimize performance measures. Off-policy learning methods, like Q-learning, allow learners to make optimal decisions…

Artificial Intelligence · Computer Science 2025-09-10 Mingxuan Li , Junzhe Zhang , Elias Bareinboim

Towards Better Interpretability in Deep Q-Networks

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical or empirical studies on understanding what these…

Machine Learning · Computer Science 2018-11-16 Raghuram Mandyam Annasamy , Katia Sycara

Transformer Based Reinforcement Learning For Games

Recent times have witnessed sharp improvements in reinforcement learning tasks using deep reinforcement learning techniques like Deep Q Networks, Policy Gradients, Actor Critic methods which are based on deep learning based models and…

Machine Learning · Computer Science 2019-12-10 Uddeshya Upadhyay , Nikunj Shah , Sucheta Ravikanti , Mayanka Medhe

DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction

Deep reinforcement learning can learn effective policies for a wide range of tasks, but is notoriously difficult to use due to instability and sensitivity to hyperparameters. The reasons for this remain unclear. When using standard…

Machine Learning · Computer Science 2020-03-17 Aviral Kumar , Abhishek Gupta , Sergey Levine

Using Deep Q-Learning to Control Optimization Hyperparameters

We present a novel definition of the reinforcement learning state, actions and reward function that allows a deep Q-network (DQN) to learn to control an optimization hyperparameter. Using Q-learning with experience replay, we train two DQNs…

Optimization and Control · Mathematics 2016-06-21 Samantha Hansen

Distributional Reinforcement Learning with Quantile Regression

In reinforcement learning an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the…

Artificial Intelligence · Computer Science 2017-10-30 Will Dabney , Mark Rowland , Marc G. Bellemare , Rémi Munos

Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning

In state of the art model-free off-policy deep reinforcement learning, a replay memory is used to store past experience and derive all network updates. Even if both state and action spaces are continuous, the replay memory only holds a…

Machine Learning · Computer Science 2020-07-16 Sabrina Hoppe , Marc Toussaint

Efficient Off-Policy Reinforcement Learning via Brain-Inspired Computing

Reinforcement Learning (RL) has opened up new opportunities to enhance existing smart systems that generally include a complex decision-making process. However, modern RL algorithms, e.g., Deep Q-Networks (DQN), are based on deep neural…

Machine Learning · Computer Science 2023-06-22 Yang Ni , Danny Abraham , Mariam Issa , Yeseong Kim , Pietro Mercati , Mohsen Imani

Deep Reinforcement Learning for Imbalanced Classification

Data in real-world application often exhibit skewed class distribution which poses an intense challenge for machine learning. Conventional classification algorithms are not effective in the case of imbalanced data distribution, and may fail…

Machine Learning · Computer Science 2019-01-08 Enlu Lin , Qiong Chen , Xiaoming Qi

Learning a Diffusion Model Policy from Rewards via Q-Score Matching

Diffusion models have become a popular choice for representing actor policies in behavior cloning and offline reinforcement learning. This is due to their natural ability to optimize an expressive class of distributions over a continuous…

Machine Learning · Computer Science 2025-02-17 Michael Psenka , Alejandro Escontrela , Pieter Abbeel , Yi Ma

Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates

Reinforcement learning holds the promise of enabling autonomous robots to learn large repertoires of behavioral skills with minimal human intervention. However, robotic applications of reinforcement learning often compromise the autonomy of…

Robotics · Computer Science 2016-11-24 Shixiang Gu , Ethan Holly , Timothy Lillicrap , Sergey Levine

Deep Q-learning: a robust control approach

In this paper, we place deep Q-learning into a control-oriented perspective and study its learning dynamics with well-established techniques from robust control. We formulate an uncertain linear time-invariant model by means of the neural…

Machine Learning · Computer Science 2022-11-08 Balazs Varga , Balazs Kulcsar , Morteza Haghir Chehreghani

Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms

Distributed Distributional DrQ is a model-free and off-policy RL algorithm for continuous control tasks based on the state and observation of the agent, which is an actor-critic method with the data-augmentation and the distributional…

Machine Learning · Computer Science 2024-04-17 Zehao Zhou