English
Related papers

Related papers: Accelerated Multi-objective Task Learning using Mo…

200 papers

Q-Learning is a fundamental off-policy reinforcement learning (RL) algorithm that has the objective of approximating action-value functions in order to learn optimal policies. Nonetheless, it has difficulties in reconciling bias with…

Machine Learning · Computer Science 2024-11-22 Mahammad Humayoo

Quality Diversity (QD) has shown great success in discovering high-performing, diverse policies for robot skill learning. While current benchmarks have led to the development of powerful QD methods, we argue that new paradigms must be…

Robotics · Computer Science 2024-07-26 Sumeet Batra , Bryon Tjanaka , Stefanos Nikolaidis , Gaurav Sukhatme

This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate rewards using a variation of Q-Learning algorithm. Unlike the conventional Q-Learning, the proposed algorithm compares current reward with…

Machine Learning · Computer Science 2010-09-15 Punit Pandey , Deepshikha Pandey , Shishir Kumar

The development of machine learning algorithms has been gathering relevance to address the increasing modelling complexity of manufacturing decision-making problems. Reinforcement learning is a methodology with great potential due to the…

Machine Learning · Computer Science 2023-04-18 Miguel Neves , Miguel Vieira , Pedro Neto

Deep Q-learning has achieved significant success in single-agent decision making tasks. However, it is challenging to extend Q-learning to large-scale multi-agent scenarios, due to the explosion of action space resulting from the complex…

Multiagent Systems · Computer Science 2019-10-14 Ming Zhou , Yong Chen , Ying Wen , Yaodong Yang , Yufeng Su , Weinan Zhang , Dell Zhang , Jun Wang

Q-learning is a widely used reinforcement learning technique for solving path planning problems. It primarily involves the interaction between an agent and its environment, enabling the agent to learn an optimal strategy that maximizes…

Robotics · Computer Science 2024-12-18 Yiming Ji , Kaijie Yun , Yang Liu , Zongwu Xie , Hong Liu

This paper studies accelerated algorithms for Q-learning. We propose an acceleration scheme by incorporating the historical iterates of the Q-function. The idea is conceptually inspired by the momentum-based acceleration methods in the…

Systems and Control · Electrical Eng. & Systems 2019-10-28 Bowen Weng , Lin Zhao , Huaqing Xiong , Wei Zhang

Q-learning is a stochastic approximation version of the classic value iteration. The literature has established that Q-learning suffers from both maximization bias and slower convergence. Recently, multi-step algorithms have shown practical…

Machine Learning · Computer Science 2024-07-03 Antony Vijesh , Shreyas S R

Applying Q-learning to high-dimensional or continuous action spaces can be difficult due to the required maximization over the set of possible actions. Motivated by techniques from amortized inference, we replace the expensive maximization…

Machine Learning · Computer Science 2020-01-23 Tom Van de Wiele , David Warde-Farley , Andriy Mnih , Volodymyr Mnih

An improvement of Q-learning is proposed in this paper. It is different from classic Q-learning in that the similarity between different states and actions is considered in the proposed method. During the training, a new updating mechanism…

Artificial Intelligence · Computer Science 2021-06-03 Wei Liao , Xiaohui Wei , Jizhou Lai

The Q-learning algorithm is known to be affected by the maximization bias, i.e. the systematic overestimation of action values, an important issue that has recently received renewed attention. Double Q-learning has been proposed as an…

Machine Learning · Computer Science 2021-02-03 Rong Zhu , Mattia Rigotti

Traditional multi-agent reinforcement learning algorithms are not scalable to environments with more than a few agents, since these algorithms are exponential in the number of agents. Recent research has introduced successful methods to…

Multiagent Systems · Computer Science 2021-01-26 Sriram Ganapathi Subramanian , Matthew E. Taylor , Mark Crowley , Pascal Poupart

This study presents a novel computer system performance optimization and adaptive workload management scheduling algorithm based on Q-learning. In modern computing environments, characterized by increasing data volumes, task complexity, and…

Machine Learning · Computer Science 2024-11-11 Pochun Li , Yuyang Xiao , Jinghua Yan , Xuan Li , Xiaoye Wang

Finding the optimum path for a robot for moving from start to the goal position through obstacles is still a challenging issue. This paper presents a novel path planning method, named D-point trigonometric, based on Q-learning algorithm for…

Artificial Intelligence · Computer Science 2022-11-29 Ehsan Jeihaninejad , Azam Rabiee

The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of…

Machine Learning · Statistics 2015-06-04 Soummya Kar , Jose' M. F. Moura , H. Vincent Poor

Q-learning is one of the most popular methods in Reinforcement Learning (RL). Transfer Learning aims to utilize the learned knowledge from source tasks to help new tasks to improve the sample complexity of the new tasks. Considering that…

Machine Learning · Computer Science 2018-09-25 Yue Wang , Qi Meng , Wei Cheng , Yuting Liug , Zhi-Ming Ma , Tie-Yan Liu

Despite the success of reinforcement learning methods, they have yet to have their breakthrough moment when applied to a broad range of robotic manipulation tasks. This is partly due to the fact that reinforcement learning algorithms are…

Robotics · Computer Science 2022-02-07 Stephen James , Andrew J. Davison

The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited. In this paper, we study the so-called periodic Q-learning…

Machine Learning · Computer Science 2020-02-25 Donghwan Lee , Niao He

In this work, we present a scalable reinforcement learning method for training multi-task policies from large offline datasets that can leverage both human demonstrations and autonomously collected data. Our method uses a Transformer to…

This paper studies accelerations in Q-learning algorithms. We propose an accelerated target update scheme by incorporating the historical iterates of Q functions. The idea is conceptually inspired by the momentum-based accelerated methods…

Machine Learning · Computer Science 2019-05-14 Bowen Weng , Huaqing Xiong , Wei Zhang
‹ Prev 1 2 3 10 Next ›