English
Related papers

Related papers: Potential-Based Shaping and Q-Value Initialization…

200 papers

Potential-based reward shaping is commonly used to incorporate prior knowledge of how to solve the task into reinforcement learning because it can formally guarantee policy invariance. As such, the optimal policy and the ordering of…

Machine Learning · Computer Science 2025-02-04 Henrik Müller , Daniel Kudenko

Q-shaping is an extension of Q-value initialization and serves as an alternative to reward shaping for incorporating domain knowledge to accelerate agent training, thereby improving sample efficiency by directly shaping Q-values. This…

Artificial Intelligence · Computer Science 2024-10-03 Xiefeng Wu

Reinforcement learning, which acquires a policy maximizing long-term rewards, has been actively studied. Unfortunately, this learning type is too slow and difficult to use in practical situations because the state-action space becomes huge…

Machine Learning · Computer Science 2024-10-28 Takato Okudo , Seiji Yamada

Reward shaping is an effective technique for incorporating domain knowledge into reinforcement learning (RL). Existing approaches such as potential-based reward shaping normally make full use of a given shaping reward function. However,…

Machine Learning · Computer Science 2020-11-06 Yujing Hu , Weixun Wang , Hangtian Jia , Yixiang Wang , Yingfeng Chen , Jianye Hao , Feng Wu , Changjie Fan

Learning to solve sparse-reward reinforcement learning problems is difficult, due to the lack of guidance towards the goal. But in some problems, prior knowledge can be used to augment the learning process. Reward shaping is a way to…

Machine Learning · Computer Science 2021-09-14 Zhao Yang , Mike Preuss , Aske Plaat

Providing reinforcement learning agents with informationally rich human knowledge can dramatically improve various aspects of learning. Prior work has developed different kinds of shaping methods that enable agents to learn efficiently in…

Human-Computer Interaction · Computer Science 2018-11-13 Chao Yu , Tianpei Yang , Wenxuan Zhu , Dongxu wang , Guangliang Li

Reinforcement learning provides an automated framework for learning behaviors from high-level reward specifications, but in practice the choice of reward function can be crucial for good results -- while in principle the reward only needs…

Machine Learning · Computer Science 2022-10-19 Abhishek Gupta , Aldo Pacchiano , Yuexiang Zhai , Sham M. Kakade , Sergey Levine

In reinforcement learning, different reward functions can be equivalent in terms of the optimal policies they induce. A particularly well-known and important example is potential shaping, a class of functions that can be added to any reward…

Machine Learning · Computer Science 2022-12-06 Erik Jenner , Herke van Hoof , Adam Gleave

In continuing tasks, average-reward reinforcement learning may be a more appropriate problem formulation than the more common discounted reward formulation. As usual, learning an optimal policy in this setting typically requires a large…

Artificial Intelligence · Computer Science 2023-01-18 Yuqian Jiang , Sudarshanan Bharadwaj , Bo Wu , Rishi Shah , Ufuk Topcu , Peter Stone

Reinforcement learning involves agents interacting with an environment to complete tasks. When rewards provided by the environment are sparse, agents may not receive immediate feedback on the quality of actions that they take, thereby…

Multiagent Systems · Computer Science 2022-02-22 Baicen Xiao , Bhaskar Ramasubramanian , Radha Poovendran

The main challenge in developing effective reinforcement learning (RL) pipelines is often the design and tuning the reward functions. Well-designed shaping reward can lead to significantly faster learning. Naively formulated rewards,…

Robotics · Computer Science 2023-07-20 Se Hwan Jeon , Steve Heim , Charles Khazoom , Sangbae Kim

This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate rewards using a variation of Q-Learning algorithm. Unlike the conventional Q-Learning, the proposed algorithm compares current reward with…

Machine Learning · Computer Science 2010-09-15 Punit Pandey , Deepshikha Pandey , Shishir Kumar

Potential-based reward shaping (PBRS) is a particular category of machine learning methods which aims to improve the learning speed of a reinforcement learning agent by extracting and utilizing extra knowledge while performing a task. There…

Artificial Intelligence · Computer Science 2023-03-15 Babak Badnava , Mona Esmaeili , Nasser Mozayani , Payman Zarkesh-Ha

Inverse reinforcement learning (IRL) is computationally challenging, with common approaches requiring the solution of multiple reinforcement learning (RL) sub-problems. This work motivates the use of potential-based reward shaping to reduce…

Machine Learning · Computer Science 2023-12-19 Lauren H. Cooke , Harvey Klyne , Edwin Zhang , Cassidy Laidlaw , Milind Tambe , Finale Doshi-Velez

Reward shaping is one of the most effective methods to tackle the crucial yet challenging problem of credit assignment in Reinforcement Learning (RL). However, designing shaping functions usually requires much expert knowledge and…

Machine Learning · Computer Science 2019-01-29 Haosheng Zou , Tongzheng Ren , Dong Yan , Hang Su , Jun Zhu

This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies. We show that a potential-based reward shaping scheme is able to…

Machine Learning · Computer Science 2019-07-23 Baicen Xiao , Bhaskar Ramasubramanian , Andrew Clark , Hannaneh Hajishirzi , Linda Bushnell , Radha Poovendran

We investigate the effect of reward shaping in improving the performance of reinforcement learning in the context of the real-time strategy, capture-the-flag game. The game is characterized by sparse rewards that are associated with…

Machine Learning · Computer Science 2023-11-29 John Kliem , Prithviraj Dasgupta

The aim of Reinforcement Learning (RL) in real-world applications is to create systems capable of making autonomous decisions by learning from their environment through trial and error. This paper emphasizes the importance of reward…

Machine Learning · Computer Science 2024-12-31 Sinan Ibrahim , Mostafa Mostafa , Ali Jnadi , Hadi Salloum , Pavel Osinenko

Reward shaping (RS) is a powerful method in reinforcement learning (RL) for overcoming the problem of sparse or uninformative rewards. However, RS typically relies on manually engineered shaping-reward functions whose construction is…

Potential-based reward shaping provides an approach for designing good reward functions, with the purpose of speeding up learning. However, automatically finding potential functions for complex environments is a difficult problem (in fact,…

Machine Learning · Computer Science 2021-02-22 Martin Klissarov , Doina Precup
‹ Prev 1 2 3 10 Next ›