Related papers: Potential-Based Shaping and Q-Value Initialization…

Improving the Effectiveness of Potential-Based Reward Shaping in Reinforcement Learning

Potential-based reward shaping is commonly used to incorporate prior knowledge of how to solve the task into reinforcement learning because it can formally guarantee policy invariance. As such, the optimal policy and the ordering of…

Machine Learning · Computer Science 2025-02-04 Henrik Müller , Daniel Kudenko

From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge

Q-shaping is an extension of Q-value initialization and serves as an alternative to reward shaping for incorporating domain knowledge to accelerate agent training, thereby improving sample efficiency by directly shaping Q-values. This…

Artificial Intelligence · Computer Science 2024-10-03 Xiefeng Wu

Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement Learning

Reinforcement learning, which acquires a policy maximizing long-term rewards, has been actively studied. Unfortunately, this learning type is too slow and difficult to use in practical situations because the state-action space becomes huge…

Machine Learning · Computer Science 2024-10-28 Takato Okudo , Seiji Yamada

Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

Reward shaping is an effective technique for incorporating domain knowledge into reinforcement learning (RL). Existing approaches such as potential-based reward shaping normally make full use of a given shaping reward function. However,…

Machine Learning · Computer Science 2020-11-06 Yujing Hu , Weixun Wang , Hangtian Jia , Yixiang Wang , Yingfeng Chen , Jianye Hao , Feng Wu , Changjie Fan

Potential-based Reward Shaping in Sokoban

Learning to solve sparse-reward reinforcement learning problems is difficult, due to the lack of guidance towards the goal. But in some problems, prior knowledge can be used to augment the learning process. Reward shaping is a way to…

Machine Learning · Computer Science 2021-09-14 Zhao Yang , Mike Preuss , Aske Plaat

Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning

Providing reinforcement learning agents with informationally rich human knowledge can dramatically improve various aspects of learning. Prior work has developed different kinds of shaping methods that enable agents to learn efficiently in…

Human-Computer Interaction · Computer Science 2018-11-13 Chao Yu , Tianpei Yang , Wenxuan Zhu , Dongxu wang , Guangliang Li

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

Reinforcement learning provides an automated framework for learning behaviors from high-level reward specifications, but in practice the choice of reward function can be crucial for good results -- while in principle the reward only needs…

Machine Learning · Computer Science 2022-10-19 Abhishek Gupta , Aldo Pacchiano , Yuexiang Zhai , Sham M. Kakade , Sergey Levine

Calculus on MDPs: Potential Shaping as a Gradient

In reinforcement learning, different reward functions can be equivalent in terms of the optimal policies they induce. A particularly well-known and important example is potential shaping, a class of functions that can be added to any reward…

Machine Learning · Computer Science 2022-12-06 Erik Jenner , Herke van Hoof , Adam Gleave

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks

In continuing tasks, average-reward reinforcement learning may be a more appropriate problem formulation than the more common discounted reward formulation. As usual, learning an optimal policy in this setting typically requires a large…

Artificial Intelligence · Computer Science 2023-01-18 Yuqian Jiang , Sudarshanan Bharadwaj , Bo Wu , Rishi Shah , Ufuk Topcu , Peter Stone

Shaping Advice in Deep Reinforcement Learning

Reinforcement learning involves agents interacting with an environment to complete tasks. When rewards provided by the environment are sparse, agents may not receive immediate feedback on the quality of actions that they take, thereby…

Multiagent Systems · Computer Science 2022-02-22 Baicen Xiao , Bhaskar Ramasubramanian , Radha Poovendran

Benchmarking Potential Based Rewards for Learning Humanoid Locomotion

The main challenge in developing effective reinforcement learning (RL) pipelines is often the design and tuning the reward functions. Well-designed shaping reward can lead to significantly faster learning. Naively formulated rewards,…

Robotics · Computer Science 2023-07-20 Se Hwan Jeon , Steve Heim , Charles Khazoom , Sangbae Kim

Reinforcement Learning by Comparing Immediate Reward

This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate rewards using a variation of Q-Learning algorithm. Unlike the conventional Q-Learning, the proposed algorithm compares current reward with…

Machine Learning · Computer Science 2010-09-15 Punit Pandey , Deepshikha Pandey , Shishir Kumar

A new Potential-Based Reward Shaping for Reinforcement Learning Agent

Potential-based reward shaping (PBRS) is a particular category of machine learning methods which aims to improve the learning speed of a reinforcement learning agent by extracting and utilizing extra knowledge while performing a task. There…

Artificial Intelligence · Computer Science 2023-03-15 Babak Badnava , Mona Esmaeili , Nasser Mozayani , Payman Zarkesh-Ha

Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

Inverse reinforcement learning (IRL) is computationally challenging, with common approaches requiring the solution of multiple reinforcement learning (RL) sub-problems. This work motivates the use of potential-based reward shaping to reduce…

Machine Learning · Computer Science 2023-12-19 Lauren H. Cooke , Harvey Klyne , Edwin Zhang , Cassidy Laidlaw , Milind Tambe , Finale Doshi-Velez

Reward Shaping via Meta-Learning

Reward shaping is one of the most effective methods to tackle the crucial yet challenging problem of credit assignment in Reinforcement Learning (RL). However, designing shaping functions usually requires much expert knowledge and…

Machine Learning · Computer Science 2019-01-29 Haosheng Zou , Tongzheng Ren , Dong Yan , Hang Su , Jun Zhu

Potential-Based Advice for Stochastic Policy Learning

This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies. We show that a potential-based reward shaping scheme is able to…

Machine Learning · Computer Science 2019-07-23 Baicen Xiao , Bhaskar Ramasubramanian , Andrew Clark , Hannaneh Hajishirzi , Linda Bushnell , Radha Poovendran

Reward Shaping for Improved Learning in Real-time Strategy Game Play

We investigate the effect of reward shaping in improving the performance of reinforcement learning in the context of the real-time strategy, capture-the-flag game. The game is characterized by sparse rewards that are associated with…

Machine Learning · Computer Science 2023-11-29 John Kliem , Prithviraj Dasgupta

Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications

The aim of Reinforcement Learning (RL) in real-world applications is to create systems capable of making autonomous decisions by learning from their environment through trial and error. This paper emphasizes the importance of reward…

Machine Learning · Computer Science 2024-12-31 Sinan Ibrahim , Mostafa Mostafa , Ali Jnadi , Hadi Salloum , Pavel Osinenko

Learning to Shape Rewards using a Game of Two Partners

Reward shaping (RS) is a powerful method in reinforcement learning (RL) for overcoming the problem of sparse or uninformative rewards. However, RS typically relies on manually engineered shaping-reward functions whose construction is…

Machine Learning · Computer Science 2023-02-07 David Mguni , Taher Jafferjee , Jianhong Wang , Nicolas Perez-Nieves , Tianpei Yang , Matthew Taylor , Wenbin Song , Feifei Tong , Hui Chen , Jiangcheng Zhu , Jun Wang , Yaodong Yang

Reward Propagation Using Graph Convolutional Networks

Potential-based reward shaping provides an approach for designing good reward functions, with the purpose of speeding up learning. However, automatically finding potential functions for complex environments is a difficult problem (in fact,…

Machine Learning · Computer Science 2021-02-22 Martin Klissarov , Doina Precup