Related papers: Human-Inspired Multi-Level Reinforcement Learning

Reinforcement Learning

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making…

Machine Learning · Computer Science 2020-06-16 Olivier Buffet , Olivier Pietquin , Paul Weng

Multi-Task Reward Learning from Human Ratings

Reinforcement learning from human feedback (RLHF) has become a key factor in aligning model behavior with users' goals. However, while humans integrate multiple strategies when making decisions, current RLHF approaches often simplify this…

Machine Learning · Computer Science 2025-06-19 Mingkang Wu , Devin White , Evelyn Rose , Vernon Lawhern , Nicholas R Waytowich , Yongcan Cao

Reinforcement Learning from Multi-level and Episodic Human Feedback

Designing an effective reward function has long been a challenge in reinforcement learning, particularly for complex tasks in unstructured environments. To address this, various learning paradigms have emerged that leverage different forms…

Machine Learning · Computer Science 2025-04-29 Muhammad Qasim Elahi , Somtochukwu Oguchienti , Maheed H. Ahmed , Mahsa Ghasemi

Introduction to Reinforcement Learning

Reinforcement Learning (RL), a subfield of Artificial Intelligence (AI), focuses on training agents to make decisions by interacting with their environment to maximize cumulative rewards. This paper provides an overview of RL, covering its…

Artificial Intelligence · Computer Science 2024-12-04 Majid Ghasemi , Dariush Ebrahimi

Multi-task Reinforcement Learning in Reproducing Kernel Hilbert Spaces via Cross-learning

Reinforcement learning (RL) is a framework to optimize a control policy using rewards that are revealed by the system as a response to a control action. In its standard form, RL involves a single agent that uses its policy to accomplish a…

Systems and Control · Electrical Eng. & Systems 2021-11-24 Juan Cervino , Juan Andres Bazerque , Miguel Calvo-Fullana , Alejandro Ribeiro

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi , Amir Hossein Moosavi , Dariush Ebrahimi

Reinforcement Learning from Diverse Human Preferences

The complexity of designing reward functions has been a major obstacle to the wide application of deep reinforcement learning (RL) techniques. Describing an agent's desired behaviors and properties can be difficult, even for experts. A new…

Machine Learning · Computer Science 2024-05-09 Wanqi Xue , Bo An , Shuicheng Yan , Zhongwen Xu

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Reinforcement learning (RL) is a powerful machine learning technique that enables an intelligent agent to learn an optimal policy that maximizes the cumulative rewards in sequential decision making. Most of methods in the existing…

Machine Learning · Statistics 2023-01-06 Chengchun Shi , Zhengling Qi , Jianing Wang , Fan Zhou

A Short Survey On Memory Based Reinforcement Learning

Reinforcement learning (RL) is a branch of machine learning which is employed to solve various sequential decision making problems without proper supervision. Due to the recent advancement of deep learning, the newly proposed Deep-RL…

Artificial Intelligence · Computer Science 2019-04-17 Dhruv Ramani

Reward-Conditioned Reinforcement Learning

Single-task RL agents are typically trained under a fixed reward function, which limits their robustness to reward misspecification and their ability to adapt to changing preferences. We introduce Reward-Conditioned Reinforcement Learning…

Machine Learning · Computer Science 2026-05-20 Michal Nauman , Marek Cygan , Pieter Abbeel

Experiential Explanations for Reinforcement Learning

Reinforcement learning (RL) systems can be complex and non-interpretable, making it challenging for non-AI experts to understand or intervene in their decisions. This is due in part to the sequential nature of RL in which actions are chosen…

Artificial Intelligence · Computer Science 2025-04-16 Amal Alabdulkarim , Madhuri Singh , Gennie Mansi , Kaely Hall , Upol Ehsan , Mark O. Riedl

Reinforcement Learning with Lookahead Information

We study reinforcement learning (RL) problems in which agents observe the reward or transition realizations at their current state before deciding which action to take. Such observations are available in many applications, including…

Machine Learning · Computer Science 2024-10-22 Nadav Merlis

CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning

Reinforcement Learning (RL) is a general framework concerned with an agent that seeks to maximize rewards in an environment. The learning typically happens through trial and error using explorative methods, such as epsilon-greedy. There are…

Machine Learning · Computer Science 2022-10-06 Per-Arne Andersen , Morten Goodwin , Ole-Christoffer Granmo

Reward Shaping for Human Learning via Inverse Reinforcement Learning

Humans are spectacular reinforcement learners, constantly learning from and adjusting to experience and feedback. Unfortunately, this doesn't necessarily mean humans are fast learners. When tasks are challenging, learning can become…

Machine Learning · Computer Science 2022-12-16 Mark A. Rucker , Layne T. Watson , Matthew S. Gerber , Laura E. Barnes

Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL

Reinforcement Learning from Human Feedback (RLHF) aligns Large Language Models (LLMs) with human preferences, yet the underlying reward signals they internalize remain hidden, posing a critical challenge for interpretability and safety.…

Machine Learning · Computer Science 2026-01-21 Nyal Patel , Matthieu Bou , Arjun Jagota , Satyapriya Krishna , Sonali Parbhoo

Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning

Reinforcement learning (RL) studies how an agent comes to achieve reward in an environment through interactions over time. Recent advances in machine RL have surpassed human expertise at the world's oldest board games and many classic video…

Artificial Intelligence · Computer Science 2021-07-28 Pedro A. Tsividis , Joao Loula , Jake Burga , Nathan Foss , Andres Campero , Thomas Pouncy , Samuel J. Gershman , Joshua B. Tenenbaum

Rating-based Reinforcement Learning

This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning…

Machine Learning · Computer Science 2024-01-30 Devin White , Mingkang Wu , Ellen Novoseller , Vernon J. Lawhern , Nicholas Waytowich , Yongcan Cao

Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms

This research focuses on enhancing reinforcement learning (RL) algorithms by integrating penalty functions to guide agents in avoiding unwanted actions while optimizing rewards. The goal is to improve the learning process by ensuring that…

Machine Learning · Computer Science 2025-04-07 Sai Gana Sandeep Pula , Sathish A. P. Kumar , Sumit Jha , Arvind Ramanathan

Modeling and Interpreting Real-world Human Risk Decision Making with Inverse Reinforcement Learning

We model human decision-making behaviors in a risk-taking task using inverse reinforcement learning (IRL) for the purposes of understanding real human decision making under risk. To the best of our knowledge, this is the first work applying…

Machine Learning · Computer Science 2019-06-14 Quanying Liu , Haiyan Wu , Anqi Liu

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu , Tianhe Yu , Chelsea Finn , Stefano Ermon