Related papers: Learning Rules from Rewards

Reinforcement Learning

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making…

Machine Learning · Computer Science 2020-06-16 Olivier Buffet , Olivier Pietquin , Paul Weng

Human-Allied Relational Reinforcement Learning

Reinforcement learning (RL) has experienced a second wind in the past decade. While incredibly successful in images and videos, these systems still operate within the realm of propositional tasks ignoring the inherent structure that exists…

Machine Learning · Computer Science 2025-10-21 Fateme Golivand Darvishvand , Hikaru Shindo , Sahil Sidheekh , Kristian Kersting , Sriraam Natarajan

Human-Inspired Multi-Level Reinforcement Learning

Reinforcement learning (RL), a common tool in decision making, learns control policies from various experiences based on the associated cumulative return/rewards without treating them differently. Humans, on the contrary, often learn to…

Machine Learning · Computer Science 2025-11-25 Mingkang Wu , Devin White , Vernon Lawhern , Nicholas R. Waytowich , Yongcan Cao

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi , Amir Hossein Moosavi , Dariush Ebrahimi

Model-Based Reinforcement Learning for Atari

Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more,…

Machine Learning · Computer Science 2024-04-04 Lukasz Kaiser , Mohammad Babaeizadeh , Piotr Milos , Blazej Osinski , Roy H Campbell , Konrad Czechowski , Dumitru Erhan , Chelsea Finn , Piotr Kozakowski , Sergey Levine , Afroz Mohiuddin , Ryan Sepassi , George Tucker , Henryk Michalewski

Partial Policy Gradients for RL in LLMs

Reinforcement learning is a framework for learning to act sequentially in an unknown environment. We propose a natural approach for modeling policy structure in policy gradients. The key idea is to optimize for a subset of future rewards:…

Machine Learning · Computer Science 2026-03-09 Puneet Mathur , Branislav Kveton , Subhojyoti Mukherjee , Viet Dac Lai

Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning

We propose a novel adaptive transfer learning framework, learning to transfer learn (L2TL), to improve performance on a target dataset by careful extraction of the related information from a source dataset. Our framework considers…

Machine Learning · Computer Science 2020-07-17 Linchao Zhu , Sercan O. Arik , Yi Yang , Tomas Pfister

Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions

The potential of reinforcement learning (RL) to deliver aligned and performant agents is partially bottlenecked by the reward engineering problem. One alternative to heuristic trial-and-error is preference-based RL (PbRL), where a reward…

Machine Learning · Computer Science 2021-12-22 Tom Bewley , Freddy Lecue

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

Multi-task representation learning (MTRL) is an approach that learns shared latent representations across related tasks, facilitating collaborative learning that improves the overall learning efficiency. This paper studies MTRL for…

Machine Learning · Computer Science 2026-04-07 Yaoze Guo , Shana Moothedath

Relational Reinforcement Learning in Infinite Mario

Relational representations in reinforcement learning allow for the use of structural information like the presence of objects and relationships between them in the description of value functions. Through this paper, we show that such…

Artificial Intelligence · Computer Science 2012-03-01 Shiwali Mohan , John E. Laird

Reinforcement Learning With Temporal Logic Rewards

Reinforcement learning (RL) depends critically on the choice of reward functions used to capture the desired behavior and constraints of a robot. Usually, these are handcrafted by an expert designer and represent heuristics for relatively…

Artificial Intelligence · Computer Science 2017-03-03 Xiao Li , Cristian-Ioan Vasile , Calin Belta

AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning

One practical challenge in reinforcement learning (RL) is how to make quick adaptations when faced with new environments. In this paper, we propose a principled framework for adaptive RL, called AdaRL, that adapts reliably and…

Machine Learning · Computer Science 2022-03-16 Biwei Huang , Fan Feng , Chaochao Lu , Sara Magliacane , Kun Zhang

Reciprocal Co-Training (RCT): Coupling Gradient-Based and Non-Differentiable Models via Reinforcement Learning

Large language models (LLMs) and classical machine learning methods offer complementary strengths for predictive modeling, yet their fundamentally different representations and training paradigms hinder effective integration: LLMs rely on…

Computation and Language · Computer Science 2026-04-21 Yunshuo Tian , Akayou Kitessa , Tanuja Chitnis , Yijun Zhao

Decision Transformer: Reinforcement Learning via Sequence Modeling

We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence modeling problem. This allows us to draw upon the simplicity and scalability of the Transformer architecture, and associated advances in language modeling…

Machine Learning · Computer Science 2021-06-25 Lili Chen , Kevin Lu , Aravind Rajeswaran , Kimin Lee , Aditya Grover , Michael Laskin , Pieter Abbeel , Aravind Srinivas , Igor Mordatch

Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning

Reinforcement learning (RL) studies how an agent comes to achieve reward in an environment through interactions over time. Recent advances in machine RL have surpassed human expertise at the world's oldest board games and many classic video…

Artificial Intelligence · Computer Science 2021-07-28 Pedro A. Tsividis , Joao Loula , Jake Burga , Nathan Foss , Andres Campero , Thomas Pouncy , Samuel J. Gershman , Joshua B. Tenenbaum

Explaining Conditions for Reinforcement Learning Behaviors from Real and Imagined Data

The deployment of reinforcement learning (RL) in the real world comes with challenges in calibrating user trust and expectations. As a step toward developing RL systems that are able to communicate their competencies, we present a method of…

Machine Learning · Computer Science 2020-11-19 Aastha Acharya , Rebecca Russell , Nisar R. Ahmed

Introduction to Reinforcement Learning

Reinforcement Learning (RL), a subfield of Artificial Intelligence (AI), focuses on training agents to make decisions by interacting with their environment to maximize cumulative rewards. This paper provides an overview of RL, covering its…

Artificial Intelligence · Computer Science 2024-12-04 Majid Ghasemi , Dariush Ebrahimi

Learning Nonlinear Causal Reductions to Explain Reinforcement Learning Policies

Why do reinforcement learning (RL) policies fail or succeed? This is a challenging question due to the complex, high-dimensional nature of agent-environment interactions. In this work, we take a causal perspective on explaining the behavior…

Machine Learning · Statistics 2025-07-22 Armin Kekić , Jan Schneider , Dieter Büchler , Bernhard Schölkopf , Michel Besserve

Robot Representation and Reasoning with Knowledge from Reinforcement Learning

Reinforcement learning (RL) agents aim at learning by interacting with an environment, and are not designed for representing or reasoning with declarative knowledge. Knowledge representation and reasoning (KRR) paradigms are strong in…

Artificial Intelligence · Computer Science 2018-11-26 Keting Lu , Shiqi Zhang , Peter Stone , Xiaoping Chen

Multi-Task Recommendations with Reinforcement Learning

In recent years, Multi-task Learning (MTL) has yielded immense success in Recommender System (RS) applications. However, current MTL-based recommendation models tend to disregard the session-wise patterns of user-item interactions because…

Information Retrieval · Computer Science 2023-03-13 Ziru Liu , Jiejie Tian , Qingpeng Cai , Xiangyu Zhao , Jingtong Gao , Shuchang Liu , Dayou Chen , Tonghao He , Dong Zheng , Peng Jiang , Kun Gai