Related papers: Self-Consistent Models and Values

Learning Powerful Policies by Using Consistent Dynamics Model

Model-based Reinforcement Learning approaches have the promise of being sample efficient. Much of the progress in learning dynamics models in RL has been made by learning models via supervised learning. But traditional model-based…

Machine Learning · Computer Science 2019-06-12 Shagun Sodhani , Anirudh Goyal , Tristan Deleu , Yoshua Bengio , Sergey Levine , Jian Tang

Should Models Be Accurate?

Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in…

Machine Learning · Computer Science 2022-05-24 Esra'a Saleh , John D. Martin , Anna Koop , Arash Pourzarabi , Michael Bowling

Simplified Temporal Consistency Reinforcement Learning

Reinforcement learning is able to solve complex sequential decision-making tasks but is currently limited by sample efficiency and required computation. To improve sample efficiency, recent work focuses on model-based RL which interleaves…

Machine Learning · Computer Science 2023-06-19 Yi Zhao , Wenshuai Zhao , Rinu Boney , Juho Kannala , Joni Pajarinen

The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback

Reinforcement learning (RL) has demonstrated potential in enhancing the reasoning capabilities of large language models (LLMs), but such training typically demands substantial efforts in creating and annotating data. In this work, we…

Computation and Language · Computer Science 2025-10-06 Hangfan Zhang , Siyuan Xu , Zhimeng Guo , Huaisheng Zhu , Shicheng Liu , Xinrun Wang , Qiaosheng Zhang , Yang Chen , Peng Ye , Lei Bai , Shuyue Hu

A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments

Reinforcement learning (RL) algorithms find applications in inventory control, recommender systems, vehicular traffic management, cloud computing and robotics. The real-world complications of many tasks arising in these domains makes them…

Machine Learning · Computer Science 2021-06-03 Sindhu Padakandla

Learning Dynamics Models for Model Predictive Agents

Model-Based Reinforcement Learning involves learning a \textit{dynamics model} from data, and then using this model to optimise behaviour, most often with an online \textit{planner}. Much of the recent research along these lines presents a…

Machine Learning · Computer Science 2021-09-30 Michael Lutter , Leonard Hasenclever , Arunkumar Byravan , Gabriel Dulac-Arnold , Piotr Trochim , Nicolas Heess , Josh Merel , Yuval Tassa

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo , Allan Zhou , Archit Sharma , Chelsea Finn

Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents

Training sophisticated agents for optimal decision-making under uncertainty has been key to the rapid development of modern autonomous systems across fields. Notably, model-free reinforcement learning (RL) has enabled decision-making agents…

Machine Learning · Computer Science 2025-07-21 Thomas Banker , Ali Mesbah

External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling

Unlike reinforcement learning (RL) agents, humans remain capable multitaskers in changing environments. In spite of only experiencing the world through their own observations and interactions, people know how to balance focusing on tasks…

Artificial Intelligence · Computer Science 2024-07-02 Rishav Bhagat , Jonathan Balloch , Zhiyu Lin , Julia Kim , Mark Riedl

Self-Adapting Goals Allow Transfer of Predictive Models to New Tasks

A long-standing challenge in Reinforcement Learning is enabling agents to learn a model of their environment which can be transferred to solve other problems in a world with the same underlying rules. One reason this is difficult is the…

Machine Learning · Computer Science 2019-05-16 Kai Olav Ellefsen , Jim Torresen

Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning

Learning models of the environment from pure interaction is often considered an essential component of building lifelong reinforcement learning agents. However, the common practice in model-based reinforcement learning is to learn models…

Machine Learning · Computer Science 2023-06-13 Safa Alver , Doina Precup

Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change

Real-world autonomous decision-making systems, from robots to recommendation engines, must operate in environments that change over time. While deep reinforcement learning (RL) has shown an impressive ability to learn optimal policies in…

Machine Learning · Computer Science 2025-05-16 Jonathan Clifford Balloch

Bridging adaptive management and reinforcement learning for more robust decisions

From out-competing grandmasters in chess to informing high-stakes healthcare decisions, emerging methods from artificial intelligence are increasingly capable of making complex and strategic decisions in diverse, high-dimensional, and…

Computers and Society · Computer Science 2024-03-05 Melissa Chapman , Lily Xu , Marcus Lapeyrolerie , Carl Boettiger

Reward Models in Deep Reinforcement Learning: A Survey

In reinforcement learning (RL), agents continually interact with the environment and use the feedback to refine their behavior. To guide policy optimization, reward models are introduced as proxies of the desired objectives, such that when…

Machine Learning · Computer Science 2025-06-19 Rui Yu , Shenghua Wan , Yucen Wang , Chen-Xiao Gao , Le Gan , Zongzhang Zhang , De-Chuan Zhan

Regularizing Model-Based Planning with Energy-Based Models

Model-based reinforcement learning could enable sample-efficient learning by quickly acquiring rich knowledge about the world and using it to improve behaviour without additional data. Learned dynamics models can be directly used for…

Machine Learning · Computer Science 2019-10-15 Rinu Boney , Juho Kannala , Alexander Ilin

Planning with Goal-Conditioned Policies

Planning methods can solve temporally extended sequential decision making problems by composing simple behaviors. However, planning requires suitable abstractions for the states and transitions, which typically need to be designed by hand.…

Machine Learning · Computer Science 2019-11-20 Soroush Nasiriany , Vitchyr H. Pong , Steven Lin , Sergey Levine

Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future

In model-based reinforcement learning, the agent interleaves between model learning and planning. These two components are inextricably intertwined. If the model is not able to provide sensible long-term prediction, the executed planner…

Machine Learning · Statistics 2019-03-19 Nan Rosemary Ke , Amanpreet Singh , Ahmed Touati , Anirudh Goyal , Yoshua Bengio , Devi Parikh , Dhruv Batra

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective

While reinforcement learning (RL) methods that learn an internal model of the environment have the potential to be more sample efficient than their model-free counterparts, learning to model raw observations from high dimensional sensors…

Machine Learning · Computer Science 2023-06-27 Raj Ghugare , Homanga Bharadhwaj , Benjamin Eysenbach , Sergey Levine , Ruslan Salakhutdinov

The Value Equivalence Principle for Model-Based Reinforcement Learning

Learning models of the environment from data is often viewed as an essential component to building intelligent reinforcement learning (RL) agents. The common practice is to separate the learning of the model from its use, by constructing a…

Machine Learning · Computer Science 2020-11-09 Christopher Grimm , André Barreto , Satinder Singh , David Silver

Act Better by Timing: A timing-Aware Reinforcement Learning for Autonomous Driving

Autonomous vehicles inevitably encounter a vast array of scenarios in real-world environments. Addressing long-tail scenarios, particularly those involving intensive interactions with numerous traffic participants, remains one of the most…

Robotics · Computer Science 2024-12-16 Guanzhou Li , Jianping Wu , Yujing He