Related papers: Maximum Entropy Model Rollouts: Fast Model Based P…

Learning Maximum Entropy Models from finite size datasets: a fast Data-Driven algorithm allows sampling from the posterior distribution

Maximum entropy models provide the least constrained probability distributions that reproduce statistical properties of experimental datasets. In this work we characterize the learning dynamics that maximizes the log-likelihood in the case…

Disordered Systems and Neural Networks · Physics 2016-09-21 Ulisse Ferrari

Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling

Reinforcement learning algorithms can acquire policies for complex tasks autonomously. However, the number of samples required to learn a diverse set of skills can be prohibitively large. While meta-reinforcement learning methods have…

Machine Learning · Computer Science 2020-06-17 Russell Mendonca , Xinyang Geng , Chelsea Finn , Sergey Levine

Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL

Model-based reinforcement learning promises to learn an optimal policy from fewer interactions with the environment compared to model-free reinforcement learning by learning an intermediate model of the environment in order to predict…

Machine Learning · Computer Science 2022-06-08 Abhinav Bhatia , Philip S. Thomas , Shlomo Zilberstein

Model-Based Reinforcement Learning via Meta-Policy Optimization

Model-based reinforcement learning approaches carry the promise of being data efficient. However, due to challenges in learning dynamics models that sufficiently match the real-world dynamics, they struggle to achieve the same asymptotic…

Machine Learning · Computer Science 2018-09-17 Ignasi Clavera , Jonas Rothfuss , John Schulman , Yasuhiro Fujita , Tamim Asfour , Pieter Abbeel

On Rollouts in Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) seeks to enhance data efficiency by learning a model of the environment and generating synthetic rollouts from it. However, accumulated model errors during these rollouts can distort the data…

Machine Learning · Computer Science 2025-04-09 Bernd Frauenknecht , Devdutt Subhasish , Friedrich Solowjow , Sebastian Trimpe

Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems

Model-free deep-reinforcement-based learning algorithms have been applied to a range of COPs~\cite{bello2016neural}~\cite{kool2018attention}~\cite{nazari2018reinforcement}. However, these approaches suffer from two key challenges when…

Machine Learning · Computer Science 2022-06-01 Nasrin Sultana , Jeffrey Chan , Tabinda Sarwar , A. K. Qin

Maximum Entropy Model-based Reinforcement Learning

Recent advances in reinforcement learning have demonstrated its ability to solve hard agent-environment interaction tasks on a super-human level. However, the application of reinforcement learning methods to practical and real-world tasks…

Artificial Intelligence · Computer Science 2021-12-03 Oleg Svidchenko , Aleksei Shpilman

Deep Incremental Model Informed Reinforcement Learning for Continuous Robotic Control

Model-based reinforcement learning attempts to use an available or learned model to improve the data efficiency of reinforcement learning. This work proposes a one-step lookback approach that jointly learns the deep incremental model and…

Robotics · Computer Science 2025-02-28 Cong Li

Combating the Compounding-Error Problem with a Multi-step Model

Model-based reinforcement learning is an appealing framework for creating agents that learn, plan, and act in sequential environments. Model-based algorithms typically involve learning a transition model that takes a state and an action and…

Machine Learning · Computer Science 2019-06-03 Kavosh Asadi , Dipendra Misra , Seungchan Kim , Michel L. Littman

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics. However, there are some remaining issues, such as planning efficient explorations to learn more…

Machine Learning · Computer Science 2021-07-06 Yao Yao , Li Xiao , Zhicheng An , Wanpeng Zhang , Dijun Luo

Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning

Model-free deep reinforcement learning algorithms have been shown to be capable of learning a wide range of robotic skills, but typically require a very large number of samples to achieve good performance. Model-based algorithms, in…

Machine Learning · Computer Science 2017-12-05 Anusha Nagabandi , Gregory Kahn , Ronald S. Fearing , Sergey Levine

Active Model Estimation in Markov Decision Processes

We study the problem of efficient exploration in order to learn an accurate model of an environment, modeled as a Markov decision process (MDP). Efficient exploration in this problem requires the agent to identify the regions in which…

Machine Learning · Statistics 2020-06-24 Jean Tarbouriech , Shubhanshu Shekhar , Matteo Pirotta , Mohammad Ghavamzadeh , Alessandro Lazaric

Recursive Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model

We study risk-sensitive reinforcement learning in finite discounted MDPs with recursive entropic risk measures (ERM), where the risk parameter $\beta \neq 0$ controls the agent's risk attitude: $\beta>0$ for risk-averse and $\beta<0$ for…

Machine Learning · Computer Science 2026-05-20 Oliver Mortensen , Mohammad Sadegh Talebi

MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning

Model-based reinforcement learning is a widely accepted solution for solving excessive sample demands. However, the predictions of the dynamics models are often not accurate enough, and the resulting bias may incur catastrophic decisions…

Machine Learning · Computer Science 2024-05-03 Wanpeng Zhang , Xi Xiao , Yao Yao , Mingzhe Chen , Dijun Luo

Sample-efficient Cross-Entropy Method for Real-time Planning

Trajectory optimizers for model-based reinforcement learning, such as the Cross-Entropy Method (CEM), can yield compelling results even in high-dimensional control tasks and sparse-reward environments. However, their sampling inefficiency…

Machine Learning · Computer Science 2020-08-17 Cristina Pinneri , Shambhuraj Sawant , Sebastian Blaes , Jan Achterhold , Joerg Stueckler , Michal Rolinek , Georg Martius

Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control

Deep networks have enabled reinforcement learning to scale to more complex and challenging domains, but these methods typically require large quantities of training data. An alternative is to use sample-efficient episodic control methods:…

Machine Learning · Computer Science 2019-11-22 Marta Sarrico , Kai Arulkumaran , Andrea Agostinelli , Pierre Richemond , Anil Anthony Bharath

Efficient Model-Based Reinforcement Learning for Robot Control via Online Optimization

We present an online model-based reinforcement learning algorithm suitable for controlling complex robotic systems directly in the real world. Unlike prevailing sim-to-real pipelines that rely on extensive offline simulation and model-free…

Robotics · Computer Science 2026-05-07 Fang Nan , Hao Ma , Qinghua Guan , Josie Hughes , Michael Muehlebach , Marco Hutter

Robust Model Compression Using Deep Hypotheses

Machine Learning models should ideally be compact and robust. Compactness provides efficiency and comprehensibility whereas robustness provides resilience. Both topics have been studied in recent years but in isolation. Here we present a…

Machine Learning · Computer Science 2021-03-16 Omri Armstrong , Ran Gilad-Bachrach

On Training in Imagination

State-of-the-art model-based reinforcement learning methods train policies on imagined rollouts. These rollouts are trajectories generated by a learned dynamics model and are scored by a learned reward model, but without querying the true…

Machine Learning · Computer Science 2026-05-13 Nadav Timor , Ravid Shwartz-Ziv , Micah Goldblum , Yann LeCun , David Harel

Maximum Entropy Modeling Toolkit

The Maximum Entropy Modeling Toolkit supports parameter estimation and prediction for statistical language models in the maximum entropy framework. The maximum entropy framework provides a constructive method for obtaining the unique…

cmp-lg · Computer Science 2008-02-03 Eric Sven Ristad