English
Related papers

Related papers: Maximum Entropy Model Rollouts: Fast Model Based P…

200 papers

Maximum entropy models provide the least constrained probability distributions that reproduce statistical properties of experimental datasets. In this work we characterize the learning dynamics that maximizes the log-likelihood in the case…

Disordered Systems and Neural Networks · Physics 2016-09-21 Ulisse Ferrari

Reinforcement learning algorithms can acquire policies for complex tasks autonomously. However, the number of samples required to learn a diverse set of skills can be prohibitively large. While meta-reinforcement learning methods have…

Machine Learning · Computer Science 2020-06-17 Russell Mendonca , Xinyang Geng , Chelsea Finn , Sergey Levine

Model-based reinforcement learning promises to learn an optimal policy from fewer interactions with the environment compared to model-free reinforcement learning by learning an intermediate model of the environment in order to predict…

Machine Learning · Computer Science 2022-06-08 Abhinav Bhatia , Philip S. Thomas , Shlomo Zilberstein

Model-based reinforcement learning approaches carry the promise of being data efficient. However, due to challenges in learning dynamics models that sufficiently match the real-world dynamics, they struggle to achieve the same asymptotic…

Machine Learning · Computer Science 2018-09-17 Ignasi Clavera , Jonas Rothfuss , John Schulman , Yasuhiro Fujita , Tamim Asfour , Pieter Abbeel

Model-based reinforcement learning (MBRL) seeks to enhance data efficiency by learning a model of the environment and generating synthetic rollouts from it. However, accumulated model errors during these rollouts can distort the data…

Machine Learning · Computer Science 2025-04-09 Bernd Frauenknecht , Devdutt Subhasish , Friedrich Solowjow , Sebastian Trimpe

Model-free deep-reinforcement-based learning algorithms have been applied to a range of COPs~\cite{bello2016neural}~\cite{kool2018attention}~\cite{nazari2018reinforcement}. However, these approaches suffer from two key challenges when…

Machine Learning · Computer Science 2022-06-01 Nasrin Sultana , Jeffrey Chan , Tabinda Sarwar , A. K. Qin

Recent advances in reinforcement learning have demonstrated its ability to solve hard agent-environment interaction tasks on a super-human level. However, the application of reinforcement learning methods to practical and real-world tasks…

Artificial Intelligence · Computer Science 2021-12-03 Oleg Svidchenko , Aleksei Shpilman

Model-based reinforcement learning attempts to use an available or learned model to improve the data efficiency of reinforcement learning. This work proposes a one-step lookback approach that jointly learns the deep incremental model and…

Robotics · Computer Science 2025-02-28 Cong Li

Model-based reinforcement learning is an appealing framework for creating agents that learn, plan, and act in sequential environments. Model-based algorithms typically involve learning a transition model that takes a state and an action and…

Machine Learning · Computer Science 2019-06-03 Kavosh Asadi , Dipendra Misra , Seungchan Kim , Michel L. Littman

Model-based deep reinforcement learning has achieved success in various domains that require high sample efficiencies, such as Go and robotics. However, there are some remaining issues, such as planning efficient explorations to learn more…

Machine Learning · Computer Science 2021-07-06 Yao Yao , Li Xiao , Zhicheng An , Wanpeng Zhang , Dijun Luo

Model-free deep reinforcement learning algorithms have been shown to be capable of learning a wide range of robotic skills, but typically require a very large number of samples to achieve good performance. Model-based algorithms, in…

Machine Learning · Computer Science 2017-12-05 Anusha Nagabandi , Gregory Kahn , Ronald S. Fearing , Sergey Levine

We study the problem of efficient exploration in order to learn an accurate model of an environment, modeled as a Markov decision process (MDP). Efficient exploration in this problem requires the agent to identify the regions in which…

We study risk-sensitive reinforcement learning in finite discounted MDPs with recursive entropic risk measures (ERM), where the risk parameter $\beta \neq 0$ controls the agent's risk attitude: $\beta>0$ for risk-averse and $\beta<0$ for…

Machine Learning · Computer Science 2026-05-20 Oliver Mortensen , Mohammad Sadegh Talebi

Model-based reinforcement learning is a widely accepted solution for solving excessive sample demands. However, the predictions of the dynamics models are often not accurate enough, and the resulting bias may incur catastrophic decisions…

Machine Learning · Computer Science 2024-05-03 Wanpeng Zhang , Xi Xiao , Yao Yao , Mingzhe Chen , Dijun Luo

Trajectory optimizers for model-based reinforcement learning, such as the Cross-Entropy Method (CEM), can yield compelling results even in high-dimensional control tasks and sparse-reward environments. However, their sampling inefficiency…

Deep networks have enabled reinforcement learning to scale to more complex and challenging domains, but these methods typically require large quantities of training data. An alternative is to use sample-efficient episodic control methods:…

Machine Learning · Computer Science 2019-11-22 Marta Sarrico , Kai Arulkumaran , Andrea Agostinelli , Pierre Richemond , Anil Anthony Bharath

We present an online model-based reinforcement learning algorithm suitable for controlling complex robotic systems directly in the real world. Unlike prevailing sim-to-real pipelines that rely on extensive offline simulation and model-free…

Robotics · Computer Science 2026-05-07 Fang Nan , Hao Ma , Qinghua Guan , Josie Hughes , Michael Muehlebach , Marco Hutter

Machine Learning models should ideally be compact and robust. Compactness provides efficiency and comprehensibility whereas robustness provides resilience. Both topics have been studied in recent years but in isolation. Here we present a…

Machine Learning · Computer Science 2021-03-16 Omri Armstrong , Ran Gilad-Bachrach

State-of-the-art model-based reinforcement learning methods train policies on imagined rollouts. These rollouts are trajectories generated by a learned dynamics model and are scored by a learned reward model, but without querying the true…

Machine Learning · Computer Science 2026-05-13 Nadav Timor , Ravid Shwartz-Ziv , Micah Goldblum , Yann LeCun , David Harel

The Maximum Entropy Modeling Toolkit supports parameter estimation and prediction for statistical language models in the maximum entropy framework. The maximum entropy framework provides a constructive method for obtaining the unique…

cmp-lg · Computer Science 2008-02-03 Eric Sven Ristad
‹ Prev 1 2 3 10 Next ›