Related papers: Maximum Likelihood Reinforcement Learning

Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning

While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on estimating a reward function that best explains an expert agent's policy or demonstrated behavior on a control task, it is often the case that such…

Machine Learning · Computer Science 2020-05-01 Dexter R. R. Scobee, S. Shankar Sastry

Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models

Reinforcement learning is used to align language models with human preference signals after first pre-training the model to predict the next token of text within a large corpus using likelihood maximization. Before being deployed in a…

Computation and Language · Computer Science 2024-08-30 Alec Solway

MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Reinforcement learning (RL) algorithms aim to balance exploiting the current best strategy with exploring new options that could lead to higher rewards. Most common RL algorithms use undirected exploration, i.e., select random sequences of…

Machine Learning · Computer Science 2025-08-01 Bhavya Sukhija, Stelian Coros, Andreas Krause, Pieter Abbeel, Carmelo Sferrazza

Inverse Reinforcement Learning with Explicit Policy Estimates

Various methods for solving the inverse reinforcement learning (IRL) problem have been developed independently in machine learning and economics. In particular, the method of Maximum Causal Entropy IRL is based on the perspective of entropy…

Machine Learning · Computer Science 2021-03-05 Navyata Sanghvi, Shinnosuke Usami, Mohit Sharma, Joachim Groeger, Kris Kitani

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Reinforcement Learning with Algorithms from Probabilistic Structure Estimation

Reinforcement learning (RL) algorithms aim to learn optimal decisions in unknown environments through experience of taking actions and observing the rewards gained. In some cases, the environment is not influenced by the actions of the RL…

Machine Learning · Computer Science 2022-06-02 Jonathan P. Epperlein, Roman Overko, Sergiy Zhuk, Christopher King, Djallel Bouneffouf, Andrew Cullen, Robert Shorten

On the Performance of Maximum Likelihood Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) addresses the problem of recovering a task description given a demonstration of the optimal policy used to solve such a task. The optimal policy is usually provided by an expert or teacher, making IRL…

Machine Learning · Computer Science 2012-02-09 Héctor Ratia, Luis Montesano, Ruben Martinez-Cantin

Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives

The performance of reinforcement learning (RL) algorithms is sensitive to the choice of hyperparameters, with the learning rate being particularly influential. RL algorithms fail to reach convergence or demand an extensive number of samples…

Machine Learning · Computer Science 2024-08-09 Aida Afshar, Aldo Pacchiano

Automating Predictive Modeling Process using Reinforcement Learning

Building a good predictive model requires an array of activities such as data imputation, feature transformations, estimator selection, hyper-parameter search and ensemble construction. Given the large, complex and heterogenous space of…

Machine Learning · Computer Science 2019-03-06 Udayan Khurana, Horst Samulowitz

Constrained Variational Policy Optimization for Safe Reinforcement Learning

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from instability issues and lack optimality…

Machine Learning · Computer Science 2022-06-20 Zuxin Liu, Zhepeng Cen, Vladislav Isenbaev, Wei Liu, Zhiwei Steven Wu, Bo Li, Ding Zhao

InfoRL: Interpretable Reinforcement Learning using Information Maximization

Recent advances in reinforcement learning have proved that given an environment we can learn to perform a task in that environment if we have access to some form of a reward function (dense, sparse or derived from IRL). But most of the…

Machine Learning · Computer Science 2019-05-28 Aadil Hayat, Utsav Singh, Vinay P. Namboodiri

Safe Reinforcement Learning via Probabilistic Shields

This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent models to capture such planning problems. Reinforcement learning…

Artificial Intelligence · Computer Science 2019-11-26 Nils Jansen, Bettina Könighofer, Sebastian Junges, Alexandru C. Serban, Roderick Bloem

Learning Temporal Point Processes via Reinforcement Learning

Social goods, such as healthcare, smart city, and information networks, often produce ordered event data in continuous time. The generative processes of these event data can be very complex, requiring flexible models to capture their…

Machine Learning · Computer Science 2020-12-29 Shuang Li, Shuai Xiao, Shixiang Zhu, Nan Du, Yao Xie, Le Song

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Reinforcement learning (RL) is a powerful machine learning technique that enables an intelligent agent to learn an optimal policy that maximizes the cumulative rewards in sequential decision making. Most of methods in the existing…

Machine Learning · Statistics 2023-01-06 Chengchun Shi, Zhengling Qi, Jianing Wang, Fan Zhou

Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning with Application to Autonomous Driving

In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning (RL) become increasingly powerful at solving the forward problem: given…

Robotics · Computer Science 2020-06-25 Zheng Wu, Liting Sun, Wei Zhan, Chenyu Yang, Masayoshi Tomizuka

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert. Many algorithms for IRL have an inherently nested…

Machine Learning · Computer Science 2022-11-02 Siliang Zeng, Chenliang Li, Alfredo Garcia, Mingyi Hong

Sampling-Based Safe Reinforcement Learning

Safe exploration remains a fundamental challenge in reinforcement learning (RL), limiting the deployment of RL agents in the real world. We propose Sampling-Based Safe Reinforcement Learning (SBSRL), a model-based RL algorithm that…

Machine Learning · Computer Science 2026-05-20 Luca Vignola, Bruce D. Lee, Manish Prajapat, Manuel Wendl, Melanie Zeilinger, Andreas Krause, Yarden As

Maximum Reward Formulation In Reinforcement Learning

Reinforcement learning (RL) algorithms typically deal with maximizing the expected cumulative return (discounted or undiscounted, finite or infinite horizon). However, several crucial applications in the real world, such as drug discovery,…

Machine Learning · Computer Science 2023-12-20 Sai Krishna Gottipati, Yashaswi Pathak, Rohan Nuttall, Sahir, Raviteja Chunduru, Ahmed Touati, Sriram Ganapathi Subramanian, Matthew E. Taylor, Sarath Chandar

Multi-task Reinforcement Learning in Reproducing Kernel Hilbert Spaces via Cross-learning

Reinforcement learning (RL) is a framework to optimize a control policy using rewards that are revealed by the system as a response to a control action. In its standard form, RL involves a single agent that uses its policy to accomplish a…

Systems and Control · Electrical Eng. & Systems 2021-11-24 Juan Cervino, Juan Andres Bazerque, Miguel Calvo-Fullana, Alejandro Ribeiro

Robust optimal well control using an adaptive multi-grid reinforcement learning framework

Reinforcement learning (RL) is a promising tool to solve robust optimal well control problems where the model parameters are highly uncertain, and the system is partially observable in practice. However, RL of robust control policies often…

Machine Learning · Computer Science 2022-07-14 Atish Dixit, Ahmed H. ElSheikh