Related papers: Learning MDPs from Features: Predict-Then-Optimize…

Predict-Then-Optimize by Proxy: Learning Joint Models of Prediction and Optimization

Many real-world decision processes are modeled by optimization problems whose defining parameters are unknown and must be inferred from observable data. The Predict-Then-Optimize framework uses machine learning models to predict unknown…

Machine Learning · Computer Science 2023-11-23 James Kotary , Vincenzo Di Vito , Jacob Christopher , Pascal Van Hentenryck , Ferdinando Fioretto

Learning Joint Models of Prediction and Optimization

The Predict-Then-Optimize framework uses machine learning models to predict unknown parameters of an optimization problem from exogenous features before solving. This setting is common to many real-world decision processes, and recently it…

Machine Learning · Computer Science 2024-09-10 James Kotary , Vincenzo Di Vito , Jacob Cristopher , Pascal Van Hentenryck , Ferdinando Fioretto

Multi-Task Predict-then-Optimize

The predict-then-optimize framework arises in a wide variety of applications where the unknown cost coefficients of an optimization problem are first predicted based on contextual features and then used to solve the problem. In this work,…

Optimization and Control · Mathematics 2023-05-02 Bo Tang , Elias B. Khalil

Scaling Up Robust MDPs by Reinforcement Learning

We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, based on a minimax approach to handle uncertainty, can be solved using dynamic…

Machine Learning · Computer Science 2013-06-27 Aviv Tamar , Huan Xu , Shie Mannor

Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework

We present a framework to address a class of sequential decision making problems. Our framework features learning the optimal control policy with robustness to noisy data, determining the unknown state and action parameters, and performing…

Machine Learning · Computer Science 2022-01-20 Amber Srivastava , Srinivasa M Salapaka

Predict-then-Calibrate: A New Perspective of Robust Contextual LP

Contextual optimization, also known as predict-then-optimize or prescriptive analytics, considers an optimization problem with the presence of covariates (context or side information). The goal is to learn a prediction model (from the…

Optimization and Control · Mathematics 2024-05-13 Chunlin Sun , Linyu Liu , Xiaocheng Li

Feature Selection Using Reinforcement Learning

With the decreasing cost of data collection, the space of variables or features that can be used to characterize a particular predictor of interest continues to grow exponentially. Therefore, identifying the most characterizing features…

Machine Learning · Computer Science 2021-01-26 Sali Rasoul , Sodiq Adewole , Alphonse Akakpo

An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making

We present an integrated prediction-optimization (PredOpt) framework to efficiently solve sequential decision-making problems by predicting the values of binary decision variables in an optimal solution. We address the key issues of…

Machine Learning · Computer Science 2023-11-14 Dogacan Yilmaz , İ. Esra Büyüktahtakın

Decision-Focused Learning: Through the Lens of Learning to Rank

In the last years decision-focused learning framework, also known as predict-and-optimize, have received increasing attention. In this setting, the predictions of a machine learning model are used as estimated cost coefficients in the…

Machine Learning · Computer Science 2022-06-20 Jayanta Mandi , Víctor Bucarey , Maxime Mulamba , Tias Guns

A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems

We study a class of sequential decision-making problems with augmented predictions, potentially provided by a machine learning algorithm. In this setting, the decision-maker receives prediction intervals for unknown parameters that become…

Machine Learning · Computer Science 2025-05-05 Xin Chen , Yuze Chen , Yuan Zhou

Feature Reinforcement Learning: Part I: Unstructured MDPs

General-purpose, intelligent, learning agents cycle through sequences of observations, actions, and rewards that are complex, uncertain, unknown, and non-Markovian. On the other hand, reinforcement learning is well-developed for small…

Machine Learning · Computer Science 2009-12-30 Marcus Hutter

Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning

Markov decision processes (MDPs) are used to model a wide variety of applications ranging from game playing over robotics to finance. Their optimal policy typically maximizes the expected sum of rewards given at each step of the decision…

Machine Learning · Computer Science 2025-05-26 Maximilian Nägele , Jan Olle , Thomas Fösel , Remmy Zen , Florian Marquardt

Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization

Creating impact in real-world settings requires artificial intelligence techniques to span the full pipeline from data, to predictive models, to decisions. These components are typically approached separately: a machine learning model is…

Machine Learning · Computer Science 2018-11-22 Bryan Wilder , Bistra Dilkina , Milind Tambe

Derivative-Free Reinforcement Learning: A Review

Reinforcement learning is about learning agent models that make the best sequential decisions in unknown environments. In an unknown environment, the agent needs to explore the environment while exploiting the collected information, which…

Machine Learning · Computer Science 2021-02-12 Hong Qian , Yang Yu

Optimizing Sequential Experimental Design with Deep Reinforcement Learning

Bayesian approaches developed to solve the optimal design of sequential experiments are mathematically elegant but computationally challenging. Recently, techniques using amortization have been proposed to make these Bayesian approaches…

Machine Learning · Computer Science 2022-06-20 Tom Blau , Edwin V. Bonilla , Iadine Chades , Amir Dezfouli

A reinforced learning approach to optimal design under model uncertainty

Optimal designs are usually model-dependent and likely to be sub-optimal if the postulated model is not correctly specified. In practice, it is common that a researcher has a list of candidate models at hand and a design has to be found…

Statistics Theory · Mathematics 2023-03-29 Mingyao Ai , Holger Dette , Zhengfu Liu , Jun Yu

Reinforcement Learning of Markov Decision Processes with Peak Constraints

In this paper, we consider reinforcement learning of Markov Decision Processes (MDP) with peak constraints, where an agent chooses a policy to optimize an objective and at the same time satisfy additional constraints. The agent has to take…

Optimization and Control · Mathematics 2019-12-09 Ather Gattami

Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning

Many sequential decision-making problems that are currently automated, such as those in manufacturing or recommender systems, operate in an environment where there is either little uncertainty, or zero risk of catastrophe. As companies and…

Machine Learning · Computer Science 2023-04-04 Marc Rigter

Reinforcement Learning under Model Mismatch

We study reinforcement learning under model misspecification, where we do not have access to the true environment but only to a reasonably close approximation to it. We address this problem by extending the framework of robust MDPs to the…

Machine Learning · Computer Science 2017-11-10 Aurko Roy , Huan Xu , Sebastian Pokutta

Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement Learning

Offline estimation of the dynamical model of a Markov Decision Process (MDP) is a non-trivial task that greatly depends on the data available in the learning phase. Sometimes the dynamics of the model is invariant with respect to some…

Machine Learning · Computer Science 2023-04-13 Giorgio Angelotti , Nicolas Drougard , Caroline P. C. Chanel