Related papers: Gradient Optimization for Single-State RMDPs

Long-term Planning by Short-term Prediction

We consider planning problems, that often arise in autonomous driving applications, in which an agent should decide on immediate actions so as to optimize a long term objective. For example, when a car tries to merge in a roundabout it…

Machine Learning · Computer Science 2016-02-05 Shai Shalev-Shwartz, Nir Ben-Zrihem, Aviad Cohen, Amnon Shashua

Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving Policies

Autonomous cars are well known for being vulnerable to adversarial attacks that can compromise the safety of the car and pose danger to other road users. To effectively defend against adversaries, it is required to not only test autonomous…

Artificial Intelligence · Computer Science 2023-02-22 Aizaz Sharif, Dusica Marijan

Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions

Learning in MDPs with highly complex state representations is currently possible due to multiple advancements in reinforcement learning algorithm design. However, this incline in complexity, and furthermore the increase in the dimensions of…

Machine Learning · Computer Science 2023-06-12 Ezgi Korkmaz, Jonah Brown-Cohen

Explainable Data-Driven Optimization: From Context to Decision and Back Again

Data-driven optimization uses contextual information and machine learning algorithms to find solutions to decision problems with uncertain parameters. While a vast body of work is dedicated to interpreting machine learning models in the…

Machine Learning · Computer Science 2023-07-21 Alexandre Forel, Axel Parmentier, Thibaut Vidal

Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty

In this paper, we focus on the problem of robustifying reinforcement learning (RL) algorithms with respect to model uncertainties. Indeed, in the framework of model-based RL, we propose to merge the theory of constrained Markov decision…

Machine Learning · Computer Science 2020-10-13 Reazul Hasan Russel, Mouhacine Benosman, Jeroen Van Baar

Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations

Model-based Reinforcement Learning and Control have demonstrated great potential in various sequential decision making problem domains, including in robotics settings. However, real-world robotics systems often present challenges that limit…

Machine Learning · Computer Science 2023-10-24 Achkan Salehi, Steffen Rühl, Stephane Doncieux

Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model

Reinforcement learning has demonstrated impressive performance in various challenging problems such as robotics, board games, and classical arcade games. However, its real-world applications can be hindered by the absence of robustness and…

Machine Learning · Computer Science 2024-07-02 Siemen Herremans, Ali Anwar, Siegfried Mercelis

Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems

This paper describes a purely data-driven solution to a class of sequential decision-making problems with a large number of concurrent online decisions, with applications to computing systems and operations research. We assume that while…

Artificial Intelligence · Computer Science 2019-10-02 Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran, Harshad Khadilkar

Using Approximate Models in Robot Learning

Trajectory following is one of the complicated control problems when its dynamics are nonlinear, stochastic and include a large number of parameters. The problem has significant difficulties including a large number of trials required for…

Robotics · Computer Science 2019-02-14 Ali Lenjani

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated negotiation skills with other road users when overtaking, giving way, merging, taking left and right turns and while pushing ahead in unstructured…

Artificial Intelligence · Computer Science 2016-10-12 Shai Shalev-Shwartz, Shaked Shammah, Amnon Shashua

Learning while Respecting Privacy and Robustness to Distributional Uncertainties and Adversarial Data

Data used to train machine learning models can be adversarial--maliciously constructed by adversaries to fool the model. Challenge also arises by privacy, confidentiality, or due to legal constraints when data are geographically gathered…

Machine Learning · Computer Science 2020-07-09 Alireza Sadeghi, Gang Wang, Meng Ma, Georgios B. Giannakis

Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

Many physical systems have underlying safety considerations that require that the policy employed ensures the satisfaction of a set of constraints. The analytical formulation usually takes the form of a Constrained Markov Decision Process…

Machine Learning · Computer Science 2021-03-03 Aria HasanzadeZonuzy, Archana Bura, Dileep Kalathil, Srinivas Shakkottai

Adversarial Agent Behavior Learning in Autonomous Driving Using Deep Reinforcement Learning

Existing approaches in reinforcement learning train an agent to learn desired optimal behavior in an environment with rule based surrounding agents. In safety critical applications such as autonomous driving it is crucial that the rule…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Arjun Srinivasan, Anubhav Paras, Aniket Bera

Learning Hard Optimization Problems: A Data Generation Perspective

Optimization problems are ubiquitous in our societies and are present in almost every segment of the economy. Most of these optimization problems are NP-hard and computationally demanding, often requiring approximate solutions for…

Optimization and Control · Mathematics 2021-06-23 James Kotary, Ferdinando Fioretto, Pascal Van Hentenryck

Optimization for Supervised Machine Learning: Randomized Algorithms for Data and Parameters

Many key problems in machine learning and data science are routinely modeled as optimization problems and solved via optimization algorithms. With the increase of the volume of data and the size and complexity of the statistical models used…

Optimization and Control · Mathematics 2020-08-28 Filip Hanzely

Robust Machine Learning via Privacy/Rate-Distortion Theory

Robust machine learning formulations have emerged to address the prevalent vulnerability of deep neural networks to adversarial examples. Our work draws the connection between optimal robust learning and the privacy-utility tradeoff…

Machine Learning · Computer Science 2021-05-20 Ye Wang, Shuchin Aeron, Adnan Siraj Rakin, Toshiaki Koike-Akino, Pierre Moulin

Decision-Focused Learning: Through the Lens of Learning to Rank

In the last years decision-focused learning framework, also known as predict-and-optimize, have received increasing attention. In this setting, the predictions of a machine learning model are used as estimated cost coefficients in the…

Machine Learning · Computer Science 2022-06-20 Jayanta Mandi, Víctor Bucarey, Maxime Mulamba, Tias Guns

Sample Complexity of Robust Reinforcement Learning with a Generative Model

The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are robust against the parameter uncertainties due to the mismatches between the simulator model and real-world settings. An RMDP problem is…

Machine Learning · Computer Science 2022-05-17 Kishan Panaganti, Dileep Kalathil

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning

In the predict-then-optimize framework, the objective is to train a predictive model, mapping from environment features to parameters of an optimization problem, which maximizes decision quality when the optimization is subsequently solved.…

Machine Learning · Computer Science 2022-07-19 Kai Wang, Sanket Shah, Haipeng Chen, Andrew Perrault, Finale Doshi-Velez, Milind Tambe

Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization

Creating impact in real-world settings requires artificial intelligence techniques to span the full pipeline from data, to predictive models, to decisions. These components are typically approached separately: a machine learning model is…

Machine Learning · Computer Science 2018-11-22 Bryan Wilder, Bistra Dilkina, Milind Tambe