Related papers: Logically-Constrained Reinforcement Learning

200 papers

Reinforcement Learning (RL) has emerged as an efficient method of choice for solving complex sequential decision making problems in automatic control, computer science, economics, and biology. In this paper we present a model-free RL…

Logic in Computer Science · Computer Science 2019-09-13 Mohammadhosein Hasanbeig, Yiannis Kantaros, Alessandro Abate, Daniel Kroening, George J. Pappas, Insup Lee

This paper studies the control synthesis of motion planning subject to uncertainties. The uncertainties are considered in robot motions and environment properties, giving rise to the probabilistic labeled Markov decision process (PL-MDP). A…

Robotics · Computer Science 2023-01-31 Mingyu Cai, Shaoping Xiao, Zhijun Li, Zhen Kan

This paper presents a model-free reinforcement learning (RL) algorithm to synthesize a control policy that maximizes the satisfaction probability of linear temporal logic (LTL) specifications. Due to the consideration of environment and…

Formal Languages and Automata Theory · Computer Science 2022-01-04 Mingyu Cai, Shaoping Xiao, Baoluo Li, Zhiliang Li, Zhen Kan

Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of control problems. However, applications in safety-critical domains require a systematic and formal approach to specifying…

Machine Learning · Computer Science 2023-06-07 Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

We present a reinforcement learning (RL) framework to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a Markov Decision Process (MDP).…

Robotics · Computer Science 2026-04-07 Alper Kamil Bozkurt, Yu Wang, Michael M. Zavlanos, Miroslav Pajic

LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given linear temporal specification with maximal probability. LCRL…

Machine Learning · Computer Science 2022-09-22 Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

Linear Temporal Logic (LTL) is widely used to specify high-level objectives for system policies, and it is highly desirable for autonomous systems to learn the optimal policy with respect to such specifications. However, learning the…

Machine Learning · Computer Science 2023-10-26 Daqian Shao, Marta Kwiatkowska

Reactive synthesis algorithms allow automatic construction of policies to control an environment modeled as a Markov Decision Process (MDP) that are optimal with respect to high-level temporal logic specifications. However, they assume that…

Formal Languages and Automata Theory · Computer Science 2022-05-31 Rajeev Alur, Suguman Bansal, Osbert Bastani, Kishor Jothimurugan

This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDP) with unknown transition probabilities over continuous state and action spaces. Linear temporal logic (LTL) is used to…

Machine Learning · Computer Science 2022-01-25 Mingyu Cai, Mohammadhosein Hasanbeig, Shaoping Xiao, Alessandro Abate, Zhen Kan

In this paper, we propose a model-free reinforcement learning method to synthesize control policies for motion planning problems with continuous states and actions. The robot is modelled as a labeled discrete-time Markov decision process…

Artificial Intelligence · Computer Science 2020-10-01 Chuanzheng Wang, Yinan Li, Stephen L. Smith, Jun Liu

Decision-making policies for agents are often synthesized with the constraint that a formal specification of behaviour is satisfied. Here we focus on infinite-horizon properties. On the one hand, Linear Temporal Logic (LTL) is a popular…

Artificial Intelligence · Computer Science 2021-06-01 Jan Křetínský

This paper addresses the problem of learning control policies for mobile robots, modeled as unknown Markov Decision Processes (MDPs), that are tasked with temporal logic missions, such as sequencing, coverage, or surveillance. The MDP…

Robotics · Computer Science 2022-07-13 Yiannis Kantaros

We propose to synthesize a control policy for a Markov decision process (MDP) such that the resulting traces of the MDP satisfy a linear temporal logic (LTL) property. We construct a product MDP that incorporates a deterministic Rabin…

Systems and Control · Computer Science 2014-09-22 Dorsa Sadigh, Eric S. Kim, Samuel Coogan, S. Shankar Sastry, Sanjit A. Seshia

This paper addresses the problem of learning optimal control policies for systems with uncertain dynamics and high-level control objectives specified as Linear Temporal Logic (LTL) formulas. Uncertainty is considered in the workspace…

Robotics · Computer Science 2024-10-17 Yiannis Kantaros, Jun Wang

We present a model-free reinforcement learning algorithm to find an optimal policy for a finite-horizon Markov decision process while guaranteeing a desired lower bound on the probability of satisfying a signal temporal logic (STL)…

Systems and Control · Electrical Eng. & Systems 2021-09-29 Krishna C. Kalagarla, Rahul Jain, Pierluigi Nuzzo

This letter proposes a learning-based bounded synthesis for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic $K$-co-Büchi automaton (d$K$cBA)…

Systems and Control · Electrical Eng. & Systems 2022-04-12 Ryohei Oura, Toshimitsu Ushio

We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal…

Machine Learning · Computer Science 2019-11-26 Lim Zun Yuan, Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

We propose a method for efficient training of Q-functions for continuous-state Markov Decision Processes (MDPs) such that the traces of the resulting policies satisfy a given Linear Temporal Logic (LTL) property. LTL, a modal logic, can…

Machine Learning · Computer Science 2019-03-15 Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

To overcome the curses of dimensionality and modeling of Dynamic Programming (DP) methods to solve Markov Decision Process (MDP) problems, Reinforcement Learning (RL) methods are adopted in practice. Contrary to traditional RL algorithms…

Machine Learning · Computer Science 2021-08-24 Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

This paper studies satisfaction of temporal properties on unknown stochastic processes that have continuous state spaces. We show how reinforcement learning (RL) can be applied for computing policies that are finite-memory and deterministic…

Systems and Control · Electrical Eng. & Systems 2020-09-29 Milad Kazemi, Sadegh Soudjani