Related papers: Logically-Constrained Reinforcement Learning

Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees

Reinforcement Learning (RL) has emerged as an efficient method of choice for solving complex sequential decision making problems in automatic control, computer science, economics, and biology. In this paper we present a model-free RL…

Logic in Computer Science · Computer Science 2019-09-13 Mohammadhosein Hasanbeig , Yiannis Kantaros , Alessandro Abate , Daniel Kroening , George J. Pappas , Insup Lee

Reinforcement Learning Based Temporal Logic Control with Soft Constraints Using Limit-deterministic Generalized Buchi Automata

This paper studies the control synthesis of motion planning subject to uncertainties. The uncertainties are considered in robot motions and environment properties, giving rise to the probabilistic labeled Markov decision process (PL-MDP). A…

Robotics · Computer Science 2023-01-31 Mingyu Cai , Shaoping Xiao , Zhijun Li , Zhen Kan

Reinforcement Learning Based Temporal Logic Control with Maximum Probabilistic Satisfaction

This paper presents a model-free reinforcement learning (RL) algorithm to synthesize a control policy that maximizes the satisfaction probability of linear temporal logic (LTL) specifications. Due to the consideration of environment and…

Formal Languages and Automata Theory · Computer Science 2022-01-04 Mingyu Cai , Shaoping Xiao , Baoluo Li , Zhiliang Li , Zhen Kan

Certified Reinforcement Learning with Logic Guidance

Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of control problems. However, applications in safety-critical domains require a systematic and formal approach to specifying…

Machine Learning · Computer Science 2023-06-07 Hosein Hasanbeig , Daniel Kroening , Alessandro Abate

Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning

We present a reinforcement learning (RL) framework to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a Markov Decision Process (MDP).…

Robotics · Computer Science 2026-04-07 Alper Kamil Bozkurt , Yu Wang , Michael M. Zavlanos , Miroslav Pajic

LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning

LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given linear temporal specification with maximal probability. LCRL…

Machine Learning · Computer Science 2022-09-22 Hosein Hasanbeig , Daniel Kroening , Alessandro Abate

Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees

Linear Temporal Logic (LTL) is widely used to specify high-level objectives for system policies, and it is highly desirable for autonomous systems to learn the optimal policy with respect to such specifications. However, learning the…

Machine Learning · Computer Science 2023-10-26 Daqian Shao , Marta Kwiatkowska

A Framework for Transforming Specifications in Reinforcement Learning

Reactive synthesis algorithms allow automatic construction of policies to control an environment modeled as a Markov Decision Process (MDP) that are optimal with respect to high-level temporal logic specifications. However, they assume that…

Formal Languages and Automata Theory · Computer Science 2022-05-31 Rajeev Alur , Suguman Bansal , Osbert Bastani , Kishor Jothimurugan

Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic

This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDP) with unknown transition probabilities over continuous state and action spaces. Linear temporal logic (LTL) is used to…

Machine Learning · Computer Science 2022-01-25 Mingyu Cai , Mohammadhosein Hasanbeig , Shaoping Xiao , Alessandro Abate , Zhen Kan

Continuous Motion Planning with Temporal Logic Specifications using Deep Neural Networks

In this paper, we propose a model-free reinforcement learning method to synthesize control policies for motion planning problems with continuous states and actions. The robot is modelled as a labeled discrete-time Markov decision process…

Artificial Intelligence · Computer Science 2020-10-01 Chuanzheng Wang , Yinan Li , Stephen L. Smith , Jun Liu

LTL-Constrained Steady-State Policy Synthesis

Decision-making policies for agents are often synthesized with the constraint that a formal specification of behaviour is satisfied. Here we focus on infinite-horizon properties. On the one hand, Linear Temporal Logic (LTL) is a popular…

Artificial Intelligence · Computer Science 2021-06-01 Jan Křetínský

Accelerated Reinforcement Learning for Temporal Logic Control Objectives

This paper addresses the problem of learning control policies for mobile robots, modeled as unknown Markov Decision Processes (MDPs), that are tasked with temporal logic missions, such as sequencing, coverage, or surveillance. The MDP…

Robotics · Computer Science 2022-07-13 Yiannis Kantaros

A Learning Based Approach to Control Synthesis of Markov Decision Processes for Linear Temporal Logic Specifications

We propose to synthesize a control policy for a Markov decision process (MDP) such that the resulting traces of the MDP satisfy a linear temporal logic (LTL) property. We construct a product MDP that incorporates a deterministic Rabin…

Systems and Control · Computer Science 2014-09-22 Dorsa Sadigh , Eric S. Kim , Samuel Coogan , S. Shankar Sastry , Sanjit A. Seshia

Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration

This paper addresses the problem of learning optimal control policies for systems with uncertain dynamics and high-level control objectives specified as Linear Temporal Logic (LTL) formulas. Uncertainty is considered in the workspace…

Robotics · Computer Science 2024-10-17 Yiannis Kantaros , Jun Wang

Model-Free Reinforcement Learning for Optimal Control of Markov Decision Processes Under Signal Temporal Logic Specifications

We present a model-free reinforcement learning algorithm to find an optimal policy for a finite-horizon Markov decision process while guaranteeing a desired lower bound on the probability of satisfying a signal temporal logic (STL)…

Systems and Control · Electrical Eng. & Systems 2021-09-29 Krishna C. Kalagarla , Rahul Jain , Pierluigi Nuzzo

Learning-based Bounded Synthesis for Semi-MDPs with LTL Specifications

This letter proposes a learning-based bounded synthesis for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic K-co-Büchi automaton (dKcBA)…

Systems and Control · Electrical Eng. & Systems 2022-04-12 Ryohei Oura , Toshimitsu Ushio

Modular Deep Reinforcement Learning with Temporal Logic Specifications

We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal…

Machine Learning · Computer Science 2019-11-26 Lim Zun Yuan , Mohammadhosein Hasanbeig , Alessandro Abate , Daniel Kroening

Logically-Constrained Neural Fitted Q-Iteration

We propose a method for efficient training of Q-functions for continuous-state Markov Decision Processes (MDPs) such that the traces of the resulting policies satisfy a given Linear Temporal Logic (LTL) property. LTL, a modal logic, can…

Machine Learning · Computer Science 2019-03-15 Mohammadhosein Hasanbeig , Alessandro Abate , Daniel Kroening

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

To overcome the curses of dimensionality and modeling of Dynamic Programming (DP) methods to solve Markov Decision Process (MDP) problems, Reinforcement Learning (RL) methods are adopted in practice. Contrary to traditional RL algorithms…

Machine Learning · Computer Science 2021-08-24 Arghyadip Roy , Vivek Borkar , Abhay Karandikar , Prasanna Chaporkar

Formal Policy Synthesis for Continuous-Space Systems via Reinforcement Learning

This paper studies satisfaction of temporal properties on unknown stochastic processes that have continuous state spaces. We show how reinforcement learning (RL) can be applied for computing policies that are finite-memory and deterministic…

Systems and Control · Electrical Eng. & Systems 2020-09-29 Milad Kazemi , Sadegh Soudjani