Related papers: Interpretable Apprenticeship Learning with Tempora…

Accelerated Reinforcement Learning for Temporal Logic Control Objectives

This paper addresses the problem of learning control policies for mobile robots, modeled as unknown Markov Decision Processes (MDPs), that are tasked with temporal logic missions, such as sequencing, coverage, or surveillance. The MDP…

Robotics · Computer Science 2022-07-13 Yiannis Kantaros

A Learning Based Approach to Control Synthesis of Markov Decision Processes for Linear Temporal Logic Specifications

We propose to synthesize a control policy for a Markov decision process (MDP) such that the resulting traces of the MDP satisfy a linear temporal logic (LTL) property. We construct a product MDP that incorporates a deterministic Rabin…

Systems and Control · Computer Science 2014-09-22 Dorsa Sadigh, Eric S. Kim, Samuel Coogan, S. Shankar Sastry, Sanjit A. Seshia

Continuous Motion Planning with Temporal Logic Specifications using Deep Neural Networks

In this paper, we propose a model-free reinforcement learning method to synthesize control policies for motion planning problems with continuous states and actions. The robot is modelled as a labeled discrete-time Markov decision process…

Artificial Intelligence · Computer Science 2020-10-01 Chuanzheng Wang, Yinan Li, Stephen L. Smith, Jun Liu

Learning Probabilistic Temporal Logic Specifications for Stochastic Systems

There has been substantial progress in the inference of formal behavioural specifications from sample trajectories, for example, using Linear Temporal Logic (LTL). However, these techniques cannot handle specifications that correctly…

Logic in Computer Science · Computer Science 2025-05-20 Rajarshi Roy, Yash Pote, David Parker, Marta Kwiatkowska

Logically-Constrained Neural Fitted Q-Iteration

We propose a method for efficient training of Q-functions for continuous-state Markov Decision Processes (MDPs) such that the traces of the resulting policies satisfy a given Linear Temporal Logic (LTL) property. LTL, a modal logic, can…

Machine Learning · Computer Science 2019-03-15 Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

MDP Optimal Control under Temporal Logic Constraints

In this paper, we develop a method to automatically generate a control policy for a dynamical system modeled as a Markov Decision Process (MDP). The control specification is given as a Linear Temporal Logic (LTL) formula over a set of…

Robotics · Computer Science 2011-03-24 Xu Chu Ding, Stephen L. Smith, Calin Belta, Daniela Rus

Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees

Linear Temporal Logic (LTL) is widely used to specify high-level objectives for system policies, and it is highly desirable for autonomous systems to learn the optimal policy with respect to such specifications. However, learning the…

Machine Learning · Computer Science 2023-10-26 Daqian Shao, Marta Kwiatkowska

Optimal Control of Partially Observable Markov Decision Processes with Finite Linear Temporal Logic Constraints

Autonomous agents often operate in scenarios where the state is partially observed. In addition to maximizing their cumulative reward, agents must execute complex tasks with rich temporal and logical structures. These tasks can be expressed…

Systems and Control · Electrical Eng. & Systems 2022-03-18 Krishna C. Kalagarla, Dhruva Kartik, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

Learning Interpretable Classifiers for PDDL Planning

We consider the problem of synthesizing interpretable models that recognize the behaviour of an agent compared to other agents, on a whole set of similar planning tasks expressed in PDDL. Our approach consists in learning logical formulas,…

Artificial Intelligence · Computer Science 2024-10-15 Arnaud Lequen

Optimal Control of Markov Decision Processes for Efficiency with Linear Temporal Logic Tasks

We investigate the problem of optimal control synthesis for Markov Decision Processes (MDPs), addressing both qualitative and quantitative objectives. Specifically, we require the system to satisfy a qualitative task specified by a Linear…

Systems and Control · Electrical Eng. & Systems 2025-09-19 Yu Chen, Xuanyuan Yin, Shaoyuan Li, Xiang Yin

Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration

This paper addresses the problem of learning optimal control policies for systems with uncertain dynamics and high-level control objectives specified as Linear Temporal Logic (LTL) formulas. Uncertainty is considered in the workspace…

Robotics · Computer Science 2024-10-17 Yiannis Kantaros, Jun Wang

Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

We consider partially observable Markov decision processes (POMDPs), which are a standard framework for robotics applications to model uncertainties present in the real world, with temporal logic specifications. All temporal logic…

Logic in Computer Science · Computer Science 2015-02-19 Krishnendu Chatterjee, Martin Chmelík, Raghav Gupta, Ayush Kanodia

Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning

We present a reinforcement learning (RL) framework to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a Markov Decision Process (MDP).…

Robotics · Computer Science 2026-04-07 Alper Kamil Bozkurt, Yu Wang, Michael M. Zavlanos, Miroslav Pajic

Optimal Control of MDPs with Temporal Logic Constraints

In this paper, we focus on formal synthesis of control policies for finite Markov decision processes with non-negative real-valued costs. We develop an algorithm to automatically generate a policy that guarantees the satisfaction of a…

Logic in Computer Science · Computer Science 2013-09-10 Maria Svorenova, Ivana Cerna, Calin Belta

Directed Exploration in Reinforcement Learning from Linear Temporal Logic

Linear temporal logic (LTL) is a powerful language for task specification in reinforcement learning, as it allows describing objectives beyond the expressivity of conventional discounted return formulations. Nonetheless, recent works have…

Machine Learning · Computer Science 2025-06-11 Marco Bagatella, Andreas Krause, Georg Martius

Optimal Mission Planner with Timed Temporal Logic Constraints

In this paper, we present an optimization-based method for path planning of a mobile robot subject to time-bounded temporal constraints in a dynamic environment. Temporal logic (TL) can address very complex task specification such as…

Systems and Control · Computer Science 2016-04-29 Yuchen Zhou, Dipankar Maity, John S. Baras

A Framework for Transforming Specifications in Reinforcement Learning

Reactive synthesis algorithms allow automatic construction of policies to control an environment modeled as a Markov Decision Process (MDP) that are optimal with respect to high-level temporal logic specifications. However, they assume that…

Formal Languages and Automata Theory · Computer Science 2022-05-31 Rajeev Alur, Suguman Bansal, Osbert Bastani, Kishor Jothimurugan

Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward

Linear Temporal Logic (LTL) is a formal way of specifying complex objectives for planning problems modeled as Markov Decision Processes (MDPs). The planning problem aims to find the optimal policy that maximizes the satisfaction probability…

Robotics · Computer Science 2024-08-13 Zetong Xuan, Yu Wang

LTL2Action: Generalizing LTL Instructions for Multi-Task RL

We address the problem of teaching a deep reinforcement learning (RL) agent to follow instructions in multi-task environments. Instructions are expressed in a well-known formal language -- linear temporal logic (LTL) -- and can specify a…

Artificial Intelligence · Computer Science 2021-07-07 Pashootan Vaezipoor, Andrew Li, Rodrigo Toro Icarte, Sheila McIlraith

Probabilistic Planning with Prioritized Preferences over Temporal Logic Objectives

This paper studies temporal planning in probabilistic environments, modeled as labeled Markov decision processes (MDPs), with user preferences over multiple temporal goals. Existing works reflect such preferences as a prioritized list of…

Formal Languages and Automata Theory · Computer Science 2023-04-25 Lening Li, Hazhar Rahmani, Jie Fu