Related papers: Entropy-Guided Control Improvisation

Formal Entropy-Regularized Control of Stochastic Systems

Analyzing and controlling system entropy is a powerful tool for regulating predictability of control systems. Applications benefiting from such approaches range from reinforcement learning and data security to human-robot collaboration. In…

Systems and Control · Electrical Eng. & Systems 2026-03-06 Menno van Zutphen, Giannis Delimpaltadakis, Duarte J. Antunes

Permissive Controller Synthesis for Probabilistic Systems

We propose novel controller synthesis techniques for probabilistic systems modelled using stochastic two-player games: one player acts as a controller, the second represents its environment, and probability is used to capture uncertainty…

Logic in Computer Science · Computer Science 2017-01-11 Klaus Drager, Vojtech Forejt, Marta Kwiatkowska, David Parker, Mateusz Ujma

Randomized Synthesis for Diversity and Cost Constraints with Control Improvisation

In many synthesis problems, it can be essential to generate implementations which not only satisfy functional constraints but are also randomized to improve variety, robustness, or unpredictability. The recently-proposed framework of…

Logic in Computer Science · Computer Science 2022-06-07 Andreas Gittis, Eric Vin, Daniel J. Fremont

Entropy Regularised Deterministic Optimal Control: From Path Integral Solution to Sample-Based Trajectory Optimisation

Sample-based trajectory optimisers are a promising tool for the control of robotics with non-differentiable dynamics and cost functions. Contemporary approaches derive from a restricted subclass of stochastic optimal control where the…

Robotics · Computer Science 2021-10-07 Tom Lefebvre, Guillaume Crevecoeur

Predictable Interval MDPs through Entropy Regularization

Regularization of control policies using entropy can be instrumental in adjusting predictability of real-world systems. Applications benefiting from such approaches range from, e.g., cybersecurity, which aims at maximal unpredictability, to…

Systems and Control · Electrical Eng. & Systems 2026-02-18 Menno van Zutphen, Giannis Delimpaltadakis, Maurice Heemels, Duarte Antunes

Reactive Control Improvisation

Reactive synthesis is a paradigm for automatically building correct-by-construction systems that interact with an unknown or adversarial environment. We study how to do reactive synthesis when part of the specification of the system is that…

Logic in Computer Science · Computer Science 2018-04-23 Daniel J. Fremont, Sanjit A. Seshia

Entropic Risk Constrained Soft-Robust Policy Optimization

Having a perfect model to compute the optimal policy is often infeasible in reinforcement learning. It is important in high-stakes domains to quantify and manage risk induced by model uncertainties. Entropic risk measure is an exponential…

Machine Learning · Computer Science 2020-06-23 Reazul Hasan Russel, Bahram Behzadian, Marek Petrik

Understanding the impact of entropy on policy optimization

Entropy regularization is commonly used to improve policy optimization in reinforcement learning. It is believed to help with exploration by encouraging the selection of more stochastic policies. In this work, we analyze this claim…

Machine Learning · Computer Science 2019-06-11 Zafarali Ahmed, Nicolas Le Roux, Mohammad Norouzi, Dale Schuurmans

On the Convergence of Approximate and Regularized Policy Iteration Schemes

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks. The regularized formulation modifies the standard RL…

Machine Learning · Statistics 2019-10-15 Elena Smirnova, Elvis Dohmatob

Probabilities Are Not Enough: Formal Controller Synthesis for Stochastic Dynamical Models with Epistemic Uncertainty

Capturing uncertainty in models of complex dynamical systems is crucial to designing safe controllers. Stochastic noise causes aleatoric uncertainty, whereas imprecise knowledge of model parameters leads to epistemic uncertainty. Several…

Systems and Control · Electrical Eng. & Systems 2022-12-08 Thom Badings, Licio Romao, Alessandro Abate, Nils Jansen

Entropic Risk for Turn-Based Stochastic Games

Entropic risk (ERisk) is an established risk measure in finance, quantifying risk by an exponential re-weighting of rewards. We study ERisk for the first time in the context of turn-based stochastic games with the total reward objective.…

Computer Science and Game Theory · Computer Science 2023-07-14 Christel Baier, Krishnendu Chatterjee, Tobias Meggendorfer, Jakob Piribauer

Entropy-regularized Point-based Value Iteration

Model-based planners for partially observable problems must accommodate both model uncertainty during planning and goal uncertainty during objective inference. However, model-based planners may be brittle under these types of uncertainty…

Artificial Intelligence · Computer Science 2024-02-15 Harrison Delecki, Marcell Vazquez-Chanlatte, Esen Yel, Kyle Wray, Tomer Arnon, Stefan Witwicki, Mykel J. Kochenderfer

Extending the Entropic Potential of Events for Uncertainty Quantification and Decision-Making in Artificial Intelligence

This work demonstrates how the concept of the entropic potential of events -- a parameter quantifying the influence of discrete events on the expected future entropy of a system -- can enhance uncertainty quantification, decision-making,…

Artificial Intelligence · Computer Science 2025-08-15 Mark Zilberman

Backward Stochastic Control System with Entropy Regularization

The entropy regularization is inspired by information entropy from machine learning and the ideas of exploration and exploitation in reinforcement learning, which appears in the control problem to design an approximating algorithm for the…

Optimization and Control · Mathematics 2024-11-21 Ziyue Chen, Qi Zhang

Implicit Policy for Reinforcement Learning

We introduce Implicit Policy, a general class of expressive policies that can flexibly represent complex action distributions in reinforcement learning, with efficient algorithms to compute entropy regularized policy gradients. We…

Machine Learning · Computer Science 2019-02-05 Yunhao Tang, Shipra Agrawal

Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review

The framework of reinforcement learning or optimal control provides a mathematical formalization of intelligent decision making that is powerful and broadly applicable. While the general form of the reinforcement learning problem enables…

Machine Learning · Computer Science 2018-05-22 Sergey Levine

Entropy, Computing and Rationality

Making decisions freely presupposes that there is some indeterminacy in the environment and in the decision making engine. The former is reflected on the behavioral changes due to communicating: few changes indicate rigid environments;…

Artificial Intelligence · Computer Science 2020-09-23 Luis A. Pineda

Determinism, Complexity, and Predictability in Computer Performance

Computers are deterministic dynamical systems (CHAOS 19:033124, 2009). Among other things, that implies that one should be able to use deterministic forecast rules to predict their behavior. That statement is sometimes, but not always, true.…

Chaotic Dynamics · Physics 2013-05-24 Joshua Garland, Ryan James, Elizabeth Bradley

Robust Entropy-regularized Markov Decision Processes

Stochastic and soft optimal policies resulting from entropy-regularized Markov decision processes (ER-MDP) are desirable for exploration and imitation learning applications. Motivated by the fact that such policies are sensitive with…

Machine Learning · Computer Science 2022-01-03 Tien Mai, Patrick Jaillet

Robust Dynamic Programming for Temporal Logic Control of Stochastic Systems

Discrete-time stochastic systems are an essential modelling tool for many engineering systems. We consider stochastic control systems that are evolving over continuous spaces. For this class of models, methods for the formal verification…

Systems and Control · Computer Science 2018-11-29 Sofie Haesaert, Sadegh Soudjani