Related papers: Dynamically Optimal Treatment Allocation

Learning When-to-Treat Policies

Many applied decision-making problems have a dynamic component: The policymaker needs not only to choose whom to treat, but also when to start which treatment. For example, a medical doctor may choose between postponing treatment (watchful…

Methodology · Statistics 2020-05-01 Xinkun Nie , Emma Brunskill , Stefan Wager

Reinforcement Learning Policies in Continuous-Time Linear Systems

Linear dynamical systems that obey stochastic differential equations are canonical models. While optimal control of known systems has a rich literature, the problem is technically hard under model uncertainty and there are hardly any…

Systems and Control · Electrical Eng. & Systems 2023-06-09 Mohamad Kazem Shirani Faradonbeh , Mohamad Sadegh Shirani Faradonbeh

Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraints

Many policies involve dynamics in their treatment assignments, where individuals receive sequential interventions over multiple stages. We study estimation of an optimal dynamic treatment regime that guides the optimal treatment assignment…

Econometrics · Economics 2024-09-04 Shosei Sakaguchi

Action-State Dependent Dynamic Model Selection

A model among many may only be best under certain states of the world. Switching from a model to another can also be costly. Finding a procedure to dynamically choose a model in these circumstances requires to solve a complex estimation…

Machine Learning · Computer Science 2023-10-10 Francesco Cordoni , Alessio Sancetta

Reinforcement Learning for Dynamic Memory Allocation

In recent years, reinforcement learning (RL) has gained popularity and has been applied to a wide range of tasks. One such popular domain where RL has been effective is resource management problems in systems. We look to extend work on RL…

Machine Learning · Computer Science 2025-10-09 Arisrei Lim , Abhiram Maddukuri

Policy Learning for Optimal Dynamic Treatment Regimes with Observational Data

Public policies and medical interventions often involve dynamic treatment assignments, in which individuals receive a sequence of interventions over multiple stages. We study the statistical learning of optimal dynamic treatment regimes…

Methodology · Statistics 2025-05-21 Shosei Sakaguchi

Stochastic Resetting Accelerates Policy Convergence in Reinforcement Learning

Stochastic resetting, where a dynamical process is intermittently returned to a fixed reference state, has emerged as a powerful mechanism for optimizing first-passage properties. Existing theory largely treats static, non-learning…

Machine Learning · Computer Science 2026-03-18 Jello Zhou , Vudtiwat Ngampruetikorn , David J. Schwab

Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

In order for reinforcement learning techniques to be useful in real-world decision making processes, they must be able to produce robust performance from limited data. Deep policy optimization methods have achieved impressive results on…

Machine Learning · Computer Science 2020-12-22 James Queeney , Ioannis Ch. Paschalidis , Christos G. Cassandras

Dynamical Priors as a Training Objective in Reinforcement Learning

Standard reinforcement learning (RL) optimizes policies for reward but imposes few constraints on how decisions evolve over time. As a result, policies may achieve high performance while exhibiting temporally incoherent behavior such as…

Machine Learning · Computer Science 2026-04-24 Sukesh Subaharan

Dynamic allocation of limited memory resources in reinforcement learning

Biological brains are inherently limited in their capacity to process and store information, but are nevertheless capable of solving complex tasks with apparent ease. Intelligent behavior is related to these limitations, since resource…

Neurons and Cognition · Quantitative Biology 2020-11-16 Nisheet Patel , Luigi Acerbi , Alexandre Pouget

More Efficient Policy Learning via Optimal Retargeting

Policy learning can be used to extract individualized treatment regimes from observational data in healthcare, civics, e-commerce, and beyond. One big hurdle to policy learning is a commonplace lack of overlap in the data for different…

Machine Learning · Statistics 2020-12-04 Nathan Kallus

Efficient Dynamic Allocation Policy for Robust Ranking and Selection under Stochastic Control Framework

This research considers the ranking and selection with input uncertainty. The objective is to maximize the posterior probability of correctly selecting the best alternative under a fixed simulation budget, where each alternative is measured…

Optimization and Control · Mathematics 2023-05-15 Hui Xiao , Zhihong Wei

Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation

While Reinforcement Learning can achieve impressive results for complex tasks, the learned policies are generally prone to fail in downstream tasks with even minor model mismatch or unexpected perturbations. Recent works have demonstrated…

Machine Learning · Computer Science 2023-05-23 Kang Xu , Yan Ma , Bingsheng Wei , Wei Li

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science 2025-04-16 Alexander David Goldie , Chris Lu , Matthew Thomas Jackson , Shimon Whiteson , Jakob Nicolaus Foerster

Reinforcement Learning in Economics and Finance

Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal…

Theoretical Economics · Economics 2020-03-24 Arthur Charpentier , Romuald Elie , Carl Remlinger

Non-Deterministic Policies in Markovian Decision Processes

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making problems in such environments. In recent years, attempts were made…

Artificial Intelligence · Computer Science 2014-01-17 Mahdi Milani Fard , Joelle Pineau

Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints

Reinforcement learning can greatly benefit from the use of options as a way of encoding recurring behaviours and to foster exploration. An important open problem is how can an agent autonomously learn useful options when solving particular…

Machine Learning · Computer Science 2020-01-07 Manuel Del Verme , Bruno Castro da Silva , Gianluca Baldassarre

Dynamic Inverse Optimization under Drift and Shocks: Theory, Regret Bounds, and Applications

The growing prevalence of drift and shocks in modern decision environments exposes a gap between classical optimization theory and real-world practice. Standard models assume fixed objectives, yet organizations from hospitals to power grids…

Computational Finance · Quantitative Finance 2025-09-18 JINHO CHA

Optimal Dynamic Treatment Regimes and Partial Welfare Ordering

Dynamic treatment regimes are treatment allocations tailored to heterogeneous individuals. The optimal dynamic treatment regime is a regime that maximizes counterfactual welfare. We introduce a framework in which we can partially learn the…

Econometrics · Economics 2021-07-14 Sukjin Han

Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies

Researchers have demonstrated that Deep Reinforcement Learning (DRL) is a powerful tool for finding policies that perform well on complex robotic systems. However, these policies are often unpredictable and can induce highly variable…

Robotics · Computer Science 2022-03-08 Sean Gillen , Asutay Ozmen , Katie Byl