Related papers: Matching-Based Policy Learning

Importance Weighted Policy Learning and Adaptation

The ability to exploit prior experience to solve novel problems rapidly is a hallmark of biological learning systems and of great practical importance for artificial ones. In the meta reinforcement learning literature much recent work has…

Machine Learning · Computer Science 2021-06-07 Alexandre Galashov , Jakub Sygnowski , Guillaume Desjardins , Jan Humplik , Leonard Hasenclever , Rae Jeong , Yee Whye Teh , Nicolas Heess

More Efficient Policy Learning via Optimal Retargeting

Policy learning can be used to extract individualized treatment regimes from observational data in healthcare, civics, e-commerce, and beyond. One big hurdle to policy learning is a commonplace lack of overlap in the data for different…

Machine Learning · Statistics 2020-12-04 Nathan Kallus

Set-Valued Policy Learning

Conventional treatment policies map patient covariates to a single recommended intervention in order to maximize expected clinical outcomes. Although a rich body of causal inference methods has been developed to estimate such policies,…

Machine Learning · Computer Science 2026-05-20 Laura Fuentes-Vicente , Mathieu Even , Gaëlle Dormion , Antoine Chambaz , Uri Shalit , Julie Josse

Preference-based Conditional Treatment Effects and Policy Learning

We introduce a new preference-based framework for conditional treatment effect estimation and policy learning, built on the Conditional Preference-based Treatment Effect (CPTE). CPTE requires only that outcomes be ranked under a preference…

Machine Learning · Statistics 2026-02-04 Dovid Parnas , Mathieu Even , Julie Josse , Uri Shalit

Learning Continuous Treatment Policy and Bipartite Embeddings for Matching with Heterogeneous Causal Effects

Causal inference methods are widely applied in the fields of medicine, policy, and economics. Central to these applications is the estimation of treatment effects to make decisions. Current methods make binary yes-or-no decisions based on…

Machine Learning · Computer Science 2020-04-24 Will Y. Zou , Smitha Shyam , Michael Mui , Mingshi Wang , Jan Pedersen , Zoubin Ghahramani

Balanced Policy Evaluation and Learning

We present a new approach to the problems of evaluating and learning personalized decision policies from observational data of past contexts, decisions, and outcomes. Only the outcome of the enacted decision is available and the historical…

Machine Learning · Statistics 2019-06-04 Nathan Kallus

Policy Learning under Biased Sample Selection

Practitioners often use data from a randomized controlled trial to learn a treatment assignment policy that can be deployed on a target population. A recurring concern in doing so is that, even if the randomized trial was well-executed…

Econometrics · Economics 2023-04-25 Lihua Lei , Roshni Sahoo , Stefan Wager

Causal-Policy Forest for End-to-End Policy Learning

This study proposes an end-to-end algorithm for policy learning in causal inference. We observe data consisting of covariates, treatment assignments, and outcomes, where only the outcome corresponding to the assigned treatment is observed.…

Econometrics · Economics 2025-12-30 Masahiro Kato

Policy Learning for Balancing Short-Term and Long-Term Rewards

Empirical researchers and decision-makers spanning various domains frequently seek profound insights into the long-term impacts of interventions. While the significance of long-term outcomes is undeniable, an overemphasis on them may…

Machine Learning · Computer Science 2024-09-17 Peng Wu , Ziyu Shen , Feng Xie , Zhongyao Wang , Chunchen Liu , Yan Zeng

Metalearners for Ranking Treatment Effects

Efficiently allocating treatments with a budget constraint constitutes an important challenge across various domains. In marketing, for example, the use of promotions to target potential customers and boost conversions is limited by the…

Machine Learning · Computer Science 2024-05-06 Toon Vanderschueren , Wouter Verbeke , Felipe Moraes , Hugo Manuel Proença

On Modeling Human Perceptions of Allocation Policies with Uncertain Outcomes

Many policies allocate harms or benefits that are uncertain in nature: they produce distributions over the population in which individuals have different probabilities of incurring harm or benefit. Comparing different policies thus involves…

Computers and Society · Computer Science 2021-03-11 Hoda Heidari , Solon Barocas , Jon Kleinberg , Karen Levy

Learning treatment effects while treating those in need

Many social programs attempt to allocate scarce resources to people with the greatest need. Indeed, public services increasingly use algorithmic risk assessments motivated by this goal. However, targeting the highest-need recipients often…

Machine Learning · Computer Science 2025-06-30 Bryan Wilder , Pim Welle

Doubly Optimal Policy Evaluation for Reinforcement Learning

Policy evaluation estimates the performance of a policy by (1) collecting data from the environment and (2) processing raw data into a meaningful estimate. Due to the sequential nature of reinforcement learning, any improper data-collecting…

Machine Learning · Computer Science 2025-03-21 Shuze Daniel Liu , Claire Chen , Shangtong Zhang

Policy Learning with Rare Outcomes

Machine learning (ML) estimates of conditional average treatment effects (CATE) can guide policy decisions, either by allowing targeting of individuals with beneficial CATE estimates, or as inputs to decision trees that optimise overall…

Econometrics · Economics 2023-10-04 Julia Hatamyar , Noemi Kreif

Offline Policy Optimization with Eligible Actions

Offline policy optimization could have a large impact on many real-world decision-making problems, as online learning may be infeasible in many applications. Importance sampling and its variants are a commonly used type of estimator in…

Machine Learning · Computer Science 2022-07-05 Yao Liu , Yannis Flet-Berliac , Emma Brunskill

Optimal Policy Adaptation under Covariate Shift

Transfer learning of prediction models has been extensively studied, while the corresponding policy learning approaches are rarely discussed. In this paper, we propose principled approaches for learning the optimal policy in the target…

Machine Learning · Computer Science 2025-05-20 Xueqing Liu , Qinwei Yang , Zhaoqing Tian , Ruocheng Guo , Peng Wu

Preference-based Learning of Reward Function Features

Preference-based learning of reward functions, where the reward function is learned using comparison data, has been well studied for complex robotic tasks such as autonomous driving. Existing algorithms have focused on learning reward…

Robotics · Computer Science 2021-03-05 Sydney M. Katz , Amir Maleki , Erdem Bıyık , Mykel J. Kochenderfer

Who With Whom? Learning Optimal Matching Policies

There are many economic contexts where the productivity and welfare performance of institutions and policies depend on who matches with whom. Examples include caseworkers and job seekers in job search assistance programs, medical doctors…

Econometrics · Economics 2025-07-21 Yagan Hazard , Toru Kitagawa

Policy Learning with Adaptively Collected Data

Learning optimal policies from historical data enables personalization in a wide variety of applications including healthcare, digital recommendations, and online education. The growing policy learning literature focuses on settings where…

Machine Learning · Statistics 2022-11-17 Ruohan Zhan , Zhimei Ren , Susan Athey , Zhengyuan Zhou

Positivity-free Policy Learning with Observational Data

Policy learning utilizing observational data is pivotal across various domains, with the objective of learning the optimal treatment assignment policy while adhering to specific constraints such as fairness, budget, and simplicity. This…

Methodology · Statistics 2023-10-12 Pan Zhao , Antoine Chambaz , Julie Josse , Shu Yang