Related papers: State Combinatorial Generalization In Decision Mak…

Is Conditional Generative Modeling all you need for Decision-Making?

Recent improvements in conditional generative modeling have made it possible to generate high-quality images from language descriptions alone. We investigate whether these methods can directly address the problem of sequential…

Machine Learning · Computer Science 2023-07-11 Anurag Ajay , Yilun Du , Abhi Gupta , Joshua Tenenbaum , Tommi Jaakkola , Pulkit Agrawal

Learning General Policies with Policy Gradient Methods

While reinforcement learning methods have delivered remarkable results in a number of settings, generalization, i.e., the ability to produce policies that generalize in a reliable and systematic way, has remained a challenge. The problem of…

Artificial Intelligence · Computer Science 2025-12-23 Simon Ståhlberg , Blai Bonet , Hector Geffner

Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles

Reinforcement learning (RL) methods have been shown to be capable of learning intelligent behavior in rich domains. However, this has largely been done in simulated domains without adequate focus on the process of building the simulator. In…

Machine Learning · Computer Science 2019-10-24 Aditya Modi , Nan Jiang , Ambuj Tewari , Satinder Singh

Generalizing Decision Making for Automated Driving with an Invariant Environment Representation using Deep Reinforcement Learning

Data driven approaches for decision making applied to automated driving require appropriate generalization strategies, to ensure applicability to the world's variability. Current approaches either do not generalize well beyond the training…

Machine Learning · Computer Science 2022-03-11 Karl Kurzer , Philip Schörner , Alexander Albers , Hauke Thomsen , Karam Daaboul , J. Marius Zöllner

Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces

Reinforcement learning (RL) struggles to scale to large, combinatorial action spaces common in many real-world problems. This paper introduces a novel framework for training discrete diffusion models as highly effective policies in these…

Machine Learning · Computer Science 2026-05-21 Haitong Ma , Ofir Nabati , Aviv Rosenberg , Bo Dai , Oran Lang , Craig Boutilier , Na Li , Shie Mannor , Lior Shani , Guy Tenneholtz

Planning with Goal-Conditioned Policies

Planning methods can solve temporally extended sequential decision making problems by composing simple behaviors. However, planning requires suitable abstractions for the states and transitions, which typically need to be designed by hand.…

Machine Learning · Computer Science 2019-11-20 Soroush Nasiriany , Vitchyr H. Pong , Steven Lin , Sergey Levine

Self-Predictive Representations for Combinatorial Generalization in Behavioral Cloning

While goal-conditioned behavior cloning (GCBC) methods can perform well on in-distribution training tasks, they do not necessarily generalize zero-shot to tasks that require conditioning on novel state-goal pairs, i.e. combinatorial…

Machine Learning · Computer Science 2026-04-21 Daniel Lawson , Adriana Hugessen , Charlotte Cloutier , Glen Berseth , Khimya Khetarpal

Diffusion Model-Augmented Behavioral Cloning

Imitation learning addresses the challenge of learning by observing an expert's demonstrations without access to reward signals from environments. Most existing imitation learning methods that do not require interacting with environments…

Machine Learning · Computer Science 2024-06-04 Shang-Fu Chen , Hsiang-Chun Wang , Ming-Hao Hsu , Chun-Mao Lai , Shao-Hua Sun

Adding Conditional Control to Diffusion Models with Reinforcement Learning

Diffusion models are powerful generative models that allow for precise control over the characteristics of the generated samples. While these diffusion models trained on large datasets have achieved success, there is often a need to…

Machine Learning · Computer Science 2025-02-25 Yulai Zhao , Masatoshi Uehara , Gabriele Scalia , Sunyuan Kung , Tommaso Biancalani , Sergey Levine , Ehsan Hajiramezanali

Planning with Diffusion for Flexible Behavior Synthesis

Model-based reinforcement learning methods often use learning only for the purpose of estimating an approximate dynamics model, offloading the rest of the decision-making work to classical trajectory optimizers. While conceptually simple,…

Machine Learning · Computer Science 2022-12-22 Michael Janner , Yilun Du , Joshua B. Tenenbaum , Sergey Levine

Concept-modulated model-based offline reinforcement learning for rapid generalization

The robustness of any machine learning solution is fundamentally bound by the data it was trained on. One way to generalize beyond the original training is through human-informed augmentation of the original dataset; however, it is…

Machine Learning · Computer Science 2022-09-08 Nicholas A. Ketz , Praveen K. Pilly

DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies

Can we use reinforcement learning to learn general-purpose policies that can perform a wide range of different tasks, resulting in flexible and reusable skills? Contextual policies provide this capability in principle, but the…

Machine Learning · Computer Science 2021-04-26 Soroush Nasiriany , Vitchyr H. Pong , Ashvin Nair , Alexander Khazatsky , Glen Berseth , Sergey Levine

Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection

Diffusion Models enable realistic image generation, raising the risk of misinformation and eroding public trust. Currently, detecting images generated by unseen diffusion models remains challenging due to the limited generalization…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Yingjian Chen , Lei Zhang , Yakun Niu , Lei Tan , Pei Chen

SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

We study combinatorial problems with real world applications such as machine scheduling, routing, and assignment. We propose a method that combines Reinforcement Learning (RL) and planning. This method can equally be applied to both the…

Machine Learning · Computer Science 2021-05-19 Joel Oren , Chana Ross , Maksym Lefarov , Felix Richter , Ayal Taitler , Zohar Feldman , Christian Daniel , Dotan Di Castro

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Model-based reinforcement learning (RL) has shown great potential in various control tasks in terms of both sample-efficiency and final performance. However, learning a generalizable dynamics model robust to changes in dynamics remains a…

Machine Learning · Computer Science 2020-10-27 Younggyo Seo , Kimin Lee , Ignasi Clavera , Thanard Kurutach , Jinwoo Shin , Pieter Abbeel

Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation

Learning complex manipulation tasks in realistic, obstructed environments is a challenging problem due to hard exploration in the presence of obstacles and high-dimensional visual observations. Prior work tackles the exploration problem by…

Machine Learning · Computer Science 2021-11-12 I-Chun Arthur Liu , Shagun Uppal , Gaurav S. Sukhatme , Joseph J. Lim , Peter Englert , Youngwoon Lee

Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning

Diffusion models are increasingly used as powerful conditional generators, yet real deployments often involve multiple target distributions arising from different tasks, e.g., diverse prompt domains in text-to-image generation, or multiple…

Machine Learning · Computer Science 2026-05-26 Ziheng Cheng , Yixiao Huang , Hanlin Zhu , Haoran Geng , Somayeh Sojoudi , Jitendra Malik , Pieter Abbeel , Xin Guo

Steering Your Diffusion Policy with Latent Space Reinforcement Learning

Robotic control policies learned from human demonstrations have achieved impressive results in many real-world applications. However, in scenarios where initial performance is not satisfactory, as is often the case in novel open-world…

Robotics · Computer Science 2025-06-27 Andrew Wagenmaker , Mitsuhiko Nakamoto , Yunchu Zhang , Seohong Park , Waleed Yagoub , Anusha Nagabandi , Abhishek Gupta , Sergey Levine

On Zero-Shot Reinforcement Learning

Modern reinforcement learning (RL) systems capture deep truths about general, human problem-solving. In domains where new data can be simulated cheaply, these systems uncover sequential decision-making policies that far exceed the ability…

Machine Learning · Computer Science 2025-10-07 Scott Jeen

Lost in Latent Space: Disentangled Models and the Challenge of Combinatorial Generalisation

Recent research has shown that generative models with highly disentangled representations fail to generalise to unseen combination of generative factor values. These findings contradict earlier research which showed improved performance in…

Machine Learning · Computer Science 2024-06-17 Milton L. Montero , Jeffrey S. Bowers , Rui Ponte Costa , Casimir J. H. Ludwig , Gaurav Malhotra