Related papers: Causal Deep Reinforcement Learning Using Observati…

Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

Empowered by expressive function approximators such as neural networks, deep reinforcement learning (DRL) achieves tremendous empirical successes. However, learning expressive function approximators requires collecting a large dataset…

Machine Learning · Computer Science 2020-06-23 Lingxiao Wang , Zhuoran Yang , Zhaoran Wang

Deconfounding Reinforcement Learning in Observational Settings

We propose a general formulation for addressing reinforcement learning (RL) problems in settings with observational data. That is, we consider the problem of learning good policies solely from historical data in which unobserved factors…

Machine Learning · Computer Science 2018-12-31 Chaochao Lu , Bernhard Schölkopf , José Miguel Hernández-Lobato

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

We study the offline reinforcement learning (RL) in the face of unmeasured confounders. Due to the lack of online interaction with the environment, offline RL is facing the following two significant challenges: (i) the agent may be…

Machine Learning · Computer Science 2022-09-20 Zuyue Fu , Zhengling Qi , Zhaoran Wang , Zhuoran Yang , Yanxun Xu , Michael R. Kosorok

The Challenges of Exploration for Offline Reinforcement Learning

Offline Reinforcement Learning (ORL) enablesus to separately study the two interlinked processes of reinforcement learning: collecting informative experience and inferring optimal behaviour. The second step has been widely studied in the…

Machine Learning · Computer Science 2022-02-22 Nathan Lambert , Markus Wulfmeier , William Whitney , Arunkumar Byravan , Michael Bloesch , Vibhavari Dasagi , Tim Hertweck , Martin Riedmiller

Causal Reinforcement Learning using Observational and Interventional Data

Learning efficiently a causal model of the environment is a key challenge of model-based RL agents operating in POMDPs. We consider here a scenario where the learning agent has the ability to collect online experiences through direct…

Machine Learning · Computer Science 2021-06-29 Maxime Gasse , Damien Grasset , Guillaume Gaudron , Pierre-Yves Oudeyer

Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement

Reinforcement learning (RL) is a powerful data-driven control method that has been largely explored in autonomous driving tasks. However, conventional RL approaches learn control policies through trial-and-error interactions with the…

Robotics · Computer Science 2021-11-03 Tianyu Shi , Dong Chen , Kaian Chen , Zhaojian Li

Confounding Robust Deep Reinforcement Learning: A Causal Approach

A key task in Artificial Intelligence is learning effective policies for controlling agents in unknown environments to optimize performance measures. Off-policy learning methods, like Q-learning, allow learners to make optimal decisions…

Artificial Intelligence · Computer Science 2025-10-27 Mingxuan Li , Junzhe Zhang , Elias Bareinboim

Learning Optimal and Sample-Efficient Decision Policies with Guarantees

The paradigm of decision-making has been revolutionised by reinforcement learning and deep learning. Although this has led to significant progress in domains such as robotics, healthcare, and finance, the use of RL in practice is…

Machine Learning · Computer Science 2026-02-23 Daqian Shao

Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement

Deep reinforcement learning (DRL) has achieved remarkable progress in online path planning tasks for multi-UAV systems. However, existing DRL-based methods often suffer from performance degradation when tackling unseen scenarios, since the…

Robotics · Computer Science 2024-07-16 Jiafan Zhuang , Zihao Xia , Gaofei Han , Boxi Wang , Wenji Li , Dongliang Wang , Zhifeng Hao , Ruichu Cai , Zhun Fan

Learning Computational Efficient Bots with Costly Features

Deep reinforcement learning (DRL) techniques have become increasingly used in various fields for decision-making processes. However, a challenge that often arises is the trade-off between both the computational efficiency of the…

Machine Learning · Computer Science 2023-08-21 Anthony Kobanda , Valliappan C. A. , Joshua Romoff , Ludovic Denoyer

Improving Offline Reinforcement Learning with Inaccurate Simulators

Offline reinforcement learning (RL) provides a promising approach to avoid costly online interaction with the real environment. However, the performance of offline RL highly depends on the quality of the datasets, which may cause…

Robotics · Computer Science 2024-05-08 Yiwen Hou , Haoyuan Sun , Jinming Ma , Feng Wu

Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications

There is increasing interest in data-driven approaches for recommending optimal treatment strategies in many chronic disease management and critical care applications. Reinforcement learning methods are well-suited to this sequential…

Machine Learning · Computer Science 2023-06-14 Milashini Nambiar , Supriyo Ghosh , Priscilla Ong , Yu En Chan , Yong Mong Bee , Pavitra Krishnaswamy

Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Offline reinforcement learning (offline RL), which aims to find an optimal policy from a previously collected static dataset, bears algorithmic difficulties due to function approximation errors from out-of-distribution (OOD) data points. To…

Machine Learning · Computer Science 2021-10-06 Gaon An , Seungyong Moon , Jang-Hyun Kim , Hyun Oh Song

The AdobeIndoorNav Dataset: Towards Deep Reinforcement Learning based Real-world Indoor Robot Visual Navigation

Deep reinforcement learning (DRL) demonstrates its potential in learning a model-free navigation policy for robot visual navigation. However, the data-demanding algorithm relies on a large number of navigation trajectories in training.…

Robotics · Computer Science 2018-02-27 Kaichun Mo , Haoxiang Li , Zhe Lin , Joon-Young Lee

Instabilities of Offline RL with Pre-Trained Neural Representation

In offline reinforcement learning (RL), we seek to utilize offline data to evaluate (or learn) policies in scenarios where the data are collected from a distribution that substantially differs from that of the target policy to be evaluated.…

Machine Learning · Computer Science 2021-03-09 Ruosong Wang , Yifan Wu , Ruslan Salakhutdinov , Sham M. Kakade

De-confounding Representation Learning for Counterfactual Inference on Continuous Treatment via Generative Adversarial Network

Counterfactual inference for continuous rather than binary treatment variables is more common in real-world causal inference tasks. While there are already some sample reweighting methods based on Marginal Structural Model for eliminating…

Machine Learning · Computer Science 2024-07-15 Yonghe Zhao , Qiang Huang , Haolong Zeng , Yun Pen , Huiyan Sun

Offline Reinforcement Learning Hands-On

Offline Reinforcement Learning (RL) aims to turn large datasets into powerful decision-making engines without any online interactions with the environment. This great promise has motivated a large amount of research that hopes to replicate…

Machine Learning · Computer Science 2020-12-01 Louis Monier , Jakub Kmec , Alexandre Laterre , Thomas Pierrot , Valentin Courgeau , Olivier Sigaud , Karim Beguir

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Offline policy learning is aimed at learning decision-making policies using existing datasets of trajectories without collecting additional data. The primary motivation for using reinforcement learning (RL) instead of supervised learning…

Machine Learning · Computer Science 2023-10-13 Zhang-Wei Hong , Aviral Kumar , Sathwik Karnik , Abhishek Bhandwaldar , Akash Srivastava , Joni Pajarinen , Romain Laroche , Abhishek Gupta , Pulkit Agrawal

A Reinforcement Learning Approach for Robotic Unloading from Visual Observations

In this work, we focus on a robotic unloading problem from visual observations, where robots are required to autonomously unload stacks of parcels using RGB-D images as their primary input source. While supervised and imitation learning…

Robotics · Computer Science 2023-09-14 Vittorio Giammarino , Alberto Giammarino , Matthew Pearce

Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

Recent progress in deep learning has relied on access to large and diverse datasets. Such data-driven progress has been less evident in offline reinforcement learning (RL), because offline RL data is usually collected to optimize specific…

Machine Learning · Computer Science 2022-04-07 Denis Yarats , David Brandfonbrener , Hao Liu , Michael Laskin , Pieter Abbeel , Alessandro Lazaric , Lerrel Pinto