Related papers: Dynamic Value Estimation for Single-Task Multi-Sce…

Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

Training deep reinforcement learning agents on environments with multiple levels / scenes from the same task, has become essential for many applications aiming to achieve generalization and domain transfer from simulation to the real world.…

Machine Learning · Computer Science 2021-02-16 Jaskirat Singh , Liang Zheng

Enhanced Scene Specificity with Sparse Dynamic Value Estimation

Multi-scene reinforcement learning involves training the RL agent across multiple scenes / levels from the same task, and has become essential for many generalization applications. However, the inclusion of multiple scenes leads to an…

Machine Learning · Computer Science 2020-11-26 Jaskirat Singh , Liang Zheng

An Actor-Critic-Attention Mechanism for Deep Reinforcement Learning in Multi-view Environments

In reinforcement learning algorithms, leveraging multiple views of the environment can improve the learning of complicated policies. In multi-view environments, due to the fact that the views may frequently suffer from partial…

Machine Learning · Computer Science 2019-07-24 Elaheh Barati , Xuewen Chen

Value function estimation using conditional diffusion models for control

A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative…

Machine Learning · Computer Science 2023-06-14 Bogdan Mazoure , Walter Talbott , Miguel Angel Bautista , Devon Hjelm , Alexander Toshev , Josh Susskind

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a…

Machine Learning · Computer Science 2022-09-14 Haoxin Lin , Yihao Sun , Jiaji Zhang , Yang Yu

Dynamic Horizon Value Estimation for Model-based Reinforcement Learning

Existing model-based value expansion methods typically leverage a world model for value estimation with a fixed rollout horizon to assist policy learning. However, the fixed rollout with an inaccurate model has a potential to harm the…

Machine Learning · Computer Science 2020-09-22 Junjie Wang , Qichao Zhang , Dongbin Zhao , Mengchen Zhao , Jianye Hao

Fully Decentralized Cooperative Multi-Agent Reinforcement Learning is A Context Modeling Problem

This paper studies fully decentralized cooperative multi-agent reinforcement learning, where each agent solely observes the states, its local actions, and the shared rewards. The inability to access other agents' actions often leads to…

Machine Learning · Computer Science 2026-05-12 Chao Li , Bingkun Bao , Yang Gao

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

Multi-agent reinforcement learning has drawn increasing attention in practice, e.g., robotics and automatic driving, as it can explore optimal policies using samples generated by interacting with the environment. However, high reward…

Machine Learning · Computer Science 2022-10-17 Jifeng Hu , Yanchao Sun , Hechang Chen , Sili Huang , haiyin piao , Yi Chang , Lichao Sun

Attention-based Deep Reinforcement Learning for Multi-view Environments

In reinforcement learning algorithms, it is a common practice to account for only a single view of the environment to make the desired decisions; however, utilizing multiple views of the environment can help to promote the learning of…

Machine Learning · Computer Science 2019-05-13 Elaheh Barati , Xuewen Chen , Zichun Zhong

Rethinking Value Function Learning for Generalization in Reinforcement Learning

Our work focuses on training RL agents on multiple visually diverse environments to improve observational generalization performance. In prior methods, policy and value networks are separately optimized using a disjoint network architecture…

Machine Learning · Computer Science 2023-01-10 Seungyong Moon , JunYeong Lee , Hyun Oh Song

Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects

This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to…

Robotics · Computer Science 2019-10-14 Chaitanya Mitash , Bowen Wen , Kostas Bekris , Abdeslam Boularias

Discrete Control in Real-World Driving Environments using Deep Reinforcement Learning

Training self-driving cars is often challenging since they require a vast amount of labeled data in multiple real-world contexts, which is computationally and memory intensive. Researchers often resort to driving simulators to train the…

Artificial Intelligence · Computer Science 2022-12-01 Avinash Amballa , Advaith P. , Pradip Sasmal , Sumohana Channappayya

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Document-level neural machine translation has yielded attractive improvements. However, majority of existing methods roughly use all context sentences in a fixed scope. They neglect the fact that different source sentences need different…

Computation and Language · Computer Science 2020-10-12 Xiaomian Kang , Yang Zhao , Jiajun Zhang , Chengqing Zong

Deep Reinforcement Learning for Active Human Pose Estimation

Most 3d human pose estimation methods assume that input -- be it images of a scene collected from one or several viewpoints, or from a video -- is given. Consequently, they focus on estimates leveraging prior knowledge and measurement by…

Computer Vision and Pattern Recognition · Computer Science 2020-12-17 Erik Gärtner , Aleksis Pirinen , Cristian Sminchisescu

Learn to Change the World: Multi-level Reinforcement Learning with Model-Changing Actions

Reinforcement learning usually assumes a given or sometimes even fixed environment in which an agent seeks an optimal policy to maximize its long-term discounted reward. In contrast, we consider agents that are not limited to passive…

Machine Learning · Computer Science 2025-10-20 Ziqing Lu , Babak Hassibi , Lifeng Lai , Weiyu Xu

Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning

Existing automatic 3D image segmentation methods usually fail to meet the clinic use. Many studies have explored an interactive strategy to improve the image segmentation performance by iteratively incorporating user hints. However, the…

Computer Vision and Pattern Recognition · Computer Science 2019-11-26 Xuan Liao , Wenhao Li , Qisen Xu , Xiangfeng Wang , Bo Jin , Xiaoyun Zhang , Ya Zhang , Yanfeng Wang

Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients

In this paper, we explore using deep reinforcement learning for problems with multiple agents. Most existing methods for deep multi-agent reinforcement learning consider only a small number of agents. When the number of agents increases,…

Machine Learning · Computer Science 2018-05-24 Arbaaz Khan , Clark Zhang , Daniel D. Lee , Vijay Kumar , Alejandro Ribeiro

Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications

How to improve the ability of scene representation is a key issue in vision-oriented decision-making applications, and current approaches usually learn task-relevant state representations within visual reinforcement learning to address this…

Artificial Intelligence · Computer Science 2024-10-24 Dayang Liang , Jinyang Lai , Yunlong Liu

Deep Reinforcement Learning for 2D Physics-Based Object Manipulation in Clutter

Deep Reinforcement Learning (DRL) is a quickly evolving research field rooted in operations research and behavioural psychology, with potential applications extending across various domains, including robotics. This thesis delineates the…

Robotics · Computer Science 2023-12-11 Luca Renna

Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training

Actor-critic deep reinforcement learning (DRL) algorithms have recently achieved prominent success in tackling various challenging reinforcement learning (RL) problems, particularly complex control tasks with high-dimensional continuous…

Machine Learning · Computer Science 2023-05-04 Gang Chen , Victoria Huang