Related papers: Enhanced Scene Specificity with Sparse Dynamic Val…

Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

Training deep reinforcement learning agents on environments with multiple levels / scenes from the same task, has become essential for many applications aiming to achieve generalization and domain transfer from simulation to the real world.…

Machine Learning · Computer Science 2021-02-16 Jaskirat Singh , Liang Zheng

Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

Training deep reinforcement learning agents on environments with multiple levels / scenes / conditions from the same task, has become essential for many applications aiming to achieve generalization and domain transfer from simulation to…

Machine Learning · Computer Science 2020-05-26 Jaskirat Singh , Liang Zheng

Multi-Frame Self-Supervised Depth Estimation with Multi-Scale Feature Fusion in Dynamic Scenes

Multi-frame methods improve monocular depth estimation over single-frame approaches by aggregating spatial-temporal information via feature matching. However, the spatial-temporal feature leads to accuracy degradation in dynamic scenes. To…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Jiquan Zhong , Xiaolin Huang , Xiao Yu

Shaping Advice in Deep Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning involves multiple agents interacting with each other and a shared environment to complete tasks. When rewards provided by the environment are sparse, agents may not receive immediate feedback on the…

Machine Learning · Computer Science 2021-03-31 Baicen Xiao , Bhaskar Ramasubramanian , Radha Poovendran

Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning

Existing automatic 3D image segmentation methods usually fail to meet the clinic use. Many studies have explored an interactive strategy to improve the image segmentation performance by iteratively incorporating user hints. However, the…

Computer Vision and Pattern Recognition · Computer Science 2019-11-26 Xuan Liao , Wenhao Li , Qisen Xu , Xiangfeng Wang , Bo Jin , Xiaoyun Zhang , Ya Zhang , Yanfeng Wang

Variance Reduced Policy Gradient Method for Multi-Objective Reinforcement Learning

Multi-Objective Reinforcement Learning (MORL) is a generalization of traditional Reinforcement Learning (RL) that aims to optimize multiple, often conflicting objectives simultaneously rather than focusing on a single reward. This approach…

Machine Learning · Computer Science 2025-08-15 Davide Guidobene , Lorenzo Benedetti , Diego Arapovic

Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus

This paper considers offline multi-agent reinforcement learning. We propose the strategy-wise concentration principle which directly builds a confidence interval for the joint strategy, in contrast to the point-wise concentration principle…

Machine Learning · Computer Science 2022-10-17 Qiwen Cui , Simon S. Du

What Should I Know? Using Meta-gradient Descent for Predictive Feature Discovery in a Single Stream of Experience

In computational reinforcement learning, a growing body of work seeks to construct an agent's perception of the world through predictions of future sensations; predictions about environment observations are used as additional input features…

Machine Learning · Computer Science 2022-06-15 Alexandra Kearney , Anna Koop , Johannes Günther , Patrick M. Pilarski

Density-based Curriculum for Multi-goal Reinforcement Learning with Sparse Rewards

Multi-goal reinforcement learning (RL) aims to qualify the agent to accomplish multi-goal tasks, which is of great importance in learning scalable robotic manipulation skills. However, reward engineering always requires strenuous efforts in…

Robotics · Computer Science 2021-09-27 Deyu Yang , Hanbo Zhang , Xuguang Lan , Jishiyu Ding

Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning

Deep reinforcement learning (DRL) has successfully solved various problems recently, typically with a unimodal policy representation. However, grasping distinguishable skills for some tasks with non-unique optima can be essential for…

Machine Learning · Computer Science 2021-04-20 Jie Ren , Yewen Li , Zihan Ding , Wei Pan , Hao Dong

Adaptive Segment-level Reward: Bridging the Gap Between Action and Reward Space in Alignment

Reinforcement Learning (RL) has proven highly effective in aligning Large Language Models (LLMs) with human preferences. Typical RL methods optimize under an overall sequence reward, which can lead to a suboptimal learning process. This…

Machine Learning · Computer Science 2025-02-26 Yanshi Li , Shaopan Xiong , Gengru Chen , Xiaoyang Li , Yijia Luo , Xingyuan Bu , Yingshui Tan , Wenbo Su , Bo Zheng

Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination

Many cooperative multiagent reinforcement learning environments provide agents with a sparse team-based reward, as well as a dense agent-specific reward that incentivizes learning basic skills. Training policies solely on the team-based…

Machine Learning · Computer Science 2020-10-13 Shauharda Khadka , Somdeb Majumdar , Santiago Miret , Stephen McAleer , Kagan Tumer

Automation and Feature Selection Enhancement with Reinforcement Learning (RL)

Effective feature selection, representation and transformation are principal steps in machine learning to improve prediction accuracy, model generalization and computational efficiency. Reinforcement learning provides a new perspective…

Machine Learning · Computer Science 2025-03-18 Sumana Sanyasipura Nagaraju

GMAC: A Distributional Perspective on Actor-Critic Framework

In this paper, we devise a distributional framework on actor-critic as a solution to distributional instability, action type restriction, and conflation between samples and statistics. We propose a new method that minimizes the Cram\'er…

Machine Learning · Computer Science 2021-07-16 Daniel Wontae Nam , Younghoon Kim , Chan Y. Park

Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment

Policy gradient methods are appealing in deep reinforcement learning but suffer from high variance of gradient estimate. To reduce the variance, the state value function is applied commonly. However, the effect of the state value function…

Machine Learning · Computer Science 2021-08-06 Jiaming Guo , Rui Zhang , Xishan Zhang , Shaohui Peng , Qi Yi , Zidong Du , Xing Hu , Qi Guo , Yunji Chen

Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning

Model-free deep reinforcement learning (RL) agents can learn an effective policy directly from repeated interactions with a black-box environment. However in practice, the algorithms often require large amounts of training experience to…

Machine Learning · Computer Science 2020-05-29 Parth Chadha

Learning Diverse Policies with Soft Self-Generated Guidance

Reinforcement learning (RL) with sparse and deceptive rewards is challenging because non-zero rewards are rarely obtained. Hence, the gradient calculated by the agent can be stochastic and without valid information. Recent studies that…

Machine Learning · Computer Science 2024-02-08 Guojian Wang , Faguo Wu , Xiao Zhang , Jianxiang Liu

Multi-agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

This paper introduces a hybrid algorithm of deep reinforcement learning (RL) and Force-based motion planning (FMP) to solve distributed motion planning problem in dense and dynamic environments. Individually, RL and FMP algorithms each have…

Machine Learning · Computer Science 2020-04-01 Samaneh Hosseini Semnani , Hugh Liu , Michael Everett , Anton de Ruiter , Jonathan P. How

Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty

To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely studied in single-agent regimes, in multi-agent environments, the…

Machine Learning · Computer Science 2024-05-10 Laixi Shi , Eric Mazumdar , Yuejie Chi , Adam Wierman

Improving On-policy Learning with Statistical Reward Accumulation

Deep reinforcement learning has obtained significant breakthroughs in recent years. Most methods in deep-RL achieve good results via the maximization of the reward signal provided by the environment, typically in the form of discounted…

Machine Learning · Computer Science 2018-09-10 Yubin Deng , Ke Yu , Dahua Lin , Xiaoou Tang , Chen Change Loy