Related papers: DOP: Deep Optimistic Planning with Approximate Val…

Multi-Objective Deep Reinforcement Learning

We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional multi-objective decision problems where the relative importances of the objectives are not known a priori. Using features from the high-dimensional inputs,…

Artificial Intelligence · Computer Science 2016-10-11 Hossam Mossalam , Yannis M. Assael , Diederik M. Roijers , Shimon Whiteson

Q-CP: Learning Action Values for Cooperative Planning

Research on multi-robot systems has demonstrated promising results in manifold applications and domains. Still, efficiently learning an effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and…

Robotics · Computer Science 2018-03-02 Francesco Riccio , Roberto Capobianco , Daniele Nardi

DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning

Safe reinforcement learning is extremely challenging--not only must the agent explore an unknown environment, it must do so while ensuring no safety constraint violations. We formulate this safe reinforcement learning (RL) problem using the…

Machine Learning · Computer Science 2022-10-19 Archana Bura , Aria HasanzadeZonuzy , Dileep Kalathil , Srinivas Shakkottai , Jean-Francois Chamberland

Detect and Act: Automated Dynamic Optimizer through Meta-Black-Box Optimization

Dynamic Optimization Problems (DOPs) are challenging to address due to their complex nature, i.e., dynamic environment variation. Evolutionary Computation methods are generally advantaged in solving DOPs since they resemble dynamic…

Neural and Evolutionary Computing · Computer Science 2026-02-02 Zijian Gao , Yuanting Zhong , Zeyuan Ma , Yue-Jiao Gong , Hongshu Guo

Research on Autonomous Robots Navigation based on Reinforcement Learning

Reinforcement learning continuously optimizes decision-making based on real-time feedback reward signals through continuous interaction with the environment, demonstrating strong adaptive and self-learning capabilities. In recent years, it…

Robotics · Computer Science 2024-08-15 Zixiang Wang , Hao Yan , Yining Wang , Zhengjia Xu , Zhuoyue Wang , Zhizhong Wu

Using Deep Reinforcement Learning for the Continuous Control of Robotic Arms

Deep reinforcement learning enables algorithms to learn complex behavior, deal with continuous action spaces and find good strategies in environments with high dimensional state spaces. With deep reinforcement learning being an active area…

Machine Learning · Computer Science 2018-10-17 Winfried Lötzsch

Regret Minimization for Partially Observable Deep Reinforcement Learning

Deep reinforcement learning algorithms that estimate state and state-action value functions have been shown to be effective in a variety of challenging domains, including learning control strategies from raw image pixels. However,…

Machine Learning · Computer Science 2018-10-26 Peter Jin , Kurt Keutzer , Sergey Levine

Active Robotic Mapping through Deep Reinforcement Learning

We propose an approach to learning agents for active robotic mapping, where the goal is to map the environment as quickly as possible. The agent learns to map efficiently in simulated environments by receiving rewards corresponding to how…

Robotics · Computer Science 2018-01-01 Shane Barratt

Aggressive Perception-Aware Navigation using Deep Optical Flow Dynamics and PixelMPC

Recently, vision-based control has gained traction by leveraging the power of machine learning. In this work, we couple a model predictive control (MPC) framework to a visual pipeline. We introduce deep optical flow (DOF) dynamics, which is…

Robotics · Computer Science 2020-01-09 Keuntaek Lee , Jason Gibson , Evangelos A. Theodorou

Learning and Reasoning for Robot Dialog and Navigation Tasks

Reinforcement learning and probabilistic reasoning algorithms aim at learning from interaction experiences and reasoning with probabilistic contextual knowledge respectively. In this research, we develop algorithms for robot task…

Artificial Intelligence · Computer Science 2020-09-02 Keting Lu , Shiqi Zhang , Peter Stone , Xiaoping Chen

Robot path planning using deep reinforcement learning

Autonomous navigation is challenging for mobile robots, especially in an unknown environment. Commonly, the robot requires multiple sensors to map the environment, locate itself, and make a plan to reach the target. However, reinforcement…

Robotics · Computer Science 2023-03-08 Miguel Quinones-Ramirez , Jorge Rios-Martinez , Victor Uc-Cetina

Scalable Multi-Robot Informative Path Planning for Target Mapping via Deep Reinforcement Learning

Autonomous robots are widely utilized for mapping and exploration tasks due to their cost-effectiveness. Multi-robot systems offer scalability and efficiency, especially in terms of the number of robots deployed in more complex…

Robotics · Computer Science 2025-06-04 Apoorva Vashisth , Manav Kulshrestha , Damon Conover , Aniket Bera

Deep reinforcement learning oriented for real world dynamic scenarios

Autonomous navigation in dynamic environments is a complex but essential task for autonomous robots. Recent deep reinforcement learning approaches show promising results to solve the problem, but it is not solved yet, as they typically…

Robotics · Computer Science 2022-10-21 Diego Martinez , Luis Riazuelo , Luis Montano

Deep Reinforcement Learning with Adjustments

Deep reinforcement learning (RL) algorithms can learn complex policies to optimize agent operation over time. RL algorithms have shown promising results in solving complicated problems in recent years. However, their application on…

Machine Learning · Computer Science 2021-09-29 Hamed Khorasgani , Haiyan Wang , Chetan Gupta , Susumu Serita

Action Pick-up in Dynamic Action Space Reinforcement Learning

Most reinforcement learning algorithms are based on a key assumption that Markov decision processes (MDPs) are stationary. However, non-stationary MDPs with dynamic action space are omnipresent in real-world scenarios. Yet problems of…

Machine Learning · Computer Science 2023-04-04 Jiaqi Ye , Xiaodong Li , Pangjing Wu , Feng Wang

Deep Exploration via Randomized Value Functions

We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to…

Machine Learning · Statistics 2019-09-25 Ian Osband , Benjamin Van Roy , Daniel Russo , Zheng Wen

ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning

Reinforcement learning has become a cornerstone technique for developing reasoning models in complex tasks, ranging from mathematical problem-solving to imaginary reasoning. The optimization of these models typically relies on policy…

Machine Learning · Computer Science 2026-02-11 Qingnan Ren , Shiting Huang , Zhen Fang , Zehui Chen , Lin Chen , Lijun Li , Feng Zhao

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

To overcome the curses of dimensionality and modeling of Dynamic Programming (DP) methods to solve Markov Decision Process (MDP) problems, Reinforcement Learning (RL) methods are adopted in practice. Contrary to traditional RL algorithms…

Machine Learning · Computer Science 2021-08-24 Arghyadip Roy , Vivek Borkar , Abhay Karandikar , Prasanna Chaporkar

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning

We study the problem of reinforcement learning in infinite-horizon discounted linear Markov decision processes (MDPs), and propose the first computationally efficient algorithm achieving rate-optimal regret guarantees in this setting. Our…

Machine Learning · Computer Science 2026-03-16 Antoine Moulin , Gergely Neu , Luca Viano

Deep Learned Path Planning via Randomized Reward-Linked-Goals and Potential Space Applications

Space exploration missions have seen use of increasingly sophisticated robotic systems with ever more autonomy. Deep learning promises to take this even a step further, and has applications for high-level tasks, like path planning, as well…

Machine Learning · Computer Science 2019-09-16 Tamir Blum , William Jones , Kazuya Yoshida