Related papers: Time manipulation technique for speeding up reinfo…

Deep Reinforcement Learning for Time Optimal Velocity Control using Prior Knowledge

Autonomous navigation has recently gained great interest in the field of reinforcement learning. However, little attention was given to the time optimal velocity control problem, i.e. controlling a vehicle such that it travels at the…

Robotics · Computer Science 2019-07-26 Gabriel Hartmann, Zvi Shiller, Amos Azaria

A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems

Autonomous vehicles with a self-evolving ability are expected to cope with unknown scenarios in the real-world environment. Taking advantage of a trial-and-error mechanism, reinforcement learning is able to self-evolve by learning the optimal…

Robotics · Computer Science 2024-08-23 Shuo Yang, Liwen Wang, Yanjun Huang, Hong Chen

A Learning-Based Computational Impact Time Guidance

This paper investigates the problem of impact-time-control and proposes a learning-based computational guidance algorithm to solve this problem. The proposed guidance algorithm is developed based on a general prediction-correction concept:…

Machine Learning · Computer Science 2021-05-31 Zichao Liu, Jiang Wang, Shaoming He, Hyo-Sang Shin, Antonios Tsourdos

Eligibility Propagation to Speed up Time Hopping for Reinforcement Learning

A mechanism called Eligibility Propagation is proposed to speed up the Time Hopping technique used for faster Reinforcement Learning in simulations. Eligibility Propagation provides for Time Hopping similar abilities to what eligibility…

Artificial Intelligence · Computer Science 2009-04-06 Petar Kormushev, Kohei Nomoto, Fangyan Dong, Kaoru Hirota

Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning

Symmetry is pervasive in robotics and has been widely exploited to improve sample efficiency in deep reinforcement learning (DRL). However, existing approaches primarily focus on spatial symmetries, such as reflection, rotation, and…

Robotics · Computer Science 2025-10-22 Yunpeng Jiang, Jianshu Hu, Paul Weng, Yutong Ban

Backward Curriculum Reinforcement Learning

Current reinforcement learning algorithms train an agent using forward-generated trajectories, which provide little guidance so that the agent can explore as much as possible. While realizing the value of reinforcement learning results from…

Artificial Intelligence · Computer Science 2023-09-06 KyungMin Ko

Aiding reinforcement learning for set point control

While reinforcement learning has made great improvements, state-of-the-art algorithms can still struggle with seemingly simple set-point feedback control problems. One reason for this is that the learned controller may not be able to excite…

Systems and Control · Electrical Eng. & Systems 2023-04-21 Ruoqi Zhang, Per Mattsson, Torbjörn Wigren

Real-Time Scheduling via Reinforcement Learning

Cyber-physical systems, such as mobile robots, must respond adaptively to dynamic operating conditions. Effective operation of these systems requires that sensing and actuation tasks are performed in a timely manner. Additionally, execution…

Machine Learning · Computer Science 2012-03-19 Robert Glaubius, Terry Tidwell, Christopher Gill, William D. Smart

Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions

Using reinforcement learning to learn control policies is a challenge when the task is complex with potentially long horizons. Ensuring adequate but safe exploration is also crucial for controlling physical systems. In this paper, we use…

Machine Learning · Computer Science 2019-03-26 Xiao Li, Calin Belta

Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows

We propose an automata-theoretic approach for reinforcement learning (RL) under complex spatio-temporal constraints with time windows. The problem is formulated using a Markov decision process under a bounded temporal logic constraint.…

Artificial Intelligence · Computer Science 2023-08-01 Xiaoshan Lin, Abbasali Koochakzadeh, Yasin Yazicioglu, Derya Aksaray

Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks

Reinforcement learning has shown great promise in robotics thanks to its ability to develop efficient robotic control procedures through self-training. In particular, reinforcement learning has been successfully applied to solving the…

Robotics · Computer Science 2020-11-12 Pierre Aumjaud, David McAuliffe, Francisco Javier Rodríguez Lera, Philip Cardiff

Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system

Reinforcement learning has emerged as a promising methodology for training robot controllers. However, most results have been limited to simulation due to the need for a large number of samples and the lack of automated-yet-safe data…

Robotics · Computer Science 2018-03-29 Kendall Lowrey, Svetoslav Kolev, Jeremy Dao, Aravind Rajeswaran, Emanuel Todorov

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under…

Machine Learning · Computer Science 2026-04-03 Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause, Bhavya Sukhija

Reinforcement Learning with an Abrupt Model Change

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm…

Systems and Control · Electrical Eng. & Systems 2023-04-25 Wuxia Chen, Taposh Banerjee, Jemin George, Carl Busart

Reinforcement learning for adaptive interior point methods in convex quadratic programming

Quadratic programming is a workhorse of modern nonlinear optimization, control, and data science. Although regularized methods offer convergence guarantees under minimal assumptions on the problem data, they can exhibit the slow…

Optimization and Control · Mathematics 2026-05-18 Jeremy Bertoncini, Alberto De Marchi, Matthias Gerdts, Simon Gottschalk

Forward-Backward Reinforcement Learning

Goals for reinforcement learning problems are typically defined through hand-specified rewards. To design such problems, developers of learning algorithms must inherently be aware of what the task goals are, yet we often require agents to…

Machine Learning · Computer Science 2018-03-29 Ashley D. Edwards, Laura Downs, James C. Davidson

Constrained-Space Optimization and Reinforcement Learning for Complex Tasks

Learning from Demonstration is increasingly used for transferring operator manipulation skills to robots. In practice, it is important to cater for limited data and imperfect human demonstrations, as well as underlying safety constraints.…

Robotics · Computer Science 2020-04-03 Ya-Yen Tsai, Bo Xiao, Edward Johns, Guang-Zhong Yang

Reinforcement Learning for Pivoting Task

In this work we propose an approach to learn a robust policy for solving the pivoting task. Recently, several model-free continuous control algorithms were shown to learn successful policies without prior knowledge of the dynamics of the…

Robotics · Computer Science 2017-03-03 Rika Antonova, Silvia Cruciani, Christian Smith, Danica Kragic

Pseudorehearsal in value function approximation

Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation…

Artificial Intelligence · Computer Science 2017-03-22 Vladimir Marochko, Leonard Johard, Manuel Mazzara

Pseudorehearsal in actor-critic agents

Catastrophic forgetting has a serious impact in reinforcement learning, as the data distribution is generally sparse and non-stationary over time. The purpose of this study is to investigate whether pseudorehearsal can increase performance…

Artificial Intelligence · Computer Science 2017-04-18 Marochko Vladimir, Leonard Johard, Manuel Mazzara