Related papers: Time Hopping technique for faster reinforcement le…

Time manipulation technique for speeding up reinforcement learning in simulations

A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on…

Artificial Intelligence · Computer Science 2009-03-31 Petar Kormushev , Kohei Nomoto , Fangyan Dong , Kaoru Hirota

Eligibility Propagation to Speed up Time Hopping for Reinforcement Learning

A mechanism called Eligibility Propagation is proposed to speed up the Time Hopping technique used for faster Reinforcement Learning in simulations. Eligibility Propagation provides for Time Hopping similar abilities to what eligibility…

Artificial Intelligence · Computer Science 2009-04-06 Petar Kormushev , Kohei Nomoto , Fangyan Dong , Kaoru Hirota

Preconditioned Temporal Difference Learning

This paper has been withdrawn by the author. This draft is withdrawn for its poor quality in english, unfortunately produced by the author when he was just starting his science route. Look at the ICML version instead:…

Machine Learning · Computer Science 2012-06-11 Yao HengShuai

Effective Framework of kappa-Minkowski Spacetime and Its Application

This paper has been withdrawn by the author because it needs to be rewritten completely.

High Energy Physics - Theory · Physics 2009-08-07 Yan-Gang Miao

Optimal time evolution in (non)hermitian quantum mechanics

This paper has been withdrawn by the author

High Energy Physics - Theory · Physics 2008-01-08 Pulak Ranjan Giri

Truncating Temporal Differences: On the Efficient Implementation of TD(lambda) for Reinforcement Learning

Temporal difference (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor lambda. Currently the most important application of these methods is to temporal…

Artificial Intelligence · Computer Science 2008-02-03 P. Cichosz

Post acceleraed Frames

This paper has been withdrawn by the author due to similarity to the author's other paper

General Physics · Physics 2009-04-13 Erez M. Yahalomi

Learning to Hop for a Single-Legged Robot with Parallel Mechanism

This work presents the application of reinforcement learning to improve the performance of a highly dynamic hopping system with a parallel mechanism. Unlike serial mechanisms, parallel mechanisms can not be accurately simulated due to the…

Robotics · Computer Science 2025-01-22 Hongbo Zhang , Xiangyu Chu , Yanlin Chen , Yunxi Tang , Linzhu Yue , Yun-Hui Liu , Kwok Wai Samuel Au

Reconciling Early and Late Time Tensions with Reinforcement Learning

We study the possibility of accommodating both early and late-time tensions using a novel reinforcement learning technique. By applying this technique, we aim to optimize the evolution of the Hubble parameter from recombination to the…

Cosmology and Nongalactic Astrophysics · Physics 2025-04-03 Mohit K. Sharma , M. Sami

Towards Efficient and Effective Self-Supervised Learning of Visual Representations

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Sravanti Addepalli , Kaushal Bhogale , Priyam Dey , R. Venkatesh Babu

Variance Reduction Methods for Sublinear Reinforcement Learning

There is a technical issue in the analysis that is not easily fixable. We, therefore, withdraw the submission. Sorry for the inconvenience.

Artificial Intelligence · Computer Science 2020-06-30 Sham Kakade , Mengdi Wang , Lin F. Yang

Post-Hoc Reversal: Are We Selecting Models Prematurely?

Trained models are often composed with post-hoc transforms such as temperature scaling (TS), ensembling and stochastic weight averaging (SWA) to improve performance, robustness, uncertainty estimation, etc. However, such transforms are…

Machine Learning · Computer Science 2024-10-07 Rishabh Ranjan , Saurabh Garg , Mrigank Raman , Carlos Guestrin , Zachary Lipton

Elements of Effective Deep Reinforcement Learning towards Tactical Driving Decision Making

Tactical driving decision making is crucial for autonomous driving systems and has attracted considerable interest in recent years. In this paper, we propose several practical components that can speed up deep reinforcement learning…

Artificial Intelligence · Computer Science 2018-02-02 Jingchu Liu , Pengfei Hou , Lisen Mu , Yinan Yu , Chang Huang

Experience Replay Using Transition Sequences

Experience replay is one of the most commonly used approaches to improve the sample efficiency of reinforcement learning algorithms. In this work, we propose an approach to select and replay sequences of transitions in order to accelerate…

Artificial Intelligence · Computer Science 2022-09-29 Thommen George Karimpanal , Roland Bouffanais

Backward Curriculum Reinforcement Learning

Current reinforcement learning algorithms train an agent using forward-generated trajectories, which provide little guidance so that the agent can explore as much as possible. While realizing the value of reinforcement learning results from…

Artificial Intelligence · Computer Science 2023-09-06 KyungMin Ko

Fine-tuning Timeseries Predictors Using Reinforcement Learning

This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using…

Machine Learning · Computer Science 2026-03-23 Hugo Cazaux , Ralph Rudd , Hlynur Stefánsson , Sverrir Ólafsson , Eyjólfur Ingi Ásgeirsson

Another Way to Perform the Quantum Fourier Transform in Linear Parallel Time

This paper has been withdrawn.

Quantum Physics · Physics 2007-05-23 Cristopher Moore

A Novel Phase Shift Acquired due to Virtual Forces

This paper has been withdrawn by the author.

Quantum Physics · Physics 2013-05-29 T. Kaufherr , Y. Aharonov , S. Nussinov , S. Popescu , J. Tollaksen

Pseudorandom permutations with the fast forward property

This paper has been withdrawn by the author(s), due to the existence of a much better paper in http://arxiv.org/abs/cs.CR/0207027

Cryptography and Security · Computer Science 2007-05-23 Boaz Tsaban

Temporized Equilibria

This paper has been withdrawn by the author due to a crucial error in the submission action.

Computer Science and Game Theory · Computer Science 2008-09-24 Riccardo Alberti