English
Related papers

Related papers: Time Hopping technique for faster reinforcement le…

200 papers

A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on…

Artificial Intelligence · Computer Science 2009-03-31 Petar Kormushev , Kohei Nomoto , Fangyan Dong , Kaoru Hirota

A mechanism called Eligibility Propagation is proposed to speed up the Time Hopping technique used for faster Reinforcement Learning in simulations. Eligibility Propagation provides for Time Hopping similar abilities to what eligibility…

Artificial Intelligence · Computer Science 2009-04-06 Petar Kormushev , Kohei Nomoto , Fangyan Dong , Kaoru Hirota

This paper has been withdrawn by the author. This draft is withdrawn for its poor quality in english, unfortunately produced by the author when he was just starting his science route. Look at the ICML version instead:…

Machine Learning · Computer Science 2012-06-11 Yao HengShuai

This paper has been withdrawn by the author because it needs to be rewritten completely.

High Energy Physics - Theory · Physics 2009-08-07 Yan-Gang Miao

This paper has been withdrawn by the author

High Energy Physics - Theory · Physics 2008-01-08 Pulak Ranjan Giri

Temporal difference (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor lambda. Currently the most important application of these methods is to temporal…

Artificial Intelligence · Computer Science 2008-02-03 P. Cichosz

This paper has been withdrawn by the author due to similarity to the author's other paper

General Physics · Physics 2009-04-13 Erez M. Yahalomi

This work presents the application of reinforcement learning to improve the performance of a highly dynamic hopping system with a parallel mechanism. Unlike serial mechanisms, parallel mechanisms can not be accurately simulated due to the…

Robotics · Computer Science 2025-01-22 Hongbo Zhang , Xiangyu Chu , Yanlin Chen , Yunxi Tang , Linzhu Yue , Yun-Hui Liu , Kwok Wai Samuel Au

We study the possibility of accommodating both early and late-time tensions using a novel reinforcement learning technique. By applying this technique, we aim to optimize the evolution of the Hubble parameter from recombination to the…

Cosmology and Nongalactic Astrophysics · Physics 2025-04-03 Mohit K. Sharma , M. Sami

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Sravanti Addepalli , Kaushal Bhogale , Priyam Dey , R. Venkatesh Babu

There is a technical issue in the analysis that is not easily fixable. We, therefore, withdraw the submission. Sorry for the inconvenience.

Artificial Intelligence · Computer Science 2020-06-30 Sham Kakade , Mengdi Wang , Lin F. Yang

Trained models are often composed with post-hoc transforms such as temperature scaling (TS), ensembling and stochastic weight averaging (SWA) to improve performance, robustness, uncertainty estimation, etc. However, such transforms are…

Machine Learning · Computer Science 2024-10-07 Rishabh Ranjan , Saurabh Garg , Mrigank Raman , Carlos Guestrin , Zachary Lipton

Tactical driving decision making is crucial for autonomous driving systems and has attracted considerable interest in recent years. In this paper, we propose several practical components that can speed up deep reinforcement learning…

Artificial Intelligence · Computer Science 2018-02-02 Jingchu Liu , Pengfei Hou , Lisen Mu , Yinan Yu , Chang Huang

Experience replay is one of the most commonly used approaches to improve the sample efficiency of reinforcement learning algorithms. In this work, we propose an approach to select and replay sequences of transitions in order to accelerate…

Artificial Intelligence · Computer Science 2022-09-29 Thommen George Karimpanal , Roland Bouffanais

Current reinforcement learning algorithms train an agent using forward-generated trajectories, which provide little guidance so that the agent can explore as much as possible. While realizing the value of reinforcement learning results from…

Artificial Intelligence · Computer Science 2023-09-06 KyungMin Ko

This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using…

Machine Learning · Computer Science 2026-03-23 Hugo Cazaux , Ralph Rudd , Hlynur Stefánsson , Sverrir Ólafsson , Eyjólfur Ingi Ásgeirsson

This paper has been withdrawn.

Quantum Physics · Physics 2007-05-23 Cristopher Moore

This paper has been withdrawn by the author.

Quantum Physics · Physics 2013-05-29 T. Kaufherr , Y. Aharonov , S. Nussinov , S. Popescu , J. Tollaksen

This paper has been withdrawn by the author(s), due to the existence of a much better paper in http://arxiv.org/abs/cs.CR/0207027

Cryptography and Security · Computer Science 2007-05-23 Boaz Tsaban

This paper has been withdrawn by the author due to a crucial error in the submission action.

Computer Science and Game Theory · Computer Science 2008-09-24 Riccardo Alberti
‹ Prev 1 2 3 10 Next ›