English
Related papers

Related papers: Solving Rubik's Cube Without Tricky Sampling

200 papers

Since its first appearance, transformers have been successfully used in wide ranging domains from computer vision to natural language processing. Application of transformers in Reinforcement Learning by reformulating it as a sequence…

Machine Learning · Computer Science 2023-10-31 Mustafa Ebrahim Chasmai

The Rubix Cube is a 3-dimensional single-player combination puzzle attracting attention in the reinforcement learning community. A Rubix Cube has six faces and twelve possible actions, leading to a small and unconstrained action space and a…

Artificial Intelligence · Computer Science 2024-08-16 Shunyu Yao , Mitchy Lee

This work describes in detail how to learn and solve the Rubik's cube game (or puzzle) in the General Board Game (GBG) learning and playing framework. We cover the cube sizes 2x2x2 and 3x3x3. We describe in detail the cube's state…

Machine Learning · Computer Science 2023-01-31 Wolfgang Konen

A generally intelligent agent must be able to teach itself how to solve problems in complex domains with minimal human supervision. Recently, deep reinforcement learning algorithms combined with self-play have achieved superhuman…

Artificial Intelligence · Computer Science 2018-05-22 Stephen McAleer , Forest Agostinelli , Alexander Shmakov , Pierre Baldi

Constrained Reinforcement Learning has been employed to enforce safety constraints on policy through the use of expected cost constraints. The key challenge is in handling expected cost accumulated using the policy and not just in a single…

Machine Learning · Computer Science 2024-01-17 Hao Jiang , Tien Mai , Pradeep Varakantham , Minh Huy Hoang

Rubik's Cube is one of the most famous combinatorial puzzles involving nearly $4.3 \times 10^{19}$ possible configurations. Its mathematical description is expressed by the Rubik's group, whose elements define how its layers rotate. We…

Quantum Physics · Physics 2021-09-16 Sebastiano Corli , Lorenzo Moro , Davide E. Galli , Enrico Prati

Existing combinatorial search methods are often complex and require some level of expertise. This work introduces a simple and efficient deep learning method for solving combinatorial problems with a predefined goal, represented by Rubik's…

Machine Learning · Computer Science 2023-05-24 Kyo Takano

We consider the problem of control in the setting of reinforcement learning (RL), where model information is not available. Policy gradient algorithms are a popular solution approach for this problem and are usually shown to converge to a…

Machine Learning · Computer Science 2023-04-24 Mizhaan Prajit Maniyar , Akash Mondal , Prashanth L. A. , Shalabh Bhatnagar

The paper proposes a novel machine learning-based approach to the pathfinding problem on extremely large graphs. This method leverages diffusion distance estimation via a neural network and uses beam search for pathfinding. We demonstrate…

We report a novel, computationally efficient approach for solving hard nonlinear problems of reinforcement learning (RL). Here we combine umbrella sampling, from computational physics/chemistry, with optimal control methods. The approach is…

Machine Learning · Computer Science 2025-02-28 Egor E. Nuzhin , Nikolai V. Brilliantov

Many of the challenges facing today's reinforcement learning (RL) algorithms, such as robustness, generalization, transfer, and computational efficiency are closely related to compression. Prior work has convincingly argued why minimizing…

Machine Learning · Computer Science 2021-09-08 Benjamin Eysenbach , Ruslan Salakhutdinov , Sergey Levine

Due to the large state space of the two-qubit system, and the adoption of ladder reward function in the existing quantum state preparation methods, the convergence speed is slow and it is difficult to prepare the desired target quantum…

Quantum Physics · Physics 2024-05-14 Wenjie Liu , Jing Xu , Bosi Wang

Rubik's Cube (RC) is a well-known and computationally challenging puzzle that has motivated AI researchers to explore efficient alternative representations and problem-solving methods. The ideal situation for planning here is that a problem…

Artificial Intelligence · Computer Science 2023-08-22 Bharath Muppasani , Vishal Pallagani , Biplav Srivastava , Forest Agostinelli

Improving sample efficiency is central to Reinforcement Learning (RL), especially in environments where the rewards are sparse. Some recent approaches have proposed to specify reward functions as manually designed or learned reward…

Machine Learning · Computer Science 2024-01-26 Shuai Han , Mehdi Dastani , Shihan Wang

Standard model-free deep reinforcement learning (RL) algorithms sample a new initial state for each trial, allowing them to optimize policies that can perform well even in highly stochastic environments. However, problems that exhibit…

Machine Learning · Computer Science 2018-04-30 Dibya Ghosh , Avi Singh , Aravind Rajeswaran , Vikash Kumar , Sergey Levine

In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means…

Machine Learning · Computer Science 2021-12-28 Anton Dereventsov , Ranga Raju Vatsavai , Clayton Webster

Sample inefficiency is a long-lasting problem in reinforcement learning (RL). The state-of-the-art estimates the optimal action values while it usually involves an extensive search over the state-action space and unstable optimization.…

Machine Learning · Computer Science 2019-11-27 Kaixiang Lin , Jiayu Zhou

Reinforcement learning (RL) methods have been shown to be capable of learning intelligent behavior in rich domains. However, this has largely been done in simulated domains without adequate focus on the process of building the simulator. In…

Machine Learning · Computer Science 2019-10-24 Aditya Modi , Nan Jiang , Ambuj Tewari , Satinder Singh

In constrained reinforcement learning (RL), a learning agent seeks to not only optimize the overall reward but also satisfy the additional safety, diversity, or budget constraints. Consequently, existing constrained RL solutions require…

Machine Learning · Computer Science 2021-07-13 Sobhan Miryoosefi , Chi Jin

Learning effective policies for sparse objectives is a key challenge in Deep Reinforcement Learning (RL). A common approach is to design task-related dense rewards to improve task learnability. While such rewards are easily interpreted,…

Machine Learning · Computer Science 2020-10-12 Hassam Sheikh , Shauharda Khadka , Santiago Miret , Somdeb Majumdar
‹ Prev 1 2 3 10 Next ›