Related papers: Solving Rubik's Cube Without Tricky Sampling

CubeTR: Learning to Solve The Rubiks Cube Using Transformers

Since its first appearance, transformers have been successfully used in wide ranging domains from computer vision to natural language processing. Application of transformers in Reinforcement Learning by reformulating it as a sequence…

Machine Learning · Computer Science 2023-10-31 Mustafa Ebrahim Chasmai

Solving a Rubik's Cube Using its Local Graph Structure

The Rubix Cube is a 3-dimensional single-player combination puzzle attracting attention in the reinforcement learning community. A Rubix Cube has six faces and twelve possible actions, leading to a small and unconstrained action space and a…

Artificial Intelligence · Computer Science 2024-08-16 Shunyu Yao , Mitchy Lee

Towards Learning Rubik's Cube with N-tuple-based Reinforcement Learning

This work describes in detail how to learn and solve the Rubik's cube game (or puzzle) in the General Board Game (GBG) learning and playing framework. We cover the cube sizes 2x2x2 and 3x3x3. We describe in detail the cube's state…

Machine Learning · Computer Science 2023-01-31 Wolfgang Konen

Solving the Rubik's Cube Without Human Knowledge

A generally intelligent agent must be able to teach itself how to solve problems in complex domains with minimal human supervision. Recently, deep reinforcement learning algorithms combined with self-play have achieved superhuman…

Artificial Intelligence · Computer Science 2018-05-22 Stephen McAleer , Forest Agostinelli , Alexander Shmakov , Pierre Baldi

Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties

Constrained Reinforcement Learning has been employed to enforce safety constraints on policy through the use of expected cost constraints. The key challenge is in handling expected cost accumulated using the policy and not just in a single…

Machine Learning · Computer Science 2024-01-17 Hao Jiang , Tien Mai , Pradeep Varakantham , Minh Huy Hoang

Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning

Rubik's Cube is one of the most famous combinatorial puzzles involving nearly $4.3 \times 10^{19}$ possible configurations. Its mathematical description is expressed by the Rubik's group, whose elements define how its layers rotate. We…

Quantum Physics · Physics 2021-09-16 Sebastiano Corli , Lorenzo Moro , Davide E. Galli , Enrico Prati

Self-Supervision is All You Need for Solving Rubik's Cube

Existing combinatorial search methods are often complex and require some level of expertise. This work introduces a simple and efficient deep learning method for solving combinatorial problems with a predefined goal, represented by Rubik's…

Machine Learning · Computer Science 2023-05-24 Kyo Takano

A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning

We consider the problem of control in the setting of reinforcement learning (RL), where model information is not available. Policy gradient algorithms are a popular solution approach for this problem and are usually shown to converge to a…

Machine Learning · Computer Science 2023-04-24 Mizhaan Prajit Maniyar , Akash Mondal , Prashanth L. A. , Shalabh Bhatnagar

A Machine Learning Approach That Beats Large Rubik's Cubes

The paper proposes a novel machine learning-based approach to the pathfinding problem on extremely large graphs. This method leverages diffusion distance estimation via a neural network and uses beam search for pathfinding. We demonstrate…

Machine Learning · Computer Science 2025-02-20 Alexander Chervov , Kirill Khoruzhii , Nikita Bukhal , Jalal Naghiyev , Vladislav Zamkovoy , Ivan Koltsov , Lyudmila Cheldieva , Arsenii Sychev , Arsenii Lenin , Mark Obozov , Egor Urvanov , Alexey Romanov

Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems

We report a novel, computationally efficient approach for solving hard nonlinear problems of reinforcement learning (RL). Here we combine umbrella sampling, from computational physics/chemistry, with optimal control methods. The approach is…

Machine Learning · Computer Science 2025-02-28 Egor E. Nuzhin , Nikolai V. Brilliantov

Robust Predictable Control

Many of the challenges facing today's reinforcement learning (RL) algorithms, such as robustness, generalization, transfer, and computational efficiency are closely related to compression. Prior work has convincingly argued why minimizing…

Machine Learning · Computer Science 2021-09-08 Benjamin Eysenbach , Ruslan Salakhutdinov , Sergey Levine

A Quantum States Preparation Method Based on Difference-Driven Reinforcement Learning

Due to the large state space of the two-qubit system, and the adoption of ladder reward function in the existing quantum state preparation methods, the convergence speed is slow and it is difficult to prepare the desired target quantum…

Quantum Physics · Physics 2024-05-14 Wenjie Liu , Jing Xu , Bosi Wang

On Solving the Rubik's Cube with Domain-Independent Planners Using Standard Representations

Rubik's Cube (RC) is a well-known and computationally challenging puzzle that has motivated AI researchers to explore efficient alternative representations and problem-solving methods. The ideal situation for planning here is that a problem…

Artificial Intelligence · Computer Science 2023-08-22 Bharath Muppasani , Vishal Pallagani , Biplav Srivastava , Forest Agostinelli

Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks

Improving sample efficiency is central to Reinforcement Learning (RL), especially in environments where the rewards are sparse. Some recent approaches have proposed to specify reward functions as manually designed or learned reward…

Machine Learning · Computer Science 2024-01-26 Shuai Han , Mehdi Dastani , Shihan Wang

Divide-and-Conquer Reinforcement Learning

Standard model-free deep reinforcement learning (RL) algorithms sample a new initial state for each trial, allowing them to optimize policies that can perform well even in highly stochastic environments. However, problems that exhibit…

Machine Learning · Computer Science 2018-04-30 Dibya Ghosh , Avi Singh , Aravind Rajeswaran , Vikash Kumar , Sergey Levine

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means…

Machine Learning · Computer Science 2021-12-28 Anton Dereventsov , Ranga Raju Vatsavai , Clayton Webster

Ranking Policy Gradient

Sample inefficiency is a long-lasting problem in reinforcement learning (RL). The state-of-the-art estimates the optimal action values while it usually involves an extensive search over the state-action space and unstable optimization.…

Machine Learning · Computer Science 2019-11-27 Kaixiang Lin , Jiayu Zhou

Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles

Reinforcement learning (RL) methods have been shown to be capable of learning intelligent behavior in rich domains. However, this has largely been done in simulated domains without adequate focus on the process of building the simulator. In…

Machine Learning · Computer Science 2019-10-24 Aditya Modi , Nan Jiang , Ambuj Tewari , Satinder Singh

A Simple Reward-free Approach to Constrained Reinforcement Learning

In constrained reinforcement learning (RL), a learning agent seeks to not only optimize the overall reward but also satisfy the additional safety, diversity, or budget constraints. Consequently, existing constrained RL solutions require…

Machine Learning · Computer Science 2021-07-13 Sobhan Miryoosefi , Chi Jin

Learning Intrinsic Symbolic Rewards in Reinforcement Learning

Learning effective policies for sparse objectives is a key challenge in Deep Reinforcement Learning (RL). A common approach is to design task-related dense rewards to improve task learnability. While such rewards are easily interpreted,…

Machine Learning · Computer Science 2020-10-12 Hassam Sheikh , Shauharda Khadka , Santiago Miret , Somdeb Majumdar