TalkChess.com

Posted: **Thu Jul 18, 2019 9:46 pm**

Daniel Shawul wrote: ↑Thu Jul 18, 2019 5:24 pm Can it beat this guy though who solves 17x17x17 cube in about 2 hours ? https://www.youtube.com/watch?v=7ChuKKL2PpU

I would like to understand what makes it a unique challenge. From what I understood from the abstract

- single goal state
- solved in reverse with root node being that single goal state (i.e. solved state)
- finds shortest path 60% of the time

Please add more if you read the full paper.

They want $32 for the full pdf, but you can read the first page here:

https://www.nature.com/articles/s42256- ... rer=nature

I did not buy the paper, but from reading the first page:

- They use weighted A* search.
- They use something called "deep approximate value iteration" to train the heuristic function used by the A* algorithm.

In regular value iteration you use a big lookup table and compute the table values by iterating the Bellman equation. (In deterministic cases, like when computing tablebases, you don't even have to iterate more than once if you compute the values in the table in a suitable order.)

If the state space it too big (like for the 3x3x3 cube) you cannot store the whole table. Therefore they instead use "deep approximate value iteration", which means they use a deep neural network as an approximation to the lookup table. The training attempts to minimize the mean squared residual from the Bellman equation. Presumably the mean is taken over the states encountered by starting from the goal position and performing random walks of progressively increasing length.

If you have a complete table you don't need to perform any search to find the solution (like for a tablebase), but since they only have an approximation they use weighted A* to find an actual solution.

Daniel Anulliero wrote: ↑Thu Jul 18, 2019 5:32 pm
Daniel Shawul wrote: ↑Thu Jul 18, 2019 5:24 pm Can it beat this guy though who solves 17x17x17 cube in about 2 hours ? https://www.youtube.com/watch?v=7ChuKKL2PpU
Fake , reversed video

Maybe the fake claim was a joke, but it is clear to me that the video is not reversed. It is not really algorithmically harder to solve a large cube than say a 5x5x5 or a 4x4x4 cube, even though it is practically much harder since it requires many more moves and each move is physically more challenging to perform.

If you take a solved cube and randomly shuffle it, then reverse the video, when you look at the video it would appear that you are making random moves that magically in the end causes the cube to be solved. In contrast, in this video the guy is gradually putting pieces into the right positions and also explains what he is doing while he is doing it.

It would be very interesting to know if their algorithm could actually be trained to learn how to solve a 17x17x17 cube. The state space is very large. Just the inner pieces can be arranged in (24!/4!^6)^56 ~= 10^868 ways. My guess is that their algorithm cannot solve 17x17x17 cubes, or else they probably would have mentioned larger cubes in the abstract.

Posted: **Thu Jul 18, 2019 10:17 pm**

Maybe this link works?

https://www.nature.com/articles/s42256- ... izmodo.com

Posted: **Thu Jul 18, 2019 11:03 pm**

petero2 wrote: ↑Thu Jul 18, 2019 9:46 pm
Daniel Anulliero wrote: ↑Thu Jul 18, 2019 5:32 pm
Daniel Shawul wrote: ↑Thu Jul 18, 2019 5:24 pm Can it beat this guy though who solves 17x17x17 cube in about 2 hours ? https://www.youtube.com/watch?v=7ChuKKL2PpU
Fake , reversed video
Maybe the fake claim was a joke, but it is clear to me that the video is not reversed.

Yeah, it was a joke people kept posting on the video, as a "plot twist" (for people that just finished watching the whole thing and go to read the comments, making them doubt if what they saw was real), but the video is real.

Posted: **Fri Jul 19, 2019 4:12 am**

Thanks Peter.

I did not buy the paper, but from reading the first page:

- They use weighted A* search.
- They use something called "deep approximate value iteration" to train the heuristic function used by the A* algorithm.

In regular value iteration you use a big lookup table and compute the table values by iterating the Bellman equation. (In deterministic cases, like when computing tablebases, you don't even have to iterate more than once if you compute the values in the table in a suitable order.)

If the state space it too big (like for the 3x3x3 cube) you cannot store the whole table. Therefore they instead use "deep approximate value iteration",

Ok so they use value iteration instead of policy iteration. According to David Silver's RL lecture, it is kind of surprizing that they chose that. Wouldn't policy iteration or even actor-critic have been better? but maybe the fact that they are interested in the shortest path has a role in that. Anyway now that we have access to the full paper I will try to go through it and understand better.

Maybe the fake claim was a joke, but it is clear to me that the video is not reversed. It is not really algorithmically harder to solve a large cube than say a 5x5x5 or a 4x4x4 cube, even though it is practically much harder since it requires many more moves and each move is physically more challenging to perform.

The guy is also a 3 time rubik's cube world champ...
My thought when i pointed that video was that the 17x17x17 cube could be intractable problem for them.

Daniel

Posted: **Fri Jul 19, 2019 9:04 am**

petero2 wrote: ↑Thu Jul 18, 2019 9:46 pm
Daniel Shawul wrote: ↑Thu Jul 18, 2019 5:24 pm Can it beat this guy though who solves 17x17x17 cube in about 2 hours ? https://www.youtube.com/watch?v=7ChuKKL2PpU

I would like to understand what makes it a unique challenge. From what I understood from the abstract

- single goal state
- solved in reverse with root node being that single goal state (i.e. solved state)
- finds shortest path 60% of the time

Please add more if you read the full paper.
They want $32 for the full pdf, but you can read the first page here:

https://www.nature.com/articles/s42256- ... rer=nature

I did not buy the paper, but from reading the first page:

- They use weighted A* search.
- They use something called "deep approximate value iteration" to train the heuristic function used by the A* algorithm.

In regular value iteration you use a big lookup table and compute the table values by iterating the Bellman equation. (In deterministic cases, like when computing tablebases, you don't even have to iterate more than once if you compute the values in the table in a suitable order.)

If the state space it too big (like for the 3x3x3 cube) you cannot store the whole table. Therefore they instead use "deep approximate value iteration", which means they use a deep neural network as an approximation to the lookup table. The training attempts to minimize the mean squared residual from the Bellman equation. Presumably the mean is taken over the states encountered by starting from the goal position and performing random walks of progressively increasing length.

If you have a complete table you don't need to perform any search to find the solution (like for a tablebase), but since they only have an approximation they use weighted A* to find an actual solution.

Daniel Anulliero wrote: ↑Thu Jul 18, 2019 5:32 pm
Daniel Shawul wrote: ↑Thu Jul 18, 2019 5:24 pm Can it beat this guy though who solves 17x17x17 cube in about 2 hours ? https://www.youtube.com/watch?v=7ChuKKL2PpU
Fake , reversed video
Maybe the fake claim was a joke, but it is clear to me that the video is not reversed. It is not really algorithmically harder to solve a large cube than say a 5x5x5 or a 4x4x4 cube, even though it is practically much harder since it requires many more moves and each move is physically more challenging to perform.

If you take a solved cube and randomly shuffle it, then reverse the video, when you look at the video it would appear that you are making random moves that magically in the end causes the cube to be solved. In contrast, in this video the guy is gradually putting pieces into the right positions and also explains what he is doing while he is doing it.

It would be very interesting to know if their algorithm could actually be trained to learn how to solve a 17x17x17 cube. The state space is very large. Just the inner pieces can be arranged in (24!/4!^6)^56 ~= 10^868 ways. My guess is that their algorithm cannot solve 17x17x17 cubes, or else they probably would have mentioned larger cubes in the abstract.

What a neat idea. Wish I’ld thought of that. Simple and obvious (once somebody told you, of course). Although, does random stepping back from the start actually only work because Rubik cube has the property that any position is never more than N moves from solution, forget what N is, might be 17, or anyway some manageable number, so the backtracked random positions are going to form a subset of all possible positions, just as long as the backtracker goes to -N depth? If so, highly tractable for a net.
Like everybody, I only read page one. I guess no policy because move ordering for the solving algorithm can be found by a depth one sort on value.

Posted: **Sat Jul 20, 2019 2:18 pm**

Hello Peter:

petero2 wrote: ↑Thu Jul 18, 2019 9:46 pm[...]

It would be very interesting to know if their algorithm could actually be trained to learn how to solve a 17x17x17 cube. The state space is very large. Just the inner pieces can be arranged in (24!/4!^6)^56 ~= 10^868 ways. My guess is that their algorithm cannot solve 17x17x17 cubes, or else they probably would have mentioned larger cubes in the abstract.

I googled the state space of a n x n x n Rubik's cube and I found the following formula that works for n = {2, 3, 4, 5, 6, 7, 8} when compared with the values given in Wikipedia:

Number of Permutations to the Rubik's Cube and variations

Formula

TalkChess.com

Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube.

Re: Self-taught AI solves Rubik's cube.

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube

Re: Self-taught AI solves Rubik's cube.