MuZero is the successor to AlphaZero and learns to play games (and to handle other decision-making scenarios) without being given the rules of the game.
Until now, all that was publicly known was that it needs three neural networks instead of the single one used in AlphaZero.
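To make the "three networks" a bit more concrete: the MuZero paper calls them the representation, dynamics and prediction functions. The following is only a minimal sketch of how they fit together, assuming toy random linear maps in place of real trained networks. The class name `ToyMuZero`, the sizes `OBS`, `HIDDEN`, `ACTIONS` and the helper `unroll` are made up for illustration and are not taken from the linked code.

```python
import math
import random

# Illustrative sizes only (assumptions, not from any real implementation).
OBS, HIDDEN, ACTIONS = 6, 4, 3


def make_linear(n_in, n_out, rng):
    """Build a random weight matrix as a list of rows."""
    return [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]


def linear(weights, x):
    """Multiply a weight matrix by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def one_hot(index, size):
    return [1.0 if i == index else 0.0 for i in range(size)]


class ToyMuZero:
    """Three untrained toy functions standing in for MuZero's networks."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        # Representation: observation -> hidden state.
        self.h = make_linear(OBS, HIDDEN, rng)
        # Dynamics: (hidden state, action) -> (next hidden state, reward).
        self.g = make_linear(HIDDEN + ACTIONS, HIDDEN + 1, rng)
        # Prediction: hidden state -> (policy, value).
        self.f = make_linear(HIDDEN, ACTIONS + 1, rng)

    def represent(self, observation):
        return linear(self.h, observation)

    def dynamics(self, state, action):
        out = linear(self.g, state + one_hot(action, ACTIONS))
        return out[:HIDDEN], out[HIDDEN]

    def predict(self, state):
        out = linear(self.f, state)
        logits, value = out[:ACTIONS], out[ACTIONS]
        # Softmax over the policy logits.
        peak = max(logits)
        exps = [math.exp(l - peak) for l in logits]
        total = sum(exps)
        return [e / total for e in exps], value


def unroll(model, observation, actions):
    """Imagine a sequence of actions using only the learned functions."""
    state = model.represent(observation)
    steps = []
    for action in actions:
        policy, value = model.predict(state)
        state, reward = model.dynamics(state, action)
        steps.append((policy, value, reward))
    return steps


steps = unroll(ToyMuZero(), [0.5] * OBS, [0, 2, 1])
```

The point of the sketch is that `unroll` never calls a game engine: after the first observation is encoded, every further step happens inside the hidden state, which is why the real rules are not needed during planning.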
This site now offers new and very impressive information, with pseudocode and a lot of Python code, on how to build this engine:
MuZero, first impression how to code it
- Posts: 59
- Joined: Fri Oct 25, 2019 7:58 pm
- Full name: Michael Taktikos
_____________________
https://github.com/mtaktikos?tab=repositories