Acquisition of chess knowledge in AlphaZero

Thomas McGrath; Andrei Kapishnikov; Nenad Tomašev; Adam Pearce; Martin Wattenberg; Demis Hassabis; Been Kim; Ulrich Paquet; Vladimir Kramnik

doi:10.1073/pnas.2206625119

Acquisition of chess knowledge in AlphaZero

Proc Natl Acad Sci U S A. 2022 Nov 22;119(47):e2206625119. doi: 10.1073/pnas.2206625119. Epub 2022 Nov 14.

Authors

Thomas McGrath¹, Andrei Kapishnikov², Nenad Tomašev¹, Adam Pearce², Martin Wattenberg^{2

3}, Demis Hassabis¹, Been Kim⁴, Ulrich Paquet¹, Vladimir Kramnik⁵

Affiliations

¹ DeepMind, London, United Kingdom.
² Google Brain, Mountain View, CA 94043.
³ School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02134.
⁴ Google Research, Mountain View, CA 94043.
⁵ World Chess Champion, 2000-2007.

Abstract

We analyze the knowledge acquired by AlphaZero, a neural network engine that learns chess solely by playing against itself yet becomes capable of outperforming human chess players. Although the system trains without access to human games or guidance, it appears to learn concepts analogous to those used by human chess players. We provide two lines of evidence. Linear probes applied to AlphaZero's internal state enable us to quantify when and where such concepts are represented in the network. We also describe a behavioral analysis of opening play, including qualitative commentary by a former world chess champion.

Keywords: artificial intelligence; deep learning; interpretability; machine learning; reinforcement learning.

MeSH terms

Humans
Learning
Neural Networks, Computer*
Recreation*