Recently, Google DeepMind announced AlphaGo Zero, an extraordinary achievement that shows it is possible to train an agent to a superhuman level in the highly complex and challenging domain of Go ‘tabula rasa’: that is, from a blank slate, with no human expert play used as training data.
It thrashed the previous incarnation 100–0, using only 4 TPUs instead of 48 and a single neural network instead of two.