web.stanford.edu[B!]新着記事・評価 - はてなブックマーク

『web.stanford.edu』

Simple Alpha Zero
7 users
web.stanford.edu/~surag

This tutorial walks through a synchronous single-thread single-GPU (read malnourished) game-agnostic implementation of the recent AlphaGo Zero paper by DeepMind. It's a beautiful piece of work that trains an agent for the game of Go through pure self-play without any human knowledge except the rules of the game. The methods are fairly simple compared to previous papers by DeepMind, and AlphaGo Zer
- テクノロジー
- 2018/01/25 15:56

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx