fubar_fooのブックマーク - はてなブックマーク

fubar_foo id:fubar_foo

ブックマーク / www.scholarpedia.org (1)

Policy gradient methods - Scholarpedia
Policy gradient methods are a type of reinforcement learning techniques that rely upon optimizing parametrized policies with respect to the expected return (long-term cumulative reward) by gradient descent. They do not suffer from many of the probl ems that have been marring traditional reinforcement learning approaches such as the lack of guarantees of a value function, the intractability probl em
fubar_foo 2016/05/23
machine learning

reinforcement learning
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx