State–action–reward–state–action (SARSA) is an algorithm for learning a Markov decision process policy, used in the reinforcement learning area of machine learning. It was proposed by Rummery and Niranjan in a technical note[1] under the name "Modified Connectionist Q-Learning" (MCQ-L). The alternative name SARSA, proposed by Rich Sutton, was mentioned only as a footnote. This name reflects the fact that the function for updating the Q-value depends on the quintuple (S, A, R, S', A'): the current state, the action chosen there, the reward received, the resulting state, and the action chosen in that resulting state.
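As a rough illustration of the update the quintuple names, the standard SARSA rule Q(S, A) ← Q(S, A) + α[R + γ·Q(S', A') − Q(S, A)] can be sketched in Python as follows. The function name, the dictionary-based Q-table, and the default values of the learning rate α and discount factor γ are illustrative choices, not part of the original formulation:

```python
from collections import defaultdict

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.99):
    """Apply one SARSA update to the Q-table, in place.

    Uses the quintuple (S, A, R, S', A'): unlike Q-learning, the
    bootstrap term is the value of the action actually chosen next,
    which makes SARSA an on-policy method.
    """
    td_target = r + gamma * Q[(s_next, a_next)]   # one-step return estimate
    Q[(s, a)] += alpha * (td_target - Q[(s, a)])  # move Q toward the target
    return Q

# Example: starting from an all-zero table, a reward of 1.0 moves
# Q(s, a) by alpha toward the target.
Q = defaultdict(float)
sarsa_update(Q, s=0, a=0, r=1.0, s_next=1, a_next=0)
print(Q[(0, 0)])  # 0.1
```

In a full agent this update would run once per step, with both actions typically drawn from an ε-greedy policy over Q.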