[B! ai-class] mnruのブックマーク

申し訳ございません．お探しのページが見つかりませんでした．お探しのページは，移動もしくは削除された可能性があります． Sorry．The page you're looking for can't be found． The page you're looking for have been moved or deleted．村田研究室のWebサイトへようこそ！〒169-8555　東京都新宿区大久保 3-4-1　63号館6F-18 早稲田大学先進理工学研究科電気・情報生命専攻村田昇研究室 Em ail: noboru.murata[at]eb.waseda.ac.jp

mnru 2011/11/16

ai-class

リンク

Temporal difference learning - Wikipedia

Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods.[1] While Monte Carlo methods only adjust their estimates once the final ou

mnru 2011/11/16

ai-class

リンク

http://mikilab.doshisha.ac.jp/dia/research/person/suyara/RL/TD-Learning/

mnru 2011/11/16

ai-class

リンク

強化学習1

申し訳ございません．お探しのページが見つかりませんでした．お探しのページは，移動もしくは削除された可能性があります． Sorry．The page you're looking for can't be found． The page you're looking for have been moved or deleted．村田研究室のWebサイトへようこそ！〒169-8555　東京都新宿区大久保 3-4-1　63号館6F-18 早稲田大学先進理工学研究科電気・情報生命専攻村田昇研究室 Em ail: noboru.murata[at]eb.waseda.ac.jp

mnru 2011/11/16

ai-class

リンク

Reinforcement learning - Wikipedia

Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions in a dynamic environment in order to maximize a reward signal. Reinforcement learning is one of the three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Q-learning at its simplest stores data in tabl

mnru 2011/11/16

ai-class

リンク

PEAS - Wikipedia, the free encyclopedia

mnru 2011/11/15

ai-class

リンク

imHo

http://www.ml-class.org/ ■非線形仮説なぜ新しいアルゴリズムが必要か？ロジスティック回帰だと、特徴点の２乗、３乗を使おうとすると、特徴が多いと組み合わせが爆発するシグモイド関数 ■ニューロンと脳 ■モデル表現 I j段目のネットワークがsj個のユニット、j+1段目がs{j+1}だとすると、ウェイトΘ行列はs{j+1}×(sj + 1)次元になる。 ■モデル表現 II レイヤーが１段だけと考えると、ロジスティック回帰と同じ！ ■例と直感I ニューラルネットワークで論理演算(AND, OR)を組み立てられる。 ■例と直感II NOT, XNOR レイヤーを重ねると複雑な計算が表現できる。 ■多クラス分類１対多を使う最後のアウトプットがクラスの数で、一番大きなものがあてはまると考える。 ■プログラム演習手書きのアラビア数字の認識。特徴は、20x20のピクセル

mnru 2011/11/08

リンク

Artificial Intelligence: A Modern Approach, 4th US ed.

Artificial Intelligence: A Modern Approach, 4th US ed. by Stuart Russell and Peter Norvig The authoritative, most-used AI textbook, adopted by over 1500 schools. Table of Contents for the US Edition (or see the Global Edition) Preface (pdf); Contents with subsections I Artificial Intelligence 1 Introduction ... 1 2 Intelligent Agents ... 36 II Probl em-solving 3 Solving Probl ems by Searching ... 63

mnru 2011/10/12

ai-class

リンク

はてなブックマーク

タグ

関連タグで絞り込む (1)

ai-classに関するmnruのブックマーク (9)

お知らせ

今週のはてなブックマーク数ランキング（2024年11月第2週）

今週のはてなブックマーク数ランキング（2024年11月第1週）

月間はてなブックマーク数ランキング（2024年10月）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス