Dueling Network Architectures for Deep Reinforcement Learning

テクノロジーカテゴリーの変更を依頼記事元:

arxiv.org

7 usersがブックマークコメント

記事へのコメント3件

注目コメント
新着コメント

prototechno #cvsaisentan

2018/02/04 リンク

rishida The dueling architecture of (Wang et al., 2015) has been shown to produce more accurate estimates of Q-values by including separate streams for the state value and advantage in the network.

2016/07/27 リンク

elu_18 深層強化学習で大きな改善，1)状態行動価値関数Q(s, a)をV(s)+A(s, a)に分解し，行動に依存しない推定をつける。2) TDエラーが大きいのから優先度付きサンプリング https://t.co/OjoKnzzth2 https://t.co/U2AKuvHiOb

fromTw

2015/11/27 リンク

注目コメント算出アルゴリズムの一部にLINEヤフー株式会社の「建設的コメント順位付けモデルAPI」を使用しています

規約違反を報告

Dueling Network Architectures for Deep Reinforcement Learning

In recent years there have been many successes of using deep representations in reinforcement lea... In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning. Our dueling network represents two separate estimators: one for the state