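To enable multi-task reinforcement learning, IMPALA aggregates the trajectories of multiple actors at a learner and proposes V-trace, a learning method that absorbs the policy lag between the learner and the actors. For the first time, Atari-scale multi-task learning with a single agent ...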
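
As a rough illustration of the correction the comment refers to, here is a minimal sketch of V-trace value targets for a single trajectory. It assumes the standard formulation with truncated importance ratios: actors act under a behaviour policy that may lag behind the learner's target policy, and the learner clips the ratio of the two before applying it to the temporal-difference terms. The function name `vtrace_targets`, its argument names, and the default thresholds are hypothetical choices for this example, not the paper's reference implementation.

```python
import math


def vtrace_targets(behaviour_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Return V-trace value targets for one trajectory of length T.

    behaviour_logp[t]: log-probability of the taken action under the actor's policy
    target_logp[t]:    log-probability of the same action under the learner's policy
    values[t]:         learner's value estimate for state t
    bootstrap_value:   learner's value estimate for the state after the last step
    """
    T = len(rewards)
    # Importance ratios (learner policy / actor policy), computed from log-probs.
    ratios = [math.exp(t - b) for t, b in zip(target_logp, behaviour_logp)]
    rhos = [min(rho_bar, r) for r in ratios]  # clipped weight on each TD term
    cs = [min(c_bar, r) for r in ratios]      # clipped weight on the trace
    next_values = list(values[1:]) + [bootstrap_value]
    deltas = [rhos[t] * (rewards[t] + gamma * next_values[t] - values[t])
              for t in range(T)]
    # Backward recursion: (v_t - V_t) = delta_t + gamma * c_t * (v_{t+1} - V_{t+1}).
    acc = 0.0
    targets = [0.0] * T
    for t in reversed(range(T)):
        acc = deltas[t] + gamma * cs[t] * acc
        targets[t] = values[t] + acc
    return targets
```

When the two policies agree, every ratio is 1 and the targets reduce to ordinary n-step returns. When the learner's policy assigns almost no probability to the actor's actions, the clipped weights shrink toward 0 and the targets fall back to the current value estimates.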

fromTw

Bookmarked by elu_18, 2018/02/07 12:23



IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

arxiv.org (https://arxiv.org/abs/1802.01561) · 2018/02/07

In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and e...

4 bookmarks · 1 comment
