Proximal Policy Optimization

テクノロジーカテゴリーの変更を依頼記事元:

openai.com

6 usersがブックマークコメント

コメント

1

記事へのコメント1件

注目コメント
新着コメント

elu_18 https://t.co/1LvFjyibUV

fromTw

2017/07/22 リンク

注目コメント算出アルゴリズムの一部にLINEヤフー株式会社の「建設的コメント順位付けモデルAPI」を使用しています

規約違反を報告

アプリのスクリーンショット

いまの話題をアプリでチェック！

バナー広告なし
ミュート機能あり
ダークモード搭載

アプリをダウンロード

関連記事

Proximal Policy Optimization

PPO lets us train AI policies in challenging environments, like the Roboschool one shown above wh... PPO lets us train AI policies in challenging environments, like the Roboschool one shown above where an agent tries to reach a target (the pink sphere), learning to walk, run, turn, use its momentum to recover from minor hits, and how to stand up from the ground when it is knocked over. Policy gradient methods are fundamental to recent breakthroughs in using deep neural networks for control, from

機械学習

ブックマークしたユーザー

Gln2023/03/08
elu_182017/07/22
ma__ko__to2017/07/21
lanius2017/07/21

同じサイトの新着

同じサイトの新着をもっと読む

いま人気の記事

いま人気の記事をもっと読む

いま人気の記事 - テクノロジー

いま人気の記事 - テクノロジーをもっと読む

新着記事 - テクノロジー

新着記事 - テクノロジーをもっと読む

設定を変更しましたx