[B! openAi][reinforcementLerning] manboubirdのブックマーク

manboubird id:manboubird

openAiとreinforcementLerningに関するmanboubirdのブックマーク (2)

Tech-Circle #18 Pythonではじめる強化学習 OpenAI Gym 体験ハンズオン
The document discusses applying theory of mind, or the ability to infer the intentions of other agents, to multi-agent reinforcement learning. It introduces three papers that use Bayesian reasoning to model other agents in the Hanabi game. Specifically, the papers develop methods for Bayesian action decoding and simplified action decoding to enable agents to reason about each other's intentions du
manboubird 2017/02/05
reinforcementLerning

deepLearning

slide

openAi
リンク
Gym
Gym is a standard API for reinforcement learning, and a diverse collection of reference environments# The Gym interface is simple, pythonic, and capable of representing general RL probl ems: import gym env = gym.make("LunarLander-v2", render_mode="human") observation, info = env.reset(seed=42) for _ in range(1000): action = policy(observation) # User-defined policy function observation, reward, ter
manboubird 2016/04/30
openAi

oss

gym

reinforcementLerning

deepLearning

python
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx