The document discusses applying theory of mind, the ability to infer the intentions of other agents, to multi-agent reinforcement learning. It introduces three papers that use Bayesian reasoning to model other agents in the card game Hanabi. Specifically, the papers develop methods for Bayesian action decoding and simplified action decoding that enable agents to reason about each other's intentions du…