We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over the coming months; today’s release includes DQN and three of its variants. Reinforcement learning results are tricky to reproduce: performance is very noisy, and algorithms have many moving parts that allow for subtle