OpenAI Baselines is a set of high-quality implementations of reinforcement learning algorithms that're intended to benefit the artificial intelligence research community. They make it easier for researchers to replicate, refine, and identify new ideas that can be explored and developed further.
The OpenAI Baselines were open-sourced by OpenAI in May, 2017. The initial release included DQN -- a reinforcement learning algorithm that combines Q-Learning and deep neural networks to let reinforcement learning work for complex, high-dimensional environments such as video games and robotics -- as well as three of DQN's variants.
OpenAI's stated goal in open-sourcing the OpenAI Baselines is to ensure that apparent reinforcement learning advances are never due to comparisons with buggy or untuned versions of existing algorithms. This is important because reinforcement learning algorithms are highly complex, making results tricky to reproduce. Providing the entire research community with known-good implementations and best practices for creating them is intended to save time implementing pre-existing algorithms so that more time can be spent designing new ones.
OpenAI Baselines: DQN
Syzmon Sidor & John Schulman
Documentaries, videos and podcasts
Walker2d training with OpenAI baselines
July 31st, 2017
- Artificial intelligence (AI)Artificial intelligence (AI) is intelligence exhibited by machines.
- OpenAIOpenAI is a non-profit AI research company, discovering and enacting the path to safe artificial general intelligence.
- Python (programming language)A general purpose high-level programming language
- Deep learningBranch of machine learning based on learning data representations.
- DeepMindDeepMind is an artificial intelligence company creating programs which use deep neural networks to teach themselves how to play a variety of games like Go and chess.
- Show More