OpenAI Baselines - High-quality implementations of reinforcement learning algorithms.
pytorch-a2c-ppo-acktr-gail - PyTorch implementations of Advantage Actor-Critic (A2C), Proximal Policy Optimization (PPO), Actor-Critic using Kronecker-Factored Trust Region (ACKTR), and Generative Adversarial Imitation Learning (GAIL).
pytorch-rl - Model-free deep reinforcement learning algorithms implemented in PyTorch.
reaver - A modular deep reinforcement learning framework focused on StarCraft II-based tasks.
RLgraph - Modular computation graphs for deep reinforcement learning.
RLkit - Reinforcement learning framework and algorithms implemented in PyTorch.