by keon

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

131 Stars 40 Forks Last release: Not found MIT License 8 Commits 0 Releases

Available items

No Items, yet!

The developer of this repository has not created any items for sale yet. Need a bug fixed? Help with integration? A different license? Create a request here:

Policy Gradient

Minimal implementation of Stochastic Policy Gradient Algorithm in Keras

Pong Agent


This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.


We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.