Need help with ACER?
Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

About the developer

Kaixhin
215 Stars 39 Forks MIT License 99 Commits 0 Opened issues

Description

Actor-critic with experience replay

Services available

!
?

Need anything else?

Contributors list

# 14,590
vnc
Docker
Python
pytorch
71 commits
# 27,978
Python
c-plus-...
theano
Tensorf...
15 commits
# 86,055
pytorch...
pytorch
actor-c...
Shell
4 commits
# 385,376
Lua
Deep le...
Python
Shell
1 commit
# 426,854
Python
deep-re...
Deep le...
Jupyter...
1 commit

ACER

MIT License

Actor-critic with experience replay (ACER) [1]. Uses batch off-policy updates to improve stability. Trust region updates can be enabled with

--trust-region
. Currently uses full trust region instead of "efficient" trust region (see issue #1).

Run with

python main.py 
. To run asynchronous advantage actor-critic (A3C) [2] (but with a Q-value head), use the
--on-policy
option.

Requirements

To install all dependencies with Anaconda run

conda env create -f environment.yml
and use
source activate acer
to activate the environment.

Results

ACER

Acknowledgements

References

[1] Sample Efficient Actor-Critic with Experience Replay
[2] Asynchronous Methods for Deep Reinforcement Learning

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.