a3c-tensorflow

This repo contains python code for replicating the asynchronous advantage actor-critic algorithm as described in https://arxiv.org/pdf/1602.01783.pdf

Requirements

tensorflow
scipy
gym (Atari)
skimage

Training

For training a3c algorithm in BreakoutDeterministic-v3 using 8 parallel actor learner threads execute the following command:

python a3c.py --game BreakoutDeterministic-v3 --num_concurrent 8

Testing

For testing a trained a3c agent execute the folowing command

python a3c.py  --game BreakoutDeterministic-v3 --checkpoint_path path_to_checkpoint --testing True

Results

Below you can find 2 plots of training a3c in Breakout and Pong

Code and Algorithm explanation

Full explanation can be found here: https://papoudakis.github.io/announcements/policy_gradient_a3c/

Resources

https://github.com/miyosuda/async_deep_reinforce

https://github.com/coreylynch/async-rl

https://github.com/muupan/async-rl

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
README.md		README.md
a3c.py		a3c.py
a3c_network.py		a3c_network.py
environment.py		environment.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

a3c-tensorflow

Requirements

Training

Testing

Results

Code and Algorithm explanation

Resources

About

Releases

Packages

Languages

papoudakis/a3c-tensorflow

Folders and files

Latest commit

History

Repository files navigation

a3c-tensorflow

Requirements

Training

Testing

Results

Code and Algorithm explanation

Resources

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages