woithook / A2C-Pytorch-implementations Public

Notifications You must be signed in to change notification settings
Fork 1
Star 1

Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.

1 star 1 fork Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
alg		alg
model		model
README.md		README.md
utils.py		utils.py

Repository files navigation

A2C-Pytorch-implementations

Basic knowledge on A2C algorithm:
Understanding Actor Critic Methods and A2C

Requriement

Python 3.8
Pytorch 1.7.0
OpenAI gym 0.17.3

Navigation

alg/ - Implementation of A2C and PG.
model/ - Neural networks using for RL.
utils.py - Command utility methods to process envs, hand-craft optimizers and so on.

About

Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%