Skip to content

Latest commit

 

History

History
30 lines (19 loc) · 1.25 KB

README.md

File metadata and controls

30 lines (19 loc) · 1.25 KB

Self-Driving Racecar with Proximal Policy Optimization

Solving the OpenAI Gym CarRacing-v0 environment using Proximal Policy Optimization.

Read the full report.

Demo

Video Demo

See the full video demo on YouTube.

Results

After 5000 training steps, the agent achieves a mean score of 909.48±10.30 over 100 episodes. To reproduce the results, run the following commands:

mkdir logs
python demo.py --ckpt extra/final_weights.pt --delay_ms 0

Results from episodes will be saved to logs/episode_rewards.csv.

Implementation Details