Reinforcement-Learning

Homework and report for the Reinforcement Learning course at Sorbonne University, taught by Sylvain Lamprier.

We tested several algorithms on a range of environments. One example is the environment "LunarLanderContinuous-v2", in which the agent has to learn to land a spacecraft. The environment is modeled on the video game Lunar Lander. The following picture shows a screenshot of this environment.
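
To make the agent-environment interaction concrete, here is a minimal sketch of one episode loop. It is not the code used in this repository. It assumes the classic Gym interface, where `reset()` returns an observation and `step(action)` returns `(observation, reward, done, info)`. `StubLander`, `random_policy` and `run_episode` are hypothetical names introduced for illustration, and the stub only mimics the shapes of LunarLanderContinuous-v2 (8 observation components, 2 continuous actions in [-1, 1]) so that the example runs without Gym installed.

```python
import random


class StubLander:
    """Hypothetical stand-in exposing the classic Gym reset/step interface."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0] * 8  # LunarLander observations have 8 components

    def step(self, action):
        self.t += 1
        reward = -abs(action[0])  # made-up reward, for illustration only
        done = self.t >= self.horizon
        return [0.0] * 8, reward, done, {}


def random_policy(observation):
    """Sample the two continuous actions uniformly from [-1, 1]."""
    return [random.uniform(-1.0, 1.0), random.uniform(-1.0, 1.0)]


def run_episode(env, policy):
    """Run one episode and return (total reward, number of steps)."""
    observation = env.reset()
    total_reward, steps, done = 0.0, 0, False
    while not done:
        observation, reward, done, _info = env.step(policy(observation))
        total_reward += reward
        steps += 1
    return total_reward, steps


total_reward, steps = run_episode(StubLander(horizon=5), random_policy)
print(f"episode finished after {steps} steps, return {total_reward:.2f}")
```

With a Gym version that still uses the four-value `step` return, the stub could presumably be replaced by `gym.make("LunarLanderContinuous-v2")`. Newer Gym and Gymnasium releases return different tuples from `reset` and `step`, so the loop would need adjusting there.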

We tested the Soft Actor-Critic (SAC) agent on this environment and obtained the following reward curve:

The curve shows that the algorithm performs very well on this environment.
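
Per-episode returns are typically noisy, so reward curves are often smoothed before plotting. The sketch below shows one common way to do this, a trailing moving average. It is an assumption that a similar smoothing could be applied here, not a description of how the curve above was produced. The function name `moving_average`, the `window` parameter and the sample returns are hypothetical.

```python
def moving_average(returns, window=3):
    """Trailing moving average over episode returns.

    The first `window - 1` entries average over the episodes seen so far.
    """
    smoothed = []
    running = 0.0
    for i, value in enumerate(returns):
        running += value
        if i >= window:
            # Drop the return that just left the window.
            running -= returns[i - window]
        smoothed.append(running / min(i + 1, window))
    return smoothed


# Made-up episode returns, for illustration only.
episode_returns = [-200.0, -150.0, -50.0, 100.0]
print(moving_average(episode_returns, window=2))
```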

More details on this algorithm and the others can be found in the report.

Folders and files:

Curriculum Learning
Deep Learning for Reinforcement Learning
GAIL
GAN
Images
MADDPG
Normalizing_Flows
Tabular Q-Learning
UCB Algorithm
VAE
Value Iteration
README.md
Report_1.pdf
Report_2.pdf

alexanderbaumann99/Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement-Learning

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages