Master-Thesis

Repository for the code of my Master's Thesis @ University of Pisa

Abstract

In recent years, Reinforcement Learning has achieved astonishing results by exploiting huge and complex deep architectures. However, this has come at the cost of unsustainable computational effort. A characteristic shared by all state-of-the-art approaches, and by the majority of Machine Learning algorithms, is that the agent’s network learns to solve the task “from scratch”, that is, from a randomized initialization, reusing previously learned skills either not at all or only to a very limited extent. To address the problem of transfer and reuse, we propose a new approach called Skilled Deep Q-Learning, which leverages pre-trained unsupervised skills as the agent’s prior knowledge. In the first part of the work, we discuss the implementation of this approach, compare its performance on the Atari suite, and investigate how the agent uses these skills. In the second part, we focus on Continual Reinforcement Learning scenarios, attempting to extend the proposed approach to a setting where the Reinforcement Learning agent learns more than one game simultaneously. Finally, we present various research paths that can be explored to further develop, understand, and improve the proposed approach.

Code Structure

The repository contains the PyTorch code for all the models. Each folder is named after the architecture it implements:

frame_synthesis: Liu, Ziwei, et al. "Video frame synthesis using deep voxel flow." Proceedings of the IEEE international conference on computer vision. 2017.
keypoints_transporter: Kulkarni, Tejas D., et al. "Unsupervised learning of object keypoints for perception and control." Advances in neural information processing systems 32 (2019).
progressive_nn: Rusu, Andrei A., et al. "Progressive neural networks." arXiv preprint arXiv:1606.04671 (2016).
state_representation: Anand, Ankesh, et al. "Unsupervised state representation learning in Atari." Advances in neural information processing systems 32 (2019).
video_object_segmentation: Goel, Vikash, Jameson Weng, and Pascal Poupart. "Unsupervised video object segmentation for deep reinforcement learning." Advances in neural information processing systems 31 (2018).
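
To make the idea of reusing pre-trained skills more concrete, the following is a minimal PyTorch sketch of how frozen skill encoders might feed a Q-value head. It is an illustration under stated assumptions, not the architecture used in the thesis: the class `SkilledQNetwork`, the helper `make_dummy_skill`, the feature sizes, and the choice to simply concatenate skill features are all hypothetical, and the dummy encoders only stand in for models such as those in the folders listed above.

```python
import torch
import torch.nn as nn


class SkilledQNetwork(nn.Module):
    """Hypothetical Q-network that consumes features from frozen skill encoders."""

    def __init__(self, skills, skill_dims, n_actions, hidden=256):
        super().__init__()
        self.skills = nn.ModuleList(skills)
        # Assumption: the pre-trained skills stay frozen, only the Q-head is trained.
        for p in self.skills.parameters():
            p.requires_grad_(False)
        self.head = nn.Sequential(
            nn.Linear(sum(skill_dims), hidden),
            nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, frames):
        # Extract one flat feature vector per skill without tracking gradients.
        with torch.no_grad():
            feats = [skill(frames).flatten(start_dim=1) for skill in self.skills]
        # Assumption: skill features are combined by plain concatenation.
        return self.head(torch.cat(feats, dim=1))


def make_dummy_skill(out_dim):
    """Stand-in for a pre-trained encoder; maps 4 stacked frames to out_dim features."""
    return nn.Sequential(
        nn.Conv2d(4, 8, kernel_size=8, stride=4),
        nn.ReLU(),
        nn.AdaptiveAvgPool2d(1),
        nn.Flatten(),
        nn.Linear(8, out_dim),
    )


skill_dims = [16, 32]
net = SkilledQNetwork(
    [make_dummy_skill(d) for d in skill_dims], skill_dims, n_actions=6
)
# Batch of 2 observations, each a stack of 4 grayscale 84x84 frames.
q_values = net(torch.zeros(2, 4, 84, 84))
```

Here `q_values` has one row per observation and one column per action. Because the skill parameters have `requires_grad` disabled, an optimizer built from `net.head.parameters()` would update only the Q-head.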

Top-level folders and files:
frame_synthesis
keypoints_transporter
progressive_nn
rom
state_representation
th-env
video_object_segmentation
.gitignore
.gitmodules
README.md
test_gym.py

EliaPiccoli/Master-Thesis

Folders and files

Latest commit

History

Repository files navigation

Master-Thesis

Repository for the code of my Master's Thesis @ University of Pisa

Abstract

Code Structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages