The Titanic project

Introduction

Kaggle's Titanic: Machine Learning from Disaster is one of the most popular beginner-friendly projects for learning data science and machine learning. The goal of the project is to predict whether a passenger survived the Titanic shipwreck based on their characteristics, such as age, gender, ticket class, and more.

The raw pages from Kaggle are here.

Implementation

The training process begins with data cleaning and preprocessing, which involves converting strings and filling in missing data. Next, feature extraction is performed on certain variables. After that, the Random Forest algorithm is used to train the model, and the prediction accuracy is evaluated while a confusion matrix is plotted. Finally, the trained model is applied to the test set to output the final results.

Usage

The project can be an excellent experience to embark on the first machine learning project.

Please ensure you have installed several packages below:

pip install pandas
pip install numpy
pip install sklearn

After the installation, you can train the model using commands below:

git clone https://github.com/xiyuanyang-code/Titanic.git
cd Titanic
python process_data.py
python Training.py

Moreover, I suggest to run the Trainingprocess.ipynb in the Visual Studio Code, where you can run every cell independently to implement each function accordingly.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
Training.py		Training.py
Trainingprocess.ipynb		Trainingprocess.ipynb
gender_submission.csv		gender_submission.csv
process_data.py		process_data.py
submission.csv		submission.csv
test.csv		test.csv
test_data.json		test_data.json
train.csv		train.csv
train_data.json		train_data.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Titanic project

Introduction

Table of Contents

Explanations

Implementation

Usage

Advertisement

About

Releases

Packages

Languages

xiyuanyang-code/Titanic

Folders and files

Latest commit

History

Repository files navigation

The Titanic project

Introduction

Table of Contents

Explanations

Implementation

Usage

Advertisement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages