In this project I've used Breast Cancer Wisconsin (Diagnostic) Data Set from UCI machine learning repository to train different type of models in order to predict the likeliness of a person having breast cancer.
Attribute Information: (class attribute has been moved to last column) Attribute Domain
- Sample code number id number
- Clump Thickness 1 - 10
- Uniformity of Cell Size 1 - 10
- Uniformity of Cell Shape 1 - 10
- Marginal Adhesion 1 - 10
- Single Epithelial Cell Size 1 - 10
- Bare Nuclei 1 - 10
- Bland Chromatin 1 - 10
- Normal Nucleoli 1 - 10
- Mitoses 1 - 10
- Class: (2 for benign, 4 for malignant)
Algorithm | Test Accuracy |
---|---|
KNN | 0.9562 |
Support Vector Machine | 0.9635 |
Random Forest | 0.9489 |
XGBoost | 0.9562 |