Skip to content

Alzheimer's disease detection using XGBoost and other prediction models.

Notifications You must be signed in to change notification settings

simhayn/Binary-Classification

Repository files navigation

Doctor, is he sick? does he have Alzheimer's disease? 😯🩺

Let's see..
well, this algorithm can detect if the patient is sick with 96.6% accuracy
The most relevant features for this xgboost model are as follows:

  1. Functional Assessment (ranging from 0 to 10. Lower scores indicate greater impairment)
  2. ADL (Activities of Daily Living score, ranging from 0 to 10. Lower scores indicate greater impairment)
  3. MMSE (Mini-Mental State Examination score, ranging from 0 to 30. Lower scores indicate cognitive impairment)
  4. Memory Complaints
  5. Behavioral Problems

And there are some more (not so much contributing) features:

  1. Cholesterol HDL
  2. Diet Quality
  3. Sleep Quality
  4. Alcohol Consumption
  5. Cholesterol Triglycerides

the model was fit to patients who had normal levels of cholesterol and blood pressure, in the ages of 60-90 years old.

Binary Classification- Alzheimers

In this project I explored the 🧠 Alzheimers disease dataset downloaded from Kaggle.com

Main steps:

  • Exploratory Data Analysis (EDA): Visualize features' relationships and distributions. Check some statistics.
  • Data Preprocessing: Scale the cumulative features. Covert categorical features to binary.
  • Model Training: Train some models on the preprocessed data (Models: SVM, Logistic Regression, Random Forest, GBoost, Naive Bayes, XGBoost with Random searchCV, Neural Networks). Select 10 features with higher importance to improve the models.
  • Model Evaluation: Evaluate the models' performances. Plot the Neural Networks' validation graph.

Hope you enjoy this project!

There are more projects of binary classification for healthcare datasets in this repo. feel free to check them out!