random-forest-classifier-diabetes

This project uses a random forest classifier to predict diabetes from CDC health survey data (the National Health and Nutrition Examination Survey). The data is preprocessed to handle missing values, and variables are named and grouped to look for correlations. Several models, including Random Forest and XGBoost, are trained and tuned to find the best predictive performance.
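As a rough illustration of the workflow described above (fill in missing values, fit a random forest, tune its hyperparameters), here is a minimal sketch using scikit-learn. It is not the notebook's code: the data is synthetic and generated with `make_classification`, and the median imputation, the parameter grid, and the accuracy metric are all assumptions chosen for the example. XGBoost is left out to keep the sketch short.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline

# Synthetic stand-in for the survey features (assumption: NOT the CDC data).
X, y = make_classification(
    n_samples=400, n_features=10, n_informative=5, random_state=0
)

# Blank out roughly 10% of the values to mimic unanswered survey questions.
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.1] = np.nan

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Imputation lives inside the pipeline so each cross-validation fold
# computes its medians from its own training split only.
pipeline = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("forest", RandomForestClassifier(random_state=0)),
])

# Small, hypothetical grid; a real search would likely cover more values.
param_grid = {
    "forest__n_estimators": [50, 100],
    "forest__max_depth": [4, None],
}
search = GridSearchCV(pipeline, param_grid, cv=3, scoring="accuracy")
search.fit(X_train, y_train)

test_accuracy = accuracy_score(y_test, search.predict(X_test))
print("best params:", search.best_params_)
print("held-out accuracy:", round(test_accuracy, 3))
```

With real survey data, accuracy alone can be misleading when diabetes cases are a small share of the rows, so a metric such as recall or ROC AUC may be a better tuning target.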

Note: This project was developed in Google Colab.

The dataset used: https://www.kaggle.com/datasets/cdc/national-health-and-nutrition-examination-survey

Credit to Toby Anderson for help with data preprocessing: https://www.kaggle.com/code/tobyanderson/health-survey-analysis

LittleOutfox/random-forest-classifier-diabetes
