Diabetes Analysis and Prediction

I carried out analysis and built a predictive model for diabetes diagnoses using the dataset which is originally from the National Institute of Diabetes and Digestive and Kidney Diseases.

Problem

The World Health Organisation stated that about 422 million people worldwide have diabetes, the majority living in low-and middle-income countries, and 1.5 million deaths are directly attributed to diabetes each year. This implies that the fight against diabetes should constitute a global effort. See https://www.who.int/health-topics/diabetes#tab=tab_1

Cause of Diabetes

Diabetes occurs when blood glucose or blood sugar decomes excess in the body. Insulin, a hormone made by the pancreas, helps glucose from food get into your cells to be used for energy.

Diabetes. In Wikipedia. https://en.wikipedia.org/wiki/Diabetes

Approach

Predicted the onset of diabetes based on diagnostic measures. Made use of Machine learning models and Exploratory Data analysis.

Dataset

The dataset used in this pipeline is the Pima Indian Diabetes.

This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.

Dataset features

The datasets consist of several medical predictor (independent) variables and one target (dependent) variable, Outcome. This include:

Pregnancies: Number of times pregnant
Glucose: Plasma glucose concentration a 2 hours in an oral glucose tolerance test
BloodPressure: Diastolic blood pressure (mm Hg)
SkinThickness: Triceps skin fold thickness (mm)
Insulin: 2-Hour serum insulin (mu U/ml)
BMI: Body mass index (weight in kg/(height in m)^2)
DiabetesPedigreeFunction: Diabetes pedigree function
Age: Age (years)
Outcome: Class variable (0 or 1) 268 of 768 are 1, the others are 0

Dataset Link: https://www.kaggle.com/code/javagarm/a-complete-ml-pipeline-tutorial-acu-86/data

credit: Smith, J.W., Everhart, J.E., Dickson, W.C., Knowler, W.C., & Johannes, R.S. (1988). Using the ADAP learning algorithm to forecast the onset of diabetes mellitus. In Proceedings of the Symposium on Computer Applications and Medical Care (pp. 261--265). IEEE Computer Society Press.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
images		images
README.md		README.md
cheatsheet-matplotlib-charts.ipynb		cheatsheet-matplotlib-charts.ipynb
diabetes-app.py		diabetes-app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diabetes Analysis and Prediction

Problem

Cause of Diabetes

Approach

Dataset

Dataset features

About

Releases

Packages

Languages

inuwamobarak/Diabetes-Prediction-App

Folders and files

Latest commit

History

Repository files navigation

Diabetes Analysis and Prediction

Problem

Cause of Diabetes

Approach

Dataset

Dataset features

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages