Early-Stage-Diabetes-Risk-Prediction

This project uses supervised machine learning techniques to predict the likelihood of early-stage diabetes in patients based on a set of clinical symptoms and demographic data.

Dataset

The dataset is prepared by encoding categorical features, examined through exploratory data analysis (EDA), and reduced through feature selection. Key features are selected using chi-squared scoring and variance thresholding.
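As a rough illustration of the encoding and feature selection steps described above, the sketch below uses scikit-learn's `VarianceThreshold` and `SelectKBest` with `chi2`. The data is synthetic and the column names (`symptom_a`, `symptom_b`, `symptom_c`, `gender`, `age`), the Yes/No encoding, the threshold, and `k=2` are all assumptions made for the example; they are not taken from the project's dataset or script.

```python
import numpy as np
import pandas as pd
from sklearn.feature_selection import SelectKBest, VarianceThreshold, chi2

# Hypothetical stand-in data: the real columns and values are not shown in this README.
rng = np.random.default_rng(0)
n = 200
target = rng.integers(0, 2, size=n)
# symptom_a follows the target, with roughly 10% of the labels flipped as noise.
noisy = np.where(rng.random(n) < 0.1, 1 - target, target)
df = pd.DataFrame({
    "symptom_a": np.where(noisy == 1, "Yes", "No"),
    "symptom_b": rng.choice(["Yes", "No"], size=n),   # unrelated to the target
    "symptom_c": ["No"] * n,                          # constant, carries no information
    "gender": rng.choice(["Male", "Female"], size=n),
    "age": rng.integers(20, 80, size=n),
})

# Feature encoding: map the categorical answers to 0/1 (assumed encoding).
encoded = df.copy()
for col in ["symptom_a", "symptom_b", "symptom_c"]:
    encoded[col] = encoded[col].map({"No": 0, "Yes": 1})
encoded["gender"] = encoded["gender"].map({"Female": 0, "Male": 1})

# Variance thresholding: drop features whose variance is zero (constant columns).
variance_filter = VarianceThreshold(threshold=0.0)
variance_filter.fit(encoded)
kept = list(encoded.columns[variance_filter.get_support()])
X = encoded[kept]

# Chi-squared scoring: keep the k features with the highest scores.
# chi2 requires non-negative feature values, which holds for 0/1 flags and ages.
selector = SelectKBest(score_func=chi2, k=2)
selector.fit(X, target)
selected = list(X.columns[selector.get_support()])

print("Kept after variance filter:", kept)
print("Chi-squared scores:", dict(zip(kept, selector.scores_.round(2))))
print("Selected features:", selected)
```

In this toy setup the constant column is removed by the variance filter, and the feature tied to the target receives the highest chi-squared score. On real data, the threshold and the value of `k` would need to be chosen from the EDA results.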

ML-Model

Multiple classification models are trained and evaluated, including Logistic Regression, Support Vector Machines (linear and RBF kernels), K-Nearest Neighbors, and Gaussian Naive Bayes. Model performance is assessed using accuracy, confusion matrices, cross-validation scores, and ROC curves.
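The comparison described above could look roughly like the following sketch. It trains the five listed model types on synthetic data from `make_classification` and reports cross-validated accuracy, test accuracy, a confusion matrix, and ROC AUC (a single-number summary of the ROC curve). The data, the train/test split, the scaling step, and all hyperparameters are assumptions for illustration; the project's actual settings are not shown in this README.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix, roc_auc_score
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic binary classification data standing in for the real dataset.
X, y = make_classification(
    n_samples=400, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Scaling is added for the distance- and margin-based models (an assumption).
models = {
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "SVM (linear)": make_pipeline(StandardScaler(), SVC(kernel="linear")),
    "SVM (RBF)": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    "K-Nearest Neighbors": make_pipeline(
        StandardScaler(), KNeighborsClassifier(n_neighbors=5)
    ),
    "Gaussian Naive Bayes": GaussianNB(),
}

results = {}
for name, model in models.items():
    # 5-fold cross-validation on the training split only.
    cv_scores = cross_val_score(model, X_train, y_train, cv=5, scoring="accuracy")
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    # ROC analysis needs a continuous score: use the decision function when
    # the model has one, otherwise the predicted probability of class 1.
    if hasattr(model, "decision_function"):
        y_score = model.decision_function(X_test)
    else:
        y_score = model.predict_proba(X_test)[:, 1]
    results[name] = {
        "cv_accuracy": cv_scores.mean(),
        "test_accuracy": accuracy_score(y_test, y_pred),
        "confusion_matrix": confusion_matrix(y_test, y_pred),
        "roc_auc": roc_auc_score(y_test, y_score),
    }

for name, metrics in results.items():
    print(
        f"{name}: cv={metrics['cv_accuracy']:.3f} "
        f"test={metrics['test_accuracy']:.3f} auc={metrics['roc_auc']:.3f}"
    )
```

To plot full ROC curves rather than a single AUC value, the same `y_score` arrays can be passed to `sklearn.metrics.roc_curve`.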

The notebook demonstrates the full pipeline from raw data ingestion to model evaluation, showcasing how machine learning can assist in early medical diagnostics.
