A group academic project featuring a Cardiovascular Disease (CVD) Risk Prediction App and an Exploratory Data Analysis (EDA) Dashboard. This project utilizes machine learning and data visualization to assess CVD risk based on health metrics.
- π₯ CVD Risk Prediction App: Click Here
- π Final Project Report: Click Here
Cardiovascular diseases (CVD) are among the leading causes of mortality worldwide. This project aims to:
β Predict the likelihood of developing CVD using machine learning models.
β Provide data-driven insights via an interactive dashboard.
β Offer user-friendly visualization for better understanding of CVD risk factors.
- Programming Language: Python π
- Data Analysis & Visualization: Pandas, NumPy, Matplotlib, Seaborn
- Machine Learning: Scikit-learn
- App Development: Streamlit
- Deployment: Streamlit Cloud
π¦ CVD-Risk-Predictor β£ π Data # Contains dataset for analysis β£ π EDA_Notebook # Jupyter notebooks for exploratory data analysis β£ π Models # Machine learning models used for prediction β£ π Streamlit_Apps # Streamlit app source code for prediction and analysis β£ π requirements.txt # Dependencies for running the project β£ π README.md # Project documentation
- Understanding CVD risk factors through visualizations
- Correlation analysis between different health indicators
- Data preprocessing and feature engineering
The app uses logistic regression, random forest, and other classifiers to predict CVD risk. Model performance is evaluated using:
- Accuracy, Precision, Recall, and F1-Score
- ROC Curve & Feature Importance
git clone https://github.com/Shubham-S151/CVD-Risk-Predictor.git cd CVD-Risk-Predictor
pip install -r requirements.txt
streamlit run Streamlit_Apps/app.py