🫀 Heart Disease Prediction & Analysis

Comprehensive Machine Learning Pipeline on Heart Disease UCI Dataset

Project Overview

This project provides a complete Machine Learning pipeline for analyzing, predicting, and visualizing heart disease risks using the UCI Heart Disease dataset. It covers data preprocessing, dimensionality reduction (PCA), supervised & unsupervised models, hyperparameter tuning, and optional web deployment using Streamlit & Ngrok.

Project Objectives

Perform Data Cleaning & Preprocessing (missing values, encoding, scaling).
Apply Dimensionality Reduction using PCA.
Implement Feature Selection using Random Forest, RFE, Chi-Square.
Train Supervised Models:
- Logistic Regression
- Decision Tree
- Random Forest
- Support Vector Machine (SVM)
Apply Unsupervised Learning:
- K-Means Clustering
- Hierarchical Clustering
Optimize models using GridSearchCV & RandomizedSearchCV.
Deploy a Streamlit Web App and use Ngrok for public access.

Dataset

Name: Heart Disease UCI Dataset
Description: Predict the presence or absence of heart disease based on clinical parameters.

Tools & Libraries

Language: python
Libraries:
- pandas, numpy – Data Handling
- matplotlib , Seaborn – Visualization
- sklearn – Machine Learning Models & PCA
- joblib - Save Model as .plk
- streamlit – Interactive Web App
- ngrok – Public URL for Deployment

Project Structure

Heart_Disease_Project/
│── data/
│   ├── heart_disease.csv
│── notebooks/
│   ├── 00_data_collecting.ipynb
│   ├── 01_data_preprocessing.ipynb
│   ├── 02_pca_analysis.ipynb
│   ├── 03_feature_selection.ipynb
│   ├── 04_supervised_learning.ipynb
│   ├── 05_unsupervised_learning.ipynb
│   ├── 06_hyperparameter_tuning.ipynb
│── models/
│   ├── final_model.pkl
│── ui/
│   ├── app.py (Streamlit UI)
│── deployment/
│   ├── ngrok_setup.txt
│── results/
│   ├── evaluation_metrics.txt
│── requirements.txt
│── README.md
│── .gitignore

How to Run

Clone the Repository

git clone https://github.com/basmala-ayman/Heart-Disease.git
cd Heart-Disease

Create the Virtual Environment

python3 -m venv venv

Activate the virtual environment

# On macOS/Linux:
source venv/bin/activate

# On Windows:
venv\Scripts\activate

Install Dependencies

pip install -r requirements.txt

Run Jupyter Notebooks

jupyter notebook

Run the Streamlit Web App

streamlit run ui/app.py

Deploy using Ngrok

Read instructions in deployment/ngrok_setup.txt.

Pipeline Workflow

Data Preprocessing & Cleaning – Handle missing values, encoding, scaling
PCA Analysis – Dimensionality Reduction
Feature Selection – Random Forest, RFE, Chi-Square
Model Training – Logistic Regression, Decision Tree, Random Forest, SVM
Evaluation – Accuracy, Precision, Recall, F1, ROC-AUC
Clustering – K-Means & Hierarchical Clustering
Hyperparameter Tuning – GridSearchCV, RandomizedSearchCV
Deployment (Bonus) – Streamlit & Ngrok

Results & Deliverables

Cleaned Dataset
PCA & Feature Selection Results
Trained Models with Evaluation Metrics
Optimized Model Saved as .pkl
Interactive Streamlit UI
Ngrok Public Access Link

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🫀 Heart Disease Prediction & Analysis

Project Overview

Table of Contents

Project Objectives

Dataset

Tools & Libraries

Project Structure

How to Run

Clone the Repository

Create the Virtual Environment

Activate the virtual environment

Install Dependencies

Run Jupyter Notebooks

Run the Streamlit Web App

Deploy using Ngrok

Pipeline Workflow

Results & Deliverables

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
data		data
deployment		deployment
models		models
notebooks		notebooks
results		results
ui		ui
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🫀 Heart Disease Prediction & Analysis

Project Overview

Table of Contents

Project Objectives

Dataset

Tools & Libraries

Project Structure

How to Run

Clone the Repository

Create the Virtual Environment

Activate the virtual environment

Install Dependencies

Run Jupyter Notebooks

Run the Streamlit Web App

Deploy using Ngrok

Pipeline Workflow

Results & Deliverables

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages