Skip to content

omidcodes/ML-heart-disease-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

❤️ Heart Disease Prediction using Machine Learning

This project analyzes health survey data from the Heart Disease Health Indicators dataset to predict the likelihood of heart disease.

The notebook walks through data preprocessing, exploratory data analysis, and training multiple classification models including:

  • Random Forest
  • XGBoost
  • Support Vector Machine (SVM)
  • K-Nearest Neighbors (KNN)
  • Decision Tree
  • Gradient Boost
  • Stacked Ensemble
  • Neural Network

Model performance is evaluated using:

  • Accuracy
  • ROC-AUC
  • Confusion Matrix
  • Classification Report

📂 Files

  • Heart_Diseases_project_V_01.ipynb — Main Jupyter Notebook with data exploration, preprocessing, model training, and evaluation.
  • heart_disease_health_indicators.csv — Dataset file (download from Kaggle and place in this folder).
  • requirements.txt — Python dependencies.
  • README.md — Project description and usage.

▶️ How to Run

  1. Download the dataset from Kaggle and save it as heart_disease_health_indicators.csv in this folder.
  2. Install the dependencies:
pip install -r requirements.txt
  1. Open the notebook:
jupyter notebook Heart_Diseases_project_V_01.ipynb
  1. Run all cells to reproduce the analysis.

🙌 Acknowledgements

  • Kaggle dataset by Alex Teboul
  • scikit-learn, XGBoost, pandas, matplotlib, seaborn

About

Machine learning project using the Kaggle Heart Disease Health Indicators dataset to predict heart disease risk. Covers preprocessing, EDA, and training models (Logistic Regression, Random Forest, XGBoost, SVM, KNN, Decision Tree, Naive Bayes) with evaluation via Accuracy, ROC-AUC, and Confusion Matrix.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors