Precisely Filtered Cuffless Blood Pressure Prediction using Multi-Modal Data from PulseDB

Project Overview

This project focuses on the development and evaluation of various machine learning models for cuffless blood pressure (BP) prediction using a precisely filtered, high-quality subset of the PulseDB dataset. Our research explores different modeling strategies, including building a general model applicable across individuals, fine-tuning this general model for specific individuals, and training dedicated models for each individual. Importantly, our approach leverages a combination of features derived from Photoplethysmography (PPG) signals, and potentially Electrocardiography (ECG) signals and personal information, along with explicitly incorporating relevant cardiovascular-related features within our regression methods.

Dataset

This project utilizes a meticulously curated and precisely filtered subset of the PulseDB dataset. A significant effort has been made to remove noisy data and data contaminated by artifacts, ensuring a high-quality training resource for cuffless blood pressure prediction research. This refined dataset contains PPG signals, and may also include synchronized ECG signals and relevant personal information for each subject.

Methodology

We explore several approaches to predict blood pressure without the need for a traditional cuff, utilizing a blend of input features and regression techniques:

General Model:
- We aim to build a robust machine learning model trained on the entire filtered PulseDB dataset (or a significant portion thereof) to predict blood pressure for a general population.
- This model utilizes a combination of features, primarily derived from PPG signals, and potentially incorporating ECG signals and personal information.
- Crucially, our regression methods also incorporate cardiovascular-related features extracted from the physiological signals, going beyond raw signal data. These features are designed to capture key physiological indicators relevant to blood pressure.
- We will experiment with various regression models, potentially including but not limited to:
  - Traditional machine learning models (e.g., Random Forest, Gradient Boosting).
  - Deep learning models (e.g., Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Transformer networks) designed for time-series data, with architectures adapted to handle multi-modal input and extracted features.
Personalized Fine-tuning:
- We will investigate the effectiveness of fine-tuning the pre-trained general model on the data of individual subjects.
- This approach aims to adapt the general knowledge learned by the model to the specific physiological characteristics of each individual, potentially leading to improved prediction accuracy compared to the general model alone.
Personal Model:
- We will also explore training individual machine learning models for each subject in the filtered PulseDB dataset, using only their respective data.
- This strategy allows the model to learn highly personalized relationships between the blended features (derived from PPG, potentially ECG, personal information, and cardiovascular insights) and blood pressure for each individual.

Input Signals and Features

Our models leverage a combination of information for blood pressure prediction:

Photoplethysmography (PPG): The primary input signal, from which various temporal and morphological features are extracted.
Electrocardiography (ECG): May be used as an additional signal source to derive complementary features related to cardiac timing and function.
Personal Information: Such as age, gender, weight, height, etc., which may have correlations with blood pressure and serve as valuable contextual features.
Cardiovascular-Related Features: Explicitly engineered features derived from the PPG and potentially ECG signals, designed to capture physiological indicators known to be associated with blood pressure regulation.

Expected Outcomes

This project aims to achieve the following:

Develop a robust general machine learning model capable of predicting cuffless blood pressure with high accuracy on the precisely filtered PulseDB dataset, utilizing a blend of signal features and cardiovascular insights.
Demonstrate the significant potential of personalized fine-tuning to enhance blood pressure prediction accuracy for individual subjects.
Evaluate the performance of individual-specific models trained on single-subject data using our multi-feature approach.
Compare the effectiveness of the general model, personalized fine-tuned models, and individual models in the context of cuffless blood pressure prediction.
Contribute to the advancement of precise and reliable non-invasive blood pressure monitoring techniques.

Tools and Technologies

Python: Primary programming language.
Pandas: For data manipulation and analysis.
NumPy: For numerical computations.
Scikit-learn: For implementing traditional machine learning models and evaluation metrics.
TensorFlow or PyTorch: For building and training deep learning models.
Libraries for signal processing: (e.g., SciPy, librosa) for extracting features from PPG and ECG signals.
Libraries for feature engineering: (e.g., custom functions developed for extracting cardiovascular-related features).
Matplotlib and Seaborn: For data visualization and result presentation.

Potential Future Work

Building upon the findings of this project, future research could explore:

Investigating more sophisticated deep learning architectures tailored for multi-modal time-series data and feature fusion.
Developing novel cardiovascular-related features to further improve prediction accuracy.
Exploring the use of explainable AI (XAI) techniques to understand the model's predictions and the importance of different features.
Investigating the robustness and generalizability of the models to unseen populations and different data acquisition settings.
Exploring the integration of these models into real-time wearable blood pressure monitoring systems.

Contact Information

James Lin - AI/ML Algorithm Researcher [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.vscode		.vscode
PulseDB-analysis-main		PulseDB-analysis-main
__pycache__		__pycache__
results		results
runs		runs
.gitattributes		.gitattributes
.gitignore		.gitignore
ECG_segment_autoencoder.py		ECG_segment_autoencoder.py
LICENSE		LICENSE
PulseDB analysis test3.sqlite3		PulseDB analysis test3.sqlite3
README.md		README.md
UNet_1DCNN.py		UNet_1DCNN.py
abp_anomaly_detection_algorithms.py		abp_anomaly_detection_algorithms.py
anomaly_data.npz.npy		anomaly_data.npz.npy
bp_PITN.py		bp_PITN.py
bp_abp_transformer.py		bp_abp_transformer.py
bp_error_statistics.py		bp_error_statistics.py
bp_estimator_model		bp_estimator_model
bp_estimator_vitaldb		bp_estimator_vitaldb
bp_maml_personal_model.py		bp_maml_personal_model.py
bp_model_trainer.py		bp_model_trainer.py
bp_model_trainer_ver2.py		bp_model_trainer_ver2.py
bp_model_with_annotations.py		bp_model_with_annotations.py
bp_multi_signal_fusion_train.py		bp_multi_signal_fusion_train.py
bp_regression_all_methods.py		bp_regression_all_methods.py
bp_regression_personal.py		bp_regression_personal.py
bp_regression_with_rr.py		bp_regression_with_rr.py
bp_representation_pretrain.py		bp_representation_pretrain.py
bp_representation_regression.py		bp_representation_regression.py
bp_resunet_attention_compare.py		bp_resunet_attention_compare.py
bp_simple_personal.py		bp_simple_personal.py
bp_vascular_regression.py		bp_vascular_regression.py
check_h5_content.py		check_h5_content.py
check_mimic_files.py		check_mimic_files.py
check_training_data.py		check_training_data.py
continue_GUI_app.py		continue_GUI_app.py
ecg_rpeak_detector.py		ecg_rpeak_detector.py
evaluate_per_person.py		evaluate_per_person.py
finetune_bp_model.py		finetune_bp_model.py
gen_pulseDB_database.py		gen_pulseDB_database.py
h5_viewer_app.py		h5_viewer_app.py
load_normap_ABP_data.py		load_normap_ABP_data.py
main_GUI_app.py		main_GUI_app.py
main_GUI_app.py 中的 PulseDBViewer 类		main_GUI_app.py 中的 PulseDBViewer 类
model_BP_ratio_trend.py		model_BP_ratio_trend.py
model_abp_1250points.py		model_abp_1250points.py
model_pulse_representation.py		model_pulse_representation.py
model_signal_translation_1250points.py		model_signal_translation_1250points.py
model_structure_1250		model_structure_1250
parser_mat2h5.py		parser_mat2h5.py
personalized_data_preparator_mimic.py		personalized_data_preparator_mimic.py
personalized_data_preparator_vitaldb.py		personalized_data_preparator_vitaldb.py
personalized_model.py		personalized_model.py
personalized_regression_results.csv		personalized_regression_results.csv
ppg_pretrain_contrastive.py		ppg_pretrain_contrastive.py
ppg_pretrain_representation_MAE.py		ppg_pretrain_representation_MAE.py
ppg_pretrain_vae.py		ppg_pretrain_vae.py
ppg_pretrain_vqvae.py		ppg_pretrain_vqvae.py
prepare_training_data.py		prepare_training_data.py
prepare_training_data_mimic.py		prepare_training_data_mimic.py
prepare_training_data_vitaldb.py		prepare_training_data_vitaldb.py
preprocessing.py		preprocessing.py
process_mimic_ecg_peaks.py		process_mimic_ecg_peaks.py
pulsedb_annotations_vital.db-x-file_annotations-1-annotations.bin		pulsedb_annotations_vital.db-x-file_annotations-1-annotations.bin
query.py		query.py
random_forest_6groups_dbp_results_1737444739.csv		random_forest_6groups_dbp_results_1737444739.csv
random_forest_6groups_dbp_results_1739164723.csv		random_forest_6groups_dbp_results_1739164723.csv
random_forest_6groups_results_1737443283.csv		random_forest_6groups_results_1737443283.csv
random_forest_6groups_results_1737443440.csv		random_forest_6groups_results_1737443440.csv
random_forest_6groups_results_experiment_1.csv		random_forest_6groups_results_experiment_1.csv
random_forest_6groups_results_experiment_2.csv		random_forest_6groups_results_experiment_2.csv
random_forest_6groups_results_experiment_3.csv		random_forest_6groups_results_experiment_3.csv
random_forest_6groups_results_experiment_4.csv		random_forest_6groups_results_experiment_4.csv
random_forest_6groups_results_experiment_5.csv		random_forest_6groups_results_experiment_5.csv
random_forest_6groups_sbp_results_1737444633.csv		random_forest_6groups_sbp_results_1737444633.csv
random_forest_6groups_sbp_results_1739164597.csv		random_forest_6groups_sbp_results_1739164597.csv
read_mat.py		read_mat.py
save_annotation_normality_sqlite.py		save_annotation_normality_sqlite.py
sqlite_schema.sql		sqlite_schema.sql
test.py		test.py
test_contrastive_learning.py		test_contrastive_learning.py
test_cuda.py		test_cuda.py
test_fix_h5.py		test_fix_h5.py
test_fix_h5_rr.py		test_fix_h5_rr.py
test_fix_h5_vascular_properties.py		test_fix_h5_vascular_properties.py
test_h5.py		test_h5.py
test_laplace.py		test_laplace.py
unzip_pulsedb2.bat		unzip_pulsedb2.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Precisely Filtered Cuffless Blood Pressure Prediction using Multi-Modal Data from PulseDB

Project Overview

Dataset

Methodology

Input Signals and Features

Expected Outcomes

Tools and Technologies

Potential Future Work

Contact Information

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Precisely Filtered Cuffless Blood Pressure Prediction using Multi-Modal Data from PulseDB

Project Overview

Dataset

Methodology

Input Signals and Features

Expected Outcomes

Tools and Technologies

Potential Future Work

Contact Information

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages