TY - JOUR
T1 - A novel approach for the effective prediction of cardiovascular disease using applied artificial intelligence techniques
AU - Mir, Azka
AU - Ur Rehman, Attique
AU - Ali, Tahir Muhammad
AU - Javaid, Sabeen
AU - Almufareh, Maram Fahaad
AU - Humayun, Mamoona
AU - Shaheen, Momina
PY - 2024/7/11
Y1 - 2024/7/11
N2 - Aims: The objective of this research is to develop an effective cardiovascular disease prediction framework using machine learning techniques and to achieve high accuracy for the prediction of cardiovascular disease. Methods: In this paper, we have utilized machine learning algorithms to predict cardiovascular disease on the basis of symptoms such as chest pain, age and blood pressure. This study incorporated five distinct datasets: Heart UCI, Stroke, Heart Statlog, Framingham and Coronary Heart dataset obtained from online sources. For the implementation of the framework, RapidMiner tool was used. The three‐step approach includes pre‐processing of the dataset, applying feature selection method on pre‐processed dataset and then applying classification methods for prediction of results. We addressed missing values by replacing them with mean, and class imbalance was handled using sample bootstrapping. Various machine learning classifiers were applied out of which random forest with AdaBoost dataset using 10‐fold cross‐validation provided the high accuracy. Results: The proposed model provides the highest accuracy of 99.48% on Heart Statlog, 93.90% on Heart UCI, 96.25% on Stroke dataset, 86% on Framingham dataset and 78.36% on Coronary heart disease dataset, respectively. Conclusions: In conclusion, the results of the study have shown remarkable potential of the proposed framework. By handling imbalance and missing values, a significantly accurate framework has been established that could effectively contribute to the prediction of cardiovascular disease at early stages.
AB - Aims: The objective of this research is to develop an effective cardiovascular disease prediction framework using machine learning techniques and to achieve high accuracy for the prediction of cardiovascular disease. Methods: In this paper, we have utilized machine learning algorithms to predict cardiovascular disease on the basis of symptoms such as chest pain, age and blood pressure. This study incorporated five distinct datasets: Heart UCI, Stroke, Heart Statlog, Framingham and Coronary Heart dataset obtained from online sources. For the implementation of the framework, RapidMiner tool was used. The three‐step approach includes pre‐processing of the dataset, applying feature selection method on pre‐processed dataset and then applying classification methods for prediction of results. We addressed missing values by replacing them with mean, and class imbalance was handled using sample bootstrapping. Various machine learning classifiers were applied out of which random forest with AdaBoost dataset using 10‐fold cross‐validation provided the high accuracy. Results: The proposed model provides the highest accuracy of 99.48% on Heart Statlog, 93.90% on Heart UCI, 96.25% on Stroke dataset, 86% on Framingham dataset and 78.36% on Coronary heart disease dataset, respectively. Conclusions: In conclusion, the results of the study have shown remarkable potential of the proposed framework. By handling imbalance and missing values, a significantly accurate framework has been established that could effectively contribute to the prediction of cardiovascular disease at early stages.
KW - multi‐dataset approach
KW - healthcare applications
KW - data imbalance handling
KW - cardiovascular disease prediction
KW - CVD prediction using machine learning
UR - https://doi.org/10.1002/ehf2.14942
U2 - 10.1002/ehf2.14942
DO - 10.1002/ehf2.14942
M3 - Article
SN - 2055-5822
JO - ESC Heart Failure
JF - ESC Heart Failure
ER -