Evaluating machine learning models for cardiovascular risk prediction: A Shapley Additive Explanations-based approach with statistical testing
Cardiovascular disease (CVD) remains the leading global cause of mortality, underscoring the need for accurate and interpretable prediction models to facilitate early diagnosis. Existing machine learning (ML) approaches often face challenges balancing predictive performance with clinical interpretability, limiting their adoption. This study introduces a structured evaluation framework combining A/B testing with statistical hypothesis validation to rigorously compare ML models for CVD risk prediction. Utilizing a dataset of 1,001 patient records, models including logistic regression, random forest (RF), artificial neural networks, and extreme gradient boosting (XGBoost) were trained and evaluated. Synthetic Minority Oversampling Technique was applied to address class imbalance, while Shapley Additive Explanations (SHAP) provided insights into feature contributions and guided the development of reduced-feature models. Results indicate that RF achieved the highest accuracy (98.5%) and area under the receiver operating characteristic curve (0.9991), whereas XGBoost coupled with SHAP enabled effective feature selection with minimal loss in predictive power. A/B testing demonstrated the trade-offs between model complexity and interpretability, while statistical testing confirmed the significance of performance differences. These findings suggest that interpretable, reduced-feature models may be viable for deployment in resource-limited clinical settings, advancing the integration of artificial intelligence in cardiovascular healthcare.
- Ogunpola A, Saeed F, Basurra S, Albarrak AM, Qasem SN. Machine learning-based predictive models for detection of cardiovascular diseases. Diagnostics (Basel). 2024;14(2):144. doi: 10.3390/diagnostics14020144
- World Health Organization. Cardiovascular Diseases. World Health Organization. Published 2025. Available from: https://www.who.int/health-topics/cardiovasculardiseases#tab=tab_1 [Last accessed on 2026 March 03].
- Das R, Turkoglu I, Sengur A. Effective diagnosis of heart disease through neural networks ensembles. Expert Syst Appl. 2009;36(4):7675-7680. doi: 10.1016/j.eswa.2008.09.013
- Huang G, Li Y, Jameel S, Long Y, Papanastasiou G. From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality? Comput Struct Biotechnol J. 2024;24:362-373. doi: 10.1016/j.csbj.2024.05.004
- TR R, Lilhore UK, M P, Simaiya S, Kaur A, Hamdi M. Predictive analysis of heart diseases with machine learning approaches. Malays J Comput Sci. 2022;2022(spec iss 1):132-148. doi: 10.22452/mjcs.sp2022no1.10
- Bhatt CM, Patel P, Ghetia T, Mazzeo PL. Effective heart disease prediction using machine learning techniques. Algorithms. 2023;16(2):88. doi: 10.3390/a16020088
- Hagan R, Gillan CJ, Mallett F. Comparison of machine learning methods for the classification of cardiovascular disease. Inform Med Unlocked. 2021;24:100606. doi: 10.1016/j.imu.2021.100606
- Yang J, Guan J. A heart disease prediction model based on feature optimization and Smote-Xgboost algorithm. Information. 2022;13(10):475. doi: 10.3390/info13100475
- Sena R, Hamida SB. ACTIVE SMOTE for imbalanced medical data classification. Lect Notes Bus Inf Process. 2024;486:81-97. doi: 10.1007/978-3-031-51664-1_6
- Srivastava S, Upreti K, Shanbhog M. Analysis of cardiovascular diseases prediction using machine learning classification algorithms. In: Proceedings of the 2024 IEEE Conference. IEEE; 2024. doi: 10.1109/accai61061.2024.10601806
- Wang K, et al. Convolution smoothing and non-convex regularization for SVM in high dimensions. Appl Soft Comput. 2024;155:111433. doi: 10.1016/j.asoc.2024.111433
- Flores-Araiza D, Villegas-Jimenez A, Lopez-Tiro F, Gonzalez-Mendoza M. On the link between model performance and causal scoring of medical image explanations. In: 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS). IEEE; 2024:1-8. doi: 10.1109/cbms61543.2024.00009
- Zhao R. Predicting cardiovascular disease using simple machine learning techniques. Highlights Sci Eng Technol. 2024;81:356-362. doi: 10.54097/3f973x84
- Miao H. Logistic regression for cardiovascular diseases prediction by integrating PCA and K-means++. Theor Nat Sci. 2024;38(1):126-132. doi: 10.54254/2753-8818/38/20240569
- Sakho A, Malherbe E, Scornet E. Do we need rebalancing strategies? A theoretical and empirical study around SMOTE and its variants. arXiv. Preprint posted online February 7, 2024. doi: 10.48550/arXiv.2402.03819
- Newaz A, Ahmed N, Haq FS. Survival prediction of heart failure patients using machine learning techniques. Inform Med Unlocked. 2021;26:100772. doi: 10.1016/j.imu.2021.100772
- Salman HA, Kalakech A, Steiti A. Random Forest Algorithm Overview. Babylon J Mach Learn. 2024;2024:69-79. doi: 10.58496/BJML/2024/007
- Thakur R. Explainable AI: developing interpretable deep learning models for medical diagnosis. Int J Multidiscip Res. 2024;6(4). doi: 10.36948/ijfmr.2024.v06i04.25281
- Sadeghi Z, Alizadehsani R, et al. A review of explainable artificial intelligence in healthcare. Comput Electr Eng. 2024;118:109370. doi: 10.1016/j.compeleceng.2024.109370
- Canto JG. Prevalence, Clinical Characteristics, and Mortality Among Patients With Myocardial Infarction Presenting Without Chest Pain. JAMA. 2000;283(24):3223. doi: 10.1001/jama.283.24.3223
- Khera R, Haimovich J, Hurley NC, et al. Use of Machine Learning Models to Predict Death After Acute Myocardial Infarction. JAMA Cardiol. 2021;6(6):633. doi: 10.1001/jamacardio.2021.0122
- Marketou ME, Vlachopoulos C, Hahalis G, et al. Clinical characteristics and management of patients with diabetes mellitus and stable coronary artery disease in daily clinical practice. The SCAD-DM Registry. Hellenic J Cardiol. 2021;62(6):408-415. doi: 10.1016/j.hjc.2020.12.006
- Rim AJ, et al. Concussions are associated with increases in blood pressure and cardiovascular risk in American-style football athletes. JACC Adv. 2025;4(5):101717. doi: 10.1016/j.jacadv.2025.101717
- Gunning D. Explainable Artificial Intelligence (XAI). Defense Advanced Research Projects Agency (DARPA); 2017. Available from: https://www.darpa.mil/program/explainable-artificial-intelligence [Last accessed on 2026 March 03].
- European Commission, High-Level Expert Group on AI. Ethics guidelines for trustworthy AI. Publications Office of the European Union; 2019. Available from: https://www.europarl.europa.eu/cmsdata/196377/AI%20HLEGEthics%20Guidelines%20for%20Trustworthy%20AI.pdf [Last accessed on 2026 March 03].
- Ganesan S, Somasiri N. Enhancing Cardiovascular Disease Prediction: Optimised Feature Selection and Machine Learning Techniques for Improved Accuracy. In: Proceedings of the 10th International Conference on Machine Learning Technologies (ICMLT). IEEE; 2025:55-62. doi: 10.1109/icmlt65785.2025.11193317
- Ganesan S, Somasiri N. Navigating the integration of machine learning in healthcare: challenges, strategies, and ethical considerations. J Comput Cogn Eng. 2025;4(1):8-23. doi: 10.47852/bonviewJCCE42023600
- Cicek V, Babaoglu M, Saylik F, et al. A New Risk Prediction Model for the Assessment of Myocardial Injury in Elderly Patients Undergoing Non-Elective Surgery. J Cardiovasc Dev Dis. 2024;12(1):6. doi: 10.3390/jcdd12010006
- Cicek V, Orhan AL, Saylik F, et al. Predicting short-term mortality in patients with acute pulmonary embolism with deep learning. Circ J. 2025;89(5):602-611. doi: 10.1253/circj.CJ-24-0630
- Yilmaz A, Hayiroglu MI, Salturk S, et al. Machine learning approach on high-risk treadmill exercise test to predict obstructive coronary artery disease by using P, QRS, and T-wave features. Curr Probl Cardiol. 2023;48(2):101482. doi: 10.1016/j.cpcardiol.2022.101482
