Background: Machine learning (ML) employs algorithms that learn from data, building models with the potential to predict events by aggregating a large number of variables and assessing their complex interactions. The aim of this study is to assess ML potential in identifying patients with ischemic heart disease (IHD) at high risk of cardiac death (CD). Methods: 3987 (mean age 68 ± 11) hospitalized IHD patients were enrolled. We implemented and compared various ML models and their combination into ensembles. Model output constitutes a new ML indicator to be employed for stratification. Primary variable importance was assessed with ablation tests. Results: An ensemble classifier combining three ML models achieved the best performance to predict CD (AUROC of 0.830, F1-macro of 0.726). ML indicator use through Cox survival analysis outperformed the 18 variables individually, producing a better stratification compared to standard multivariate analysis (improvement of ~20%). Patients in the low risk group defined through ML indicator had a significantly higher survival (88.8% versus 29.1%). The main variables identified were Dyslipidemia, LVEF, Previous CABG, Diabetes, Previous Myocardial Infarction, Smoke, Documented resting or exertional ischemia, with an AUROC of 0.791 and an F1-score of 0.674, lower than that of 18 variables. Both code and clinical data are freely available with this article. Conclusion: ML may allow a faster, low-cost and reliable evaluation of IHD patient prognosis by inclusion of more predictors and identification of those more significant, improving outcome prediction towards the development of precision medicine in this clinical field.
Machine learning to identify a composite indicator to predict cardiac death in ischemic heart disease
Pingitore, Alessandro;Zhang, Chenxiang;Ferragina, Paolo;Mastorci, Francesca;Sicari, Rosa;Tommasi, Alessandro;Zavattari, Cesare;Prencipe, Giuseppe;Sîrbu, Alina
2024-01-01
Abstract
Background: Machine learning (ML) employs algorithms that learn from data, building models with the potential to predict events by aggregating a large number of variables and assessing their complex interactions. The aim of this study is to assess ML potential in identifying patients with ischemic heart disease (IHD) at high risk of cardiac death (CD). Methods: 3987 (mean age 68 ± 11) hospitalized IHD patients were enrolled. We implemented and compared various ML models and their combination into ensembles. Model output constitutes a new ML indicator to be employed for stratification. Primary variable importance was assessed with ablation tests. Results: An ensemble classifier combining three ML models achieved the best performance to predict CD (AUROC of 0.830, F1-macro of 0.726). ML indicator use through Cox survival analysis outperformed the 18 variables individually, producing a better stratification compared to standard multivariate analysis (improvement of ~20%). Patients in the low risk group defined through ML indicator had a significantly higher survival (88.8% versus 29.1%). The main variables identified were Dyslipidemia, LVEF, Previous CABG, Diabetes, Previous Myocardial Infarction, Smoke, Documented resting or exertional ischemia, with an AUROC of 0.791 and an F1-score of 0.674, lower than that of 18 variables. Both code and clinical data are freely available with this article. Conclusion: ML may allow a faster, low-cost and reliable evaluation of IHD patient prognosis by inclusion of more predictors and identification of those more significant, improving outcome prediction towards the development of precision medicine in this clinical field.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.