Twenty-eight-day in-hospital mortality prediction for elderly patients with ischemic stroke in the intensive care unit: Interpretable machine learning models.
Risk stratification of elderly patients with ischemic stroke (IS) who are admitted to the intensive care unit (ICU) remains a challenging task. This study aims to establish and validate predictive models that are based on novel machine learning (ML) algorithms for 28-day in-hospital mortality in elderly patients with IS who were admitted to the ICU.
Data of elderly patients with IS were extracted from the electronic intensive care unit (eICU) Collaborative Research Database (eICU-CRD) records of those elderly patients admitted between 2014 and 2015. All selected participants were randomly divided into two sets: a training set and a validation set in the ratio of 8:2. ML algorithms, such as Naïve Bayes (NB), eXtreme Gradient Boosting (xgboost), and logistic regression (LR), were applied for model construction utilizing 10-fold cross-validation. The performance of models was measured by the area under the receiver operating characteristic curve (AUC) analysis and accuracy. The present study uses interpretable ML methods to provide insight into the model's prediction and outcome using the SHapley Additive exPlanations (SHAP) method.
As regards the population demographics and clinical characteristics, the analysis in the present study included 1,236 elderly patients with IS in the ICU, of whom 164 (13.3%) died during hospitalization. As regards feature selection, a total of eight features were selected for model construction. In the training set, both the xgboost and NB models showed specificity values of 0.989 and 0.767, respectively. In the internal validation set, the xgboost model identified patients who died with an AUC value of 0.733 better than the LR model which identified patients who died with an AUC value of 0.627 or the NB model 0.672.
The xgboost model shows the best predictive performance that predicts mortality in elderly patients with IS in the ICU. By making the ML model explainable, physicians would be able to understand better the reasoning behind the outcome.
Huang J
,Jin W
,Duan X
,Liu X
,Shu T
,Fu L
,Deng J
,Chen H
,Liu G
,Jiang Y
,Liu Z
... -
《Frontiers in Public Health》
Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation.
Timely and accurate outcome prediction plays a critical role in guiding clinical decisions for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. However, interpreting and translating the predictive models into clinical applications are as important as the prediction itself. This study aimed to develop an interpretable machine learning (IML) model that accurately predicts 28-day all-cause mortality in hypertensive ischemic or hemorrhagic stroke patients.
A total of 4,274 hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU in the USA from multicenter cohorts were included in this study to develop and validate the IML model. Five machine learning (ML) models were developed, including artificial neural network (ANN), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), logistic regression (LR), and support vector machine (SVM), to predict mortality using the MIMIC-IV and eICU-CRD database in the USA. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm. Model performance was evaluated based on the area under the curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV). The ML model with the best predictive performance was selected for interpretability analysis. Finally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU.
The XGBoost model demonstrated the best predictive performance, with the AUC values of 0.822, 0.739, and 0.700 in the training, test, and external cohorts, respectively. The analysis of feature importance revealed that age, ethnicity, white blood cell (WBC), hyperlipidemia, mean corpuscular volume (MCV), glucose, pulse oximeter oxygen saturation (SpO2), serum calcium, red blood cell distribution width (RDW), blood urea nitrogen (BUN), and bicarbonate were the 11 most important features. The SHAP plots were employed to interpret the XGBoost model.
The XGBoost model accurately predicted 28-day all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. The SHAP method can provide explicit explanations of personalized risk prediction, which can aid physicians in understanding the model.
Huang J
,Chen H
,Deng J
,Liu X
,Shu T
,Yin C
,Duan M
,Fu L
,Wang K
,Zeng S
... -
《Frontiers in Neurology》
Interpretable machine learning for 28-day all-cause in-hospital mortality prediction in critically ill patients with heart failure combined with hypertension: A retrospective cohort study based on medical information mart for intensive care database-IV an
Heart failure (HF) combined with hypertension is an extremely important cause of in-hospital mortality, especially for the intensive care unit (ICU) patients. However, under intense working pressure, the medical staff are easily overwhelmed by the large number of clinical signals generated in the ICU, which may lead to treatment delay, sub-optimal care, or even wrong clinical decisions. Individual risk stratification is an essential strategy for managing ICU patients with HF combined with hypertension. Artificial intelligence, especially machine learning (ML), can develop superior models to predict the prognosis of these patients. This study aimed to develop a machine learning method to predict the 28-day mortality for ICU patients with HF combined with hypertension.
We enrolled all critically ill patients with HF combined with hypertension in the Medical Information Mart for IntensiveCare Database-IV (MIMIC-IV, v.1.4) and the eICU Collaborative Research Database (eICU-CRD) from 2008 to 2019. Subsequently, MIMIC-IV was divided into training cohort and testing cohort in an 8:2 ratio, and eICU-CRD was designated as the external validation cohort. The least absolute shrinkage and selection operator (LASSO) Cox regression with internal tenfold cross-validation was used for data dimension reduction and identifying the most valuable predictive features for 28-day mortality. Based on its accuracy and area under the curve (AUC), the best model in the validation cohort was selected. In addition, we utilized the Shapley Additive Explanations (SHAP) method to highlight the importance of model features, analyze the impact of individual features on model output, and visualize an individual's Shapley values.
A total of 3,458 and 6582 patients with HF combined with hypertension in MIMIC-IV and eICU-CRD were included. The patients, including 1,756 males, had a median (Q1, Q3) age of 75 (65, 84) years. After selection, 22 out of a total of 58 clinical parameters were extracted to develop the machine-learning models. Among four constructed models, the Neural Networks (NN) model performed the best predictive performance with an AUC of 0.764 and 0.674 in the test cohort and external validation cohort, respectively. In addition, a simplified model including seven variables was built based on NN, which also had good predictive performance (AUC: 0.741). Feature importance analysis showed that age, mechanical ventilation (MECHVENT), chloride, bun, anion gap, paraplegia, rdw (RDW), hyperlipidemia, peripheral capillary oxygen saturation (SpO2), respiratory rate, cerebrovascular disease, heart rate, white blood cell (WBC), international normalized ratio (INR), mean corpuscular hemoglobin concentration (MCHC), glucose, AIDS, mean corpuscular volume (MCV), N-terminal pro-brain natriuretic peptide (Npro. BNP), calcium, renal replacement therapy (RRT), and partial thromboplastin time (PTT) were the top 22 features of the NN model with the greatest impact. Finally, after hyperparameter optimization, SHAP plots were employed to make the NN-based model interpretable with an analytical description of how the constructed model visualizes the prediction of death.
We developed a predictive model to predict the 28-day mortality for ICU patients with HF combined with hypertension, which proved superior to the traditional logistic regression analysis. The SHAP method enables machine learning models to be more interpretable, thereby helping clinicians to better understand the reasoning behind the outcome and assess in-hospital outcomes for critically ill patients.
Peng S
,Huang J
,Liu X
,Deng J
,Sun C
,Tang J
,Chen H
,Cao W
,Wang W
,Duan X
,Luo X
,Peng S
... -
《Frontiers in Cardiovascular Medicine》