Predicting in-hospital mortality in adult non-traumatic emergency department patients: a retrospective comparison of the Modified Early Warning Score (MEWS) and machine learning approach

Sep 13, 2021PeerJ

Predicting hospital death in adults with non-injury emergencies: comparing a warning score and machine learning

AI simplified

Abstract

The stacking machine learning model achieved an of 0.939 for predicting 6-hour in-hospital mortality.

  • The study involved 24,373,326 emergency department visits after excluding 182,001 cases.
  • For 6, 24, 72, and 168 hours in-hospital mortality, the AUROC for was 0.897, 0.865, 0.841, and 0.816, respectively.
  • The stacking machine learning model outperformed MEWS across all time frames for in-hospital mortality prediction.
  • The (AUPRC) for MEWS dropped below 0.1 for predictions beyond 48 hours.
  • The machine learning model demonstrated statistically significant improvements over MEWS in both AUROC and AUPRC values (all p < 0.001).

AI simplified

Key numbers

0.939
for 6-hour mortality prediction
ML model vs. performance
0.902
for 168-hour mortality prediction
ML model vs. performance
0.317
for 6-hour mortality prediction
ML model vs. performance

Full Text

What this is

  • This study compares the () with a machine learning (ML) model for predicting in-hospital mortality in adult non-traumatic emergency department patients.
  • Using data from five hospitals in Taiwan, the research evaluates the effectiveness of these two approaches over different time frames.
  • The findings indicate that the ML model generally outperforms , particularly in predicting mortality within 6 hours and over longer periods.

Essence

  • The stacking machine learning model predicts in-hospital mortality more accurately than the () in adult non-traumatic emergency department patients, especially for early mortality.

Key takeaways

  • The ML model achieved an () of 0.939 for predicting 6-hour in-hospital mortality, compared to 0.897 for .
  • For predicting 168-hour mortality, the ML model's was 0.902, while dropped to 0.816, indicating a significant decline in performance over time.
  • The ML model also showed a higher () than , with values of 0.317 for 6 hours and 0.215 for 168 hours.

Caveats

  • The study's retrospective design may limit the generalizability of the findings to other settings or populations.
  • Exclusions of patients with out-of-hospital deaths may affect the overall mortality predictions and their clinical relevance.
  • Data imbalance, with non-survival patients comprising less than 1% of the dataset, could influence the performance of the models.

Definitions

  • Modified Early Warning Score (MEWS): A scoring system using physiological parameters to predict clinical outcomes, including mortality, in emergency department patients.
  • Area Under the Receiver Operating Characteristic Curve (AUROC): A performance measurement for classification models, representing the ability to distinguish between positive and negative cases.
  • Area Under the Precision and Recall Curve (AUPRC): A metric for evaluating the performance of a model, particularly useful in imbalanced datasets, focusing on the trade-off between precision and recall.

AI simplified

what lands in your inbox each week:

  • 📚7 fresh studies
  • 📝plain-language summaries
  • direct links to original studies
  • 🏅top journal indicators
  • 📅weekly delivery
  • 🧘‍♂️always free