Frontiers in psychiatry

Prediction models for depression after stroke: a review and combined analysis

Updated

Abstract

Neural Network models achieved the highest pooled predictive accuracy for with an of 0.88.

  • The predictive accuracy of various models was assessed using area under the curve (AUC) values from 16 studies.
  • Logistic Regression, Decision Tree, and K-Nearest Neighbor models exhibited comparable AUC values ranging from 0.77 to 0.83.
  • Support Vector Machine models showed the lowest predictive performance with an AUC of 0.68.
  • Functional, physical, and cognitive assessments provided the highest predictive accuracy (AUC = 0.86), while biomarker-based models had an AUC of 0.80.
  • Within retrospective studies, biomarker-based data sources demonstrated significantly higher predictive performance with an AUC of 0.94.
  • Current evidence is limited by small sample sizes and high heterogeneity, necessitating further validation of these predictive models.

Simplified

Key numbers

0.88
Pooled based on two studies with wide confidence intervals.
0.86
Functional Assessment
Reflects the predictive performance of functional, physical, and cognitive assessments.
0.79
Pooled value indicating moderate accuracy.

Key figures

Figure 1
Study selection process for a systematic review and meta-analysis on prediction models
Frames the rigorous filtering process that ensures only relevant, quality studies inform the review’s conclusions
fpsyt-16-1629023-g001
  • Panel Identification
    Records identified from four databases totaling 107, with 62 duplicate records removed before screening
  • Panel Screening
    45 records screened, with 5 excluded based on title and abstract
  • Panel Screening continued
    40 reports sought for retrieval and assessed for eligibility, with none not retrieved
  • Panel Eligibility
    20 reports excluded for reasons including no (12), no depression data (8), and low quality (4)
  • Panel Included
    16 studies included in the final review and meta-analysis
Figure 2
Predictive performance of different models for using values
Highlights higher predictive accuracy in models and variability across model types for post-stroke depression
fpsyt-16-1629023-g002
  • Panels SVM
    Area under the curve (AUC) values for models range from 0.68 to 0.71 with low
  • Panels Decision Tree
    models show AUC values ranging from 0.70 to 0.96 with high heterogeneity; some models appear to have higher AUC near 0.90 or above
  • Panels Logistic Regression
    models have AUC values between 0.59 and 0.93 with high heterogeneity; several models cluster around 0.70 to 0.88
  • Panels Neural Network
    Neural Network models show AUC values of 0.73 and 0.95 with a pooled AUC of 0.88 and very high heterogeneity
  • Panels K-nearest neighbor
    models have AUC values around 0.70 with pooled AUC near 0.80
Figure 3
Predictive performance of different data sources for using values
Highlights higher predictive accuracy in functional, physical, and cognitive tests compared to other data sources
fpsyt-16-1629023-g003
  • Panels Sociological data and Clinical data
    AUC values range from 0.59 to 0.93 with a total random effect AUC of 0.76 (95% : 0.68-0.82)
  • Panels Liver function test
    AUC values range from 0.68 to 0.76 with a total random effect AUC of 0.73 (95% CI: 0.70-0.75)
  • Panels Functional, physical, and cognitive tests
    AUC values range from 0.70 to 0.95 with a total random effect AUC of 0.85 (95% CI: 0.80-0.89)
  • Panels Biomarker
    AUC values range from 0.59 to 0.96 with a total random effect AUC of 0.80 (95% CI: 0.71-0.86)
Figure 4
Predictive accuracy of different models for using values.
Highlights higher predictive accuracy in models compared to and models.
fpsyt-16-1629023-g004
  • Panel Decision Tree
    Shows AUC values for various boosting and forest methods, with total random effect AUC of 0.85 [0.78; 0.90].
  • Panel SVM
    Displays AUC values for SVM models, with total random effect AUC of 0.68 [0.65; 0.70] and no .
  • Panel Logistic Regression
    Includes multiple logistic regression studies with total random effect AUC of 0.79 [0.74; 0.82] and high heterogeneity.
  • Panel Neural Network
    Contains one study with AUC of 0.95 [0.94; 0.96].
  • Panel K-nearest neighbor
    Shows AUC values with total random effect AUC of 0.81 [0.77; 0.84].
Figure 5
Predictive performance of different data sources for using correlation ratios.
Highlights stronger predictive correlations in functional and cognitive tests compared to other data sources for post-stroke depression.
fpsyt-16-1629023-g005
  • Panel Liver function test
    Correlation ratios () for liver function tests range from 0.68 to 0.76 with a pooled COR of 0.73 and high (I² = 91.2%).
  • Panel Functional, physical, and cognitive tests
    COR values for cognitive and physical tests range from 0.70 to 0.95 with a pooled COR of 0.86 and high heterogeneity (I² = 91.6%).
  • Panel Biomarker
    Biomarker COR values vary widely from 0.59 to 0.96 with a pooled COR of 0.80 and very high heterogeneity (I² = 94%).
  • Panel Sociological data and Clinical data
    COR values for sociological and clinical data range from 0.59 to 0.93 with a pooled COR of 0.79 and very high heterogeneity (I² = 96.9%).
  • Panel Total (all data sources combined)
    Overall pooled COR is 0.81 with very high heterogeneity (I² = 96.7%).
1 / 5

Full Text

What this is

  • This systematic review and meta-analysis evaluates clinical prediction models for ().
  • It focuses on the effectiveness of both traditional statistical methods and machine learning approaches.
  • The review synthesizes data from 16 studies to determine which models provide the best predictive accuracy for .

Essence

  • Neural Network models show the highest predictive accuracy for ( = 0.88), although based on limited studies. Traditional models like Logistic Regression also perform competitively, while functional assessments yield the best predictive data.

Key takeaways

  • Neural Networks yield the highest pooled of 0.88 for predicting , although this is based on only two studies with wide confidence intervals.
  • Logistic Regression, Decision Tree, and K-Nearest Neighbor models show comparable performance with pooled values ranging from 0.77 to 0.83.
  • Functional, physical, and cognitive assessments provide the strongest predictive accuracy ( = 0.86), highlighting their importance in early prediction.

Caveats

  • The limited number of studies evaluating machine learning for prediction restricts the generalizability of the findings.
  • Variations in participant characteristics and data quality may contribute to the observed heterogeneity in predictive performance.
  • The predictive performance of the Support Vector Machine model was low ( = 0.68), indicating it may not be suitable for prediction.

Definitions

  • Post-stroke depression (PSD): A common neuropsychological condition following a stroke, linked to cognitive decline and increased mortality.
  • Area under the curve (AUC): A statistical measure used to evaluate the predictive accuracy of a model, with values closer to 1 indicating better performance.

Simplified

what lands in your inbox each week:

  • 📚7 fresh studies
  • 📝plain-language summaries
  • direct links to original studies
  • 🏅top journal indicators
  • 📅weekly delivery
  • 🧘‍♂️always free