Differential gene expression profiling and machine learning-based discovery of key genetic markers in VTE and CKD

Nov 7, 2025Frontiers in immunology

Identifying important genes linked to blood clots and kidney disease using gene analysis and machine learning

AI simplified

Abstract

A total of 23 overlapping genes were identified between patients with venous thromboembolism (VTE) and chronic kidney disease (CKD).

  • 637 (DEGs) were identified in VTE patients, with 413 upregulated and 224 downregulated.
  • 671 DEGs were identified in CKD patients, consisting of 99 upregulated and 572 downregulated genes.
  • Functional analyses revealed that VTE DEGs are mainly associated with cytoplasmic translation, immune activation, and oxidative phosphorylation.
  • CKD DEGs were enriched in pathways related to muscle contraction regulation, ATPase activity, and vascular smooth muscle contraction.
  • HNRNPA0 and PI4KA were identified as the most robust feature genes associated with both conditions, demonstrating excellent diagnostic performance ( = 1.000).
  • Validation showed significantly lower expression of HNRNPA0 and PI4KA in CKD samples compared to controls.

AI simplified

Key numbers

637
in
413 upregulated and 224 in patients.
671
in
99 upregulated and 572 in patients.
1.000
for Key Genes
calculated from machine learning analyses.

Key figures

Figure 1
Gene expression differences in and patients compared to controls
Highlights contrasting gene regulation patterns with more in VTE and more in CKD
fimmu-16-1654673-g001
  • Panel A
    of in VTE patients versus controls, with red showing upregulated and blue showing downregulated genes
  • Panel B
    Heatmap of differentially expressed genes in CKD patients versus controls, with red indicating upregulated and blue indicating downregulated genes
  • Panel C
    showing 224 downregulated and 413 upregulated genes in VTE patients compared to controls
  • Panel D
    Upset plot showing 572 downregulated and 99 upregulated genes in CKD patients compared to controls
Figure 2
GO and KEGG enrichment terms for in and patients
Highlights distinct biological processes and pathways enriched in VTE versus CKD gene expression profiles
fimmu-16-1654673-g002
  • Panel A
    Top terms for VTE genes including cytoplasmic translation, immune response activation, and ribosomal subunits
  • Panel B
    Top GO enrichment terms for CKD genes highlighting muscle contraction regulation, cation-transporting ATPase complex, and endopeptidase inhibitor activity
  • Panel C
    Top for VTE genes focusing on oxidative phosphorylation, reactive oxygen species related to chemical carcinogenesis, and COVID-19 signaling
  • Panel D
    Top KEGG pathways for CKD genes emphasizing pancreatic secretion, protein digestion and absorption, and vascular smooth muscle contraction
Figure 3
Overlap and expression levels of 23 shared genes in and patients versus controls
Highlights shared gene expression changes with significant differences in CKD and VTE patients versus controls
fimmu-16-1654673-g003
  • Panel A
    Venn diagram showing 23 genes differentially expressed in both CKD and VTE, with key genes labeled
  • Panel B
    Bar chart of gene expression levels for the 23 overlapping genes in CKD patients versus controls, with several genes showing statistically significant differences
  • Panel C
    Bar chart of gene expression levels for the 23 overlapping genes in VTE patients versus controls, with multiple genes significantly different; treated group appears to have lower expression for some genes
Figure 4
Feature gene selection and diagnostic performance in and using three machine learning algorithms
Highlights perfect diagnostic accuracy and key gene overlap for HNRNPA0 and PI4KA across three machine learning methods
fimmu-16-1654673-g004
  • Panel A
    algorithm tuning with 10-fold cross-validation showing coefficient paths and optimal lambda value indicated by a vertical line
  • Panel B
    feature ranking plot highlighting HNRNPA0 and PI4KA as the top two features with highest 5-fold cross-validation accuracy
  • Panel C
    SVM-RFE variable importance plot showing gene importance scores with HNRNPA0 and PI4KA among the highest
  • Panel D
    error rate plot across number of trees, showing error stabilizes near zero as trees increase
  • Panel E
    Random Forest variable importance plot with genes on y-axis and importance on x-axis; genes with importance >0.8 are highlighted, including HNRNPA0 and PI4KA
  • Panel F
    Venn diagram showing overlap of feature genes selected by LASSO, SVM-RFE, and Random Forest algorithms; HNRNPA0 and PI4KA are common to all three
  • Panels G and H
    ROC curves for PI4KA and HNRNPA0 respectively, both showing perfect diagnostic performance with values of 1.000
Figure 5
risk prediction using gene expression of HNRNPA0 and PI4KA with model evaluation metrics
Highlights the predictive accuracy and clinical utility of gene-based models for estimating CKD risk
fimmu-16-1654673-g005
  • Panel A
    assigning points to HNRNPA0 and PI4KA expression levels to estimate total points and predict CKD risk probability
  • Panel B
    comparing observed versus predicted CKD risk with a 45-degree line indicating perfect calibration
  • Panel C
    showing net benefit of the nomogram model compared to treating all or no patients across threshold probabilities
  • Panel D
    for the nomogram model with an (AUC) of 0.733 indicating diagnostic performance
  • Panel E
    ROC curves for individual feature genes HNRNPA0 and PI4KA showing their sensitivity and specificity in predicting CKD
1 / 5

Full Text

What this is

  • This research investigates the genetic links between venous thromboembolism (VTE) and chronic kidney disease (CKD).
  • It employs transcriptomic analysis and machine learning to identify key () shared by both conditions.
  • The study identifies HNRNPA0 and PI4KA as significant biomarkers with potential diagnostic applications.

Essence

  • HNRNPA0 and PI4KA are identified as key genes linking VTE and CKD, demonstrating strong diagnostic potential and involvement in immune and metabolic pathways.

Key takeaways

  • 637 were found in VTE patients, with 413 upregulated and 224 downregulated. In CKD, 671 were identified, including 99 upregulated and 572 downregulated.
  • Twenty-three overlapping between VTE and CKD were identified, with HNRNPA0 and PI4KA showing excellent diagnostic performance ( = 1.000) in machine learning analyses.
  • The diagnostic nomogram based on HNRNPA0 and PI4KA demonstrated high predictive accuracy and calibration, indicating its potential utility in clinical settings.

Caveats

  • The study's validation was conducted on a limited cohort size, which may affect the generalizability of the findings.
  • Direct mechanistic evidence for the roles of HNRNPA0 and PI4KA in CKD and VTE remains to be established through further functional studies.

Definitions

  • Differentially Expressed Genes (DEGs): Genes that show significant differences in expression levels between different conditions or groups.
  • Area Under the Curve (AUC): A measure of the diagnostic performance of a test, with 1.000 indicating perfect accuracy.

AI simplified

what lands in your inbox each week:

  • 📚7 fresh studies
  • 📝plain-language summaries
  • direct links to original studies
  • 🏅top journal indicators
  • 📅weekly delivery
  • 🧘‍♂️always free