What this is
- This research introduces a deep learning framework for estimating biological age (BA) across multiple organ systems using imaging data.
- It leverages magnetic resonance imaging (MRI) and optical coherence tomography (OCT) from over 70,000 participants in the UK Biobank.
- The study establishes () as indicators of accelerated or decelerated aging and their associations with health outcomes.
Essence
- The study demonstrates that imaging-derived biological age estimates can predict health outcomes and mortality, revealing significant associations between accelerated aging and increased disease risk across multiple organs.
Key takeaways
- Imaging-derived biological age estimates show strong prognostic value for health outcomes. Accelerated aging in organs correlates with higher risks of diseases such as chronic kidney disease and myocardial infarction.
- The framework effectively captures aging patterns across organs, even in those with subtle aging features, enhancing the understanding of inter-organ aging relationships.
- () serve as reliable indicators of biological aging, providing insights for personalized health assessments and interventions.
Caveats
- The study's reliance on a specific cohort may limit generalizability due to demographic imbalances, particularly among age, ethnicity, and socioeconomic status.
- Defining thresholds for accelerated aging remains challenging, and the absence of a universal reference standard complicates the interpretation of results.
- Further validation in diverse populations is necessary to confirm the robustness and applicability of imaging-derived biological age as a clinical tool.
Definitions
- Predicted Age Gaps (PAGs): The difference between predicted biological age and chronological age, indicating accelerated or decelerated aging.
AI simplified
Introduction
Human aging is a complex process influenced by biological, environmental, and socio-economic factors, including lifestyle, access to healthcare, and overall health status. These factors drive gradual structural and functional changes in cells, tissues, and organs1. Such alterations are closely linked to the development of many prevalent diseases, including cardiovascular disorders2,3, neurodegenerative conditions like Alzheimer’s disease4,5, or metabolic diseases such as type 2 diabetes6,7. In clinical practice, characteristic age distributions of diseases help to estimate their likelihood in patients, significantly influencing diagnostic decisions and treatment strategies. Moreover, the growing longevity trend, reflected in increasing life expectancy across many populations, amplifies the importance of understanding age-related diseases and adapting healthcare systems accordingly8.
Because these structural and functional changes accumulate over time, chronological age (CA) is commonly used in medicine as an indicator of aging. However, while CA provides a standardized measure, it does not fully capture the biological processes underlying aging, nor does it account for individual variability in age-related phenotypes among individuals of the same chronological age group. The concept of biological age (BA) was introduced as a biomarker to better reflect the true individual variation associated with aging. However, the quantification of BA remains challenging as there is no straightforward definition or ground truth, given the multitude and complexity of influencing parameters.
To address these challenges, conventional methods have primarily relied on clinical biomarkers such as DNA methylation9,10, transcriptomics11,12, functional assessments13,14, laboratory data15,16, and patient histories17 to extract aging-related information. Extensive research focuses on improving these approaches, aiming to develop biomarkers that not only correlate strongly with CA but also demonstrate high prognostic power for age-related diseases and mortality. Yet, many of these methods rely on a single BA estimate for an individual, which may not fully reflect the heterogeneous nature of aging across different organs and tissues. To arrive at a more comprehensive assessment, investigations into BA at the multi-organ level are essential. Recent research by Tian et al. explored the BA estimation across multiple organ systems based on purely physiological and blood-derived phenotypes as well as phenotypes derived from brain imaging18. Individual aging clocks for seven body systems and three brain systems were developed.
The limited amount of available composite clinical biomarkers is usually insufficient to capture all age-related processes. Instead of directly quantifying changes at the cellular or molecular level, imaging-based approaches focus on detecting the observable consequences of the aging process in medical images. However, the information complexity and density demand the need for a systematic and objective analysis. Deep learning is particularly strong in identifying patterns at large-scale, and in addition, eliminates the need for hand-crafted features. Deep learning approaches have attracted attention and shown promising results in predicting the age of individual organs like the brain19–23, cardiovascular system24,25, knee26,27, chest28,29, and abdomen30. Since biological age (BA) is not directly observable in imaging data, most approaches use supervised learning to establish a link to chronological age (CA). Training on a healthy reference cohort allows the model to learn normative aging patterns, which can then be used to identify deviations as accelerated or decelerated aging in broader populations. Given this indirect link, it is equally important to assess how reliable such predictions are. Approaches such as quantifying prediction uncertainty or generating attention maps19,31 can provide insights into model confidence and highlight image regions most influential for the estimate.
Establishing normative aging patterns requires large and diverse reference groups, which are best represented in population-scale cohorts. Resources such as the UK Biobank (UKB)32 or the German National Cohort (NAKO)33 capture health data from the general population, in contrast to clinical studies that often target specific disease phenotypes. Their scale not only provides a robust basis for modeling biological age but also enables analyses across multiple organs, supporting comprehensive assessments, identification of factors shaping aging trajectories, and deeper insights into healthy aging and inter-organ relationships.
Despite these advantages, current imaging-based BA estimation faces several limitations and open challenges. First, even when trained on large healthy cohorts, supervised models remain vulnerable to biases if the selected reference groups do not adequately represent the diversity of the population. Second, while most studies estimate BA for individual organs, the inter-relationships between organs have not yet been systematically explored on the image level. Moreover, these imaging-derived BA estimates are rarely correlated with the rich set of outcomes available in large-scale cohorts such as UKB and NAKO. Although often referred to as an ‘aging clock’ or biomarker18,34, the prognostic value of imaging-derived BA remains uncertain, as many potential confounding factors still require systematic validation. In our own previous work35, we demonstrated the feasibility of multi-organ biological age estimation based on imaging data, providing preliminary evidence that deep learning can capture organ-specific aging trajectories. While these results highlighted the potential of imaging-derived multi-organ BA, they did not yet establish direct links to health outcomes or systematically address confounding effects in the training cohort. Addressing these gaps is essential to establishing reliable and generalizable aging trajectories across diverse populations, ultimately linking organ-level imaging signatures of aging to meaningful health outcomes.
We propose a deep learning-based framework for imaging-derived regional biological age estimation across multiple organ systems, enabling a comprehensive assessment of BA beyond single-organ models. Using an uncertainty-aware ResNet model19,36, we quantify the observable effect of aging from magnetic resonance images (MRI) and optical coherence tomography (OCT). The difference between predicted aging and CA, referred to as predicted age gaps (PAG), serves as an indicator for accelerated or decelerated aging. We further evaluate the prognostic value of the PAGs by relating them to outcomes associated with aging. Finally, by integrating multi-organ BA estimates, we explore the complex dynamics among different organ systems and factors underlying atypical aging, moving toward more reliable and holistic aging trajectories (code and models available at https://github.com/lab-midas/biological_age↗).
Results
Data characteristics and study overview
Biological age predictions were derived from the available imaging data for each organ. As not all participants received every imaging scan, sample sizes vary by organ. Brain BA was estimated using 3D T1-weighted MRIs (N = 51,265, mean CA = 64.4 ± 7.78 years, 25,634 female), cardiac BA from cardiac cine MRIs (N = 68,359, mean CA = 65.41 ± 7.80 years, 36,702 female), and BA in abdominal organs using 3D Dixon scans (N = 70,193, mean CA = 65.4 ± 7.80 years, 36,119 female). Fundus BA was predicted using fundus OCT images (left: N = 87,310, mean CA = 57.65 ± 8.30 years, 45,305 female; right: N = 88,112, mean CA = 57.67 ± 8.30 years, 45,597 female). All images underwent manual quality control by expert readers. For deep learning analysis, imaging data were intensity normalized, and organ-specific segmentation models37–39 cropped images to the respective organs with background removal for input to the BA estimation model.

Organ-specific biological age (BA) estimation framework using magnetic resonance imaging (MRI) of the brain, heart, kidneys, liver, spleen, pancreas, and optical coherence tomography (OCT) scans of left and right fundus. Images of the brain, heart, abdomen, and fundus are first processed through an organ-specific preprocessing pipeline including quality control and organ segmentation. A ResNet-based age regression model is trained for each organ using healthy subjects with chronological age (CA) as reference. Predicted age gaps (PAGs), defined as the deviation between the predicted organ-specific BA and CA, are categorized into accelerated, typical, and decelerated aging based on the thresholdderived from the cohort’s standard deviation for each organ. GradCAM attention maps highlight image regions contributing to BA predictions. θ

Overview of the dataset and cohort characteristics. Data composition () and cohort demographics are shown, including the age distribution of the full cohort () and the number of cases for relevant diseases () along with their respective age distributions (). Only diagnoses recorded within the study period (from individual study entry to data retrieval in 01/2024) are included. a b c d

Cohort flowchart illustrating the exclusion process during preprocessing, including quality control and health-based selection, to define organ-specific healthy training cohorts for biological age estimation.

Kernel density estimations of predicted age gaps in different organs across chronological age (confidence interval = 95%) in healthy subjects (i.e., quality-controlled cohort). Mean absolute error (MAE) + standard deviation and Pearson correlation (r) between predicted and chronological ages are displayed (* indicates< 5.56 × 10). p −3

Kernel density estimations (KDE) of predicted age gaps across organs for specific health outcomes. KDE plots show predicted age gaps in test subjects with pre-existing or future diagnoses (orange) overlaid to healthy controls (blue) from Fig.. Mean values are shown for the healthy cohort (dashed line) and diseased cohort (solid line). 4
Multi-organ biological age

Heatmap of Pearson correlation coefficients between predicted age gaps (PAGs) of different organs. Correlation coefficients are displayed in bold above their corresponding p-values, with statistically non-significant results shown in white (Bonferroni corrected, 36 tests,< 1.39 × 10). p −3
Explainability in BA estimation
In brain scans, the model often focuses on the ventricles, which are known to enlarge with age due to atrophy of surrounding brain tissue, particularly the gray matter40,45 (Fig. 7a). In fundus OCT scans, the model predominantly emphasizes blood vessels and the optic disc which are areas associated with retinal health and aging46,47 (Fig. 7b). In the heart, the model frequently highlights the myocardium and left ventricle, which undergo structural changes with aging, including ventricular dilation and thickening of the myocardial walls48,49 (Fig. 7c). In the kidneys, the model typically focuses on the hilum, where afferent arteriole dilation and other vascular changes are associated with aging50 (Fig. 7d). For the liver, aging is often accompanied by morphological changes51,52. While volumetric changes are difficult to represent in GradCAM analyses, the maps highlight the model’s focus on the shape and surface contours of the liver (Fig. 7e). For the spleen and pancreas, global volume loss is the primary observable age-related change. However, Grad-CAMs struggle to capture these volume changes effectively, making it difficult to identify clear and consistent patterns across different individuals (Fig. 7f, g). The highlighted features in all organs are consistent with previous reports on visual changes due to organ aging, supporting the validity of our proposed age prediction model. Overall, we observed no systematic differences in these patterns between sexes or between healthy and diseased subcohorts.

GradCAM highlights the regions influencing the model’s decisions. Representative activation patterns are shown forbrain,fundus,heart,left kidney,liver,spleen,pancreas. Color-coding indicates regions with strong (1.0) and low influence (0.0). Examples are stratified by sex and disease status. a b c d e f g
Prognostic value of PAGs for mortality
PAGs were evaluated as potential prognostic biomarkers of diseases and mortality by comparing the likelihood of health outcomes across different aging groups using Kaplan-Meier survival analysis. Given that age is a well-established risk factor for many chronic diseases, we tested whether BA could serve as an effective biomarker for predicting disease risk and progression beyond CA. Health outcomes, including chronic diseases, acute conditions, and mortality, were selected based on their impact on organ aging and sufficient sample sizes for statistical testing. Kaplan-Meier survival curves for left-truncated and right-censored data were computed using birth as the start point, imaging appointments (starting in 2014) as the individual entry date, and the most recent data retrieval in 01/2024 as the endpoint. Additionally, we used univariate Cox proportional hazard regression to quantitatively compare survival distributions between accelerated (coded as 2), typical (coded as 1), and decelerated aging groups (coded as 0). To provide an interpretable continuous effect size, we additionally performed univariate Cox regression using continuous PAG values instead of categorical aging groups to estimate risk per 1 year increase in predicted age gap. We report the hazard ratios (HR), the confidence intervals (CI), and p-values. Statistical significance is defined as p < 2.27 × 10−3 (N = 22 tests).

Kaplan-Meier-curves of event-free survival rate. Curves are shown for accelerated aging (>), typical aging (∣∣ ≤), and decelerated aging (< −) of the brain, heart, and pancreas, for the estimated predicted age gap (PAG). P A G S D P A G S D P A G S D
Prognostic value of PAGs for health outcomes
For cardiovascular conditions, including myocardial infarction and chronic ischemic heart disease, we observed a strong, graded association between accelerated heart aging and the onset of these diseases, suggesting that accelerated heart aging is a significant risk factor for cardiovascular conditions. Comparing survival rates between accelerated, typical, and decelerated aging groups revealed a higher hazard for accelerated aging, particularly for the heart (acute myocardial infarction: HR: 2.15, 95% CI: 1.87–2.48, p = 2.63 × 10−26; chronic ischemic heart diseases: HR: 1.88, 95% CI: 1.74–2.03, p = 5.15 × 10−60). Continuous analysis showed that each 1 year increase in heart PAG was associated with a 12% increase in risk for acute myocardial infarction (HR: 1.12, 95% CI: 1.10–1.15, p = 4.09 × 10−27) and an 11% increase in risk for chronic ischemic heart disease (HR: 1.11, 95% CI: 1.10–1.12, p = 5.53 × 10−75).
Regarding kidney health, we investigated chronic kidney disease. A clear systemic trend was observed for accelerated aging in both the left and right kidney. Quantitative analysis further confirmed that individuals with accelerated renal aging had a significantly higher risk of chronic kidney disease, whereas those in the slower aging group exhibited a lower risk (left: HR: 2.60, 95% CI: 2.30–2.93, p = 4.16 × 10−53; right: HR: 2.71, 95% CI: 2.40–3.07, p = 7.23 × 10−56). Continuous PAG analysis revealed that each 1 year increase in left and right kidney PAG was associated with a 19% increase in chronic kidney disease risk, respectively (left: HR: 1.19, 95% CI: 1.16–1.21, p = 4.65 × 10−63; right: HR: 1.19, 95% CI: 1.17–1.22, p = 2.68 × 10−66).
To further evaluate whether organ-specific PAGs provide prognostic information beyond conventional risk factors, we performed a multivariable Cox analysis for type 2 diabetes mellitus, adjusting for chronological age, sex, and BMI. The likelihood ratio test indicated that including all PAGs significantly improved model fit compared with the clinical model alone (p = 8.26 × 10−3), demonstrating that imaging-derived organ aging adds independent predictive value and may complement traditional risk factors.
Together, these findings demonstrate that organ-specific predicted age gaps robustly reflect disease susceptibility, with accelerated aging consistently associated with higher risk across neurological, metabolic, cardiovascular, and renal outcomes.

Kaplan-Meier-curves are shown for accelerated aging (>), typical aging (∣∣ ≤), and decelerated aging (< −) across different organs for the estimated predicted age gap (PAG) and for various diseases. P A G s S D P A G S D P A G s S D Curves are shown for brain aging in patients with Alzheimer’s disease, brain, heart, and pancreas aging in type 2 diabetes mellitus, heart aging in myocardial infarction and chronic ischemic heart disease, and kidney aging in chronic kidney disease.
Discussion
In this study, we conducted a comprehensive imaging-based analysis of biological aging across seven organs, leveraging deep learning techniques to estimate BA from MRI and OCT imaging data. Our approach is purely image-driven, requiring no additional clinical or laboratory information. It thereby provides an objective and automated method for quantifying aging, with the potential to turn this into a fast aging screening method. A large-scale preprocessing pipeline was employed, including automatic organ segmentation, quality control, and intensity normalization. Organ-specific age regression models were trained on a healthy cohort to extract and learn aging-related features without the need for manually predefined biomarkers.
Our work represents an imaging-based quantification of aging across multiple organ systems within the whole body. This approach offers significant advantages over conventional methods using manually curated or preprocessed features, as it eliminates biases introduced by feature selection. By relying solely on imaging data, we ensure a consistent and scalable methodology applicable across large populations. The robustness of our models is further supported by their application to an extensive dataset comprising over 70,000 imaging samples from the UKB, covering the brain, heart, abdomen, and fundus. This large sample size enhances the reliability and generalizability of our BA estimates, facilitating more precise assessments of individual aging trajectories.
Our results demonstrate the feasibility of predicting BA even in organs where aging effects are not easily discernible, such as the pancreas, spleen, and liver. This highlights the sensitivity of deep learning models in capturing subtle structural and functional changes associated with aging. The brain, in particular, provided the most consistent aging patterns, likely due to its prominently observable volumetric changes in gray and white matter. Additionally, brain imaging benefits from greater homogeneity and the standardization of data through MNI152 space mapping53, making age regression less complex and age patterns more easily observable. To improve transparency, we utilized attention maps to visualize the imaging regions most relevant to BA predictions. These highlighted areas align with established literature on age-related organ changes, further validating the reliability of our approach.
Consistent with previous studies18,54, we observed significant heterogeneity in organ aging, reinforcing the necessity of multi-organ analyses. While biological age estimates for different organs were largely independent, we noted expected correlations in bilateral organs such as the kidneys and fundus. Additionally, weak but notable associations were identified among abdominal organs and between the heart and abdominal systems. These findings suggest the potential for identifying clusters of organs that share aging trajectories, which may offer new insights into systemic aging processes.
Beyond characterizing aging patterns, our study highlights the prognostic value of BA estimates. We found that accelerated aging in multiple organs serves as a strong predictor for mortality and disease outcomes. Specifically, accelerated brain aging was identified as a robust predictor for Alzheimer’s disease, while accelerated cardiac aging correlated with an increased risk of myocardial infarction and chronic ischemic heart disease. Similarly, renal aging was a significant risk factor for chronic kidney disease, and accelerated aging in various organs was associated with a higher likelihood of developing type 2 diabetes mellitus. These results underscore the clinical relevance of imaging-derived biological age estimates, potentially informing early interventions and personalized treatment strategies.
Despite the promising results, our study has several limitations. Defining thresholds for accelerated or decelerated aging remains challenging due to the absence of a universally accepted reference standard. We hypothesize that integrating clinical or laboratory biomarkers with imaging data in a multi-modal model could enhance the detection and characterization of decelerated aging while minimizing reliance on hand-crafted features. Furthermore, while our model performed well in predicting BA across a wide range of organs, additional validation in diverse, independent cohorts is necessary to confirm the generalizability of our findings. The current cohort primarily consists of participants over the age of 40 and is characterized by imbalances in ethnicity and socioeconomic status. These characteristics may bias the learned “normative aging patterns” toward those typical of overrepresented groups and introduce domain shift when applying the model to external populations. Such demographic imbalances could confound BA predictions by reflecting lifestyle, healthcare access, or disease prevalence patterns specific to these groups. Lower prediction accuracy observed for organs such as the heart and fundus at age extremes may partly reflect insufficient representation of younger or very old participants. Therefore, the current BA predictions should be interpreted with caution when applied to populations that differ substantially from the study cohort. As a next step, we plan to validate our findings in the German National Cohort study33, which includes younger participants (>20 years). Nevertheless additional validations are required in more diverse demographic cohorts to obtain reliable biological age biomarkers that can in the future be translated into clinical practice. Finally, the independent prognostic value of imaging-derived BA relative to established clinical risk factors remains to be systematically assessed. Conducting such comparisons will be an important next step toward establishing PAGs as robust, clinically actionable biomarkers.
While imaging-derived BA may form a biomarker, its prognostic value is not yet firmly established. Many potential confounders, including lifestyle, comorbidities, medication, and technical factors such as scanner differences, may influence predictions and need to be systematically controlled. Moreover, the extent to which imaging-derived BA provides independent prognostic information beyond conventional risk factors remains to be validated. At the same time, this represents a unique opportunity: by disentangling these confounders and systematically linking imaging-derived BA to clinical outcomes, future work can establish a more reliable and clinically actionable biomarker of aging. While much research has focused on brain aging and its links to mortality and disease, our results show that aging occurs heterogeneously across the body. Age gaps in other organs also provide meaningful prognostic information, highlighting that a multi-organ perspective can capture risk profiles more comprehensively than brain age alone. These findings underscore the value of considering organ-specific aging patterns in developing personalized prevention and treatment strategies.
In conclusion, this study presents a novel, imaging-only deep learning approach for estimating biological age across multiple organs, enabling comprehensive and objective aging assessment. Leveraging over 70,000 MRI and OCT scans from the UK Biobank, our models demonstrated strong performance in capturing aging patterns across organs, including those where age-related changes are less apparent. The resulting PAGs showed strong prognostic value for disease and mortality, highlighting their potential for clinical use and personalized health monitoring. With further validation and systematic comparison to conventional risk factors, PAGs have the potential to serve as actionable biomarkers, helping to translate imaging-derived biological age into practical tools for early detection and individualized healthcare.
Methods
Population study
The UKB cohort serves as a data source in this study, which entails a large-scale collection of health-related information from participants aged 44 to 83 years in the UK. In addition to collecting data on environmental and lifestyle factors, medical history, biological samples, and physical tests, an extensive imaging study has been made accessible. Written informed consent was given from all participants with ethical approval from the NHS National Research Ethics Service North West (11/NW/0382, initially granted in 2011 and renewed in 2016 and 2021), and data are handled in accordance with the Data Protection Act55. This work was carried out under UK Biobank Application 60520 with a data retrieval date of 01/2024. Apart from imaging data, we used non-imaging information from NHS health records, including health outcomes extracted from ICD codes, and detailed demographic data. The study complied with the principles of the Declaration of Helsinki.
Imaging protocols
T1-weighted brain images were obtained using a 3D MPRAGE sequence on a 3T MAGNETOM Skyra scanner (Siemens Healthineers, Erlangen, Germany). For abdominal images, a 3D DIXON VIBE sequence was employed on a 1.5 T MAGNETOM Aera scanner (Siemens Healthineers, Erlangen, Germany). 2D short-axis cardiac cine MRIs were acquired as part of the UKB’s cardiovascular magnetic resonance (CMR) protocol with a bSSFP sequence on a 1.5 T MAGNETOM Aera scanner (Siemens Healthineers, Erlangen, Germany), employing electrocardiogram (ECG) gating for cardiac synchronization. Fundus images were obtained with a 3D OCT device (Topcon 3D OCT1000 Mark II, Topcon Corp., Tokyo, Japan). All MR data in the UKB are available as gray-scale magnitude images with a single input channel, except for abdominal images, which provide the four DIXON contrasts (in-phase, opposed-phase, fat, water) as input. For the fundus OCT scans, the input images consist of three RGB color input channels. An overview of important protocol parameters is given in Fig.. S1
Data preprocessing
Automatic organ segmentation using the nnUNet37,38 was performed on the 3D abdominal MRIs to extract organ-focused images of the left and right kidneys, liver, spleen, and pancreas. Bounding boxes of the organs were determined to exclude irrelevant background. A tolerance margin was added to the outer boundaries of the segmentation masks. Cardiac segmentation was performed using the pre-trained model as proposed in ref. 39. The dimensions of the heart bounding box were determined in the end-diastolic phase. Dynamic 2D images (i.e., 2D + time) of a single slice were provided as input to the network with slice coverage over the complete heart, after image normalization to values between 0 and 1.
Aside from obtaining regional images, the segmentation framework was also used to provide initial indications of poor image quality by marking samples with strongly diverging segmentation volumes. Based on this pre-sorting, the remaining images were assessed visually with respect to their quality. No segmentation was necessary for fundus and brain images. Brain images were pre-processed by skull stripping using the FSL tool56 and MNI152 space mapping.
Biological age estimation
In our previous work, we proposed an uncertainty-aware ResNet-based structure for brain age estimation19. To predict organ-specific ages in this work, we trained the model for all organs separately. The complete dataset was partitioned into healthy and pathological subcohorts using organ-specific ICD codes provided by the NHS, as outlined in Fig. S3. The healthy subcohort was further split into 80% for training and 20% for testing. All models were trained exclusively on the healthy training set, and performance was evaluated on both the healthy test set and the pathological subcohort. The model architecture is kept simple to avoid overconfident predictions by a too powerful architecture. As illustrated in Fig. S2, the model consists of three sequential building blocks, combining convolutional and instance normalization layers with skip connections. Three-dimensional processing blocks were selected for the MRI data, while two-dimensional layers were chosen for fundus images. Morphological and physiological variations among individuals can lead to inconsistencies in the chronological age labels. Visually similar organs may have different CAs, introducing statistical (aleatoric) and input-dependent (heteroscedastic) uncertainty into age estimates. We model this uncertainty by replacing the deterministic age predictions with a heteroscedastic Gaussian predictive distribution of the estimated age, characterized by the mean μ(x) and the logarithm of the standard deviation \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\log \sigma (x)$$\end{document}logσ(x) for a given organ image x. Thus, for the CA label y conditioned on the input image x and model parameters θ we obtain19,57:1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho (y| x,\theta )={\mathcal{N}}(y;\mu (x,\theta ),\sigma (x,\theta ))$$\end{document}ρ(y∣x,θ)=N(y;μ(x,θ),σ(x,θ))The standard deviation σ(x, θ) of the Gaussian predictive distribution signifies the degree of uncertainty, while μ(x, θ) represents the age estimate. The model parameters are optimized by minimizing the negative log-likelihood2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal{L}}({\mathcal{B}},\theta )=\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}\frac{1}{2\sigma {({x}_{i},\theta )}^{2}}{({y}_{i}-\mu ({x}_{i},\theta ))}^{2}+\log \sigma ({x}_{i},\theta )$$\end{document}L(B,θ)=1N∑i=1N12σ(xi,θ)2(yi−μ(xi,θ))2+logσ(xi,θ)for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal{N}}$$\end{document}N independent and identically distributed samples of mini-batch \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathcal{B}}$$\end{document}B.
For training, 4× 48GB RTX A6000 GPUs were available. Hyperparameters were adopted from19 where hyperparameter optimization was performed. Sample sizes vary across organs since not all participants underwent every imaging protocol, and artifact-corrupted organ images were excluded through additional manual quality control measures. Training was performed for 200 epochs using the Adam optimizer58. Further training parameters are stated in Fig. S2.
Predicted age gaps
The difference between predicted BA and CA often serves as an indicator of biological age. To accurately compute the PAGs, it is necessary to address the inherent biases in the age estimation model, known as regression to the mean bias, which tends to overestimate ages for younger individuals and underestimate ages for older individuals59. Traditional linear corrections model the predicted age gap solely as a function of chronological age. While effective at removing age-dependent bias, they can inadvertently remove variance linked to sex, disease status, or other biological factors, potentially masking clinically relevant differences. To address this limitation, we chose a correction method that explicitly separates the linear effect of CA from other sources of variation. To implement this, we first modeled the median of the PAGs (ΔAi) as a function of chronological age \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${A}_{i}^{CA}$$\end{document}AiCA and additional covariates (sex, existing pathology, and future pathology), whose influence we aim to preserve:3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{Median}}(\Delta {A}_{i}| {X}_{i})={\beta }_{0}+{\beta }_{1}^{\top }{X}_{i}$$\end{document}Median(ΔAi∣Xi)=β0+β1⊤Xiwhere Xi denotes the set of covariates. We chose median regression because the median is less sensitive to outliers than the mean, ensuring that extreme predictions do not distort the correction. Predicted ages \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{A}}_{i}^{{\mathrm{pred}}}$$\end{document}A^ipred were then corrected relative to a reference age \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${A}_{i}^{{\mathrm{ref}}}$$\end{document}Airef (the cohort median) by subtracting the linear age effect:4\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\widehat{A}}_{i}^{{\mathrm{pred,corr}}}={\widehat{A}}_{i}^{{\mathrm{pred}}}-{\beta }_{{1}_{age}}\cdot [{A}_{i}^{{\mathrm{CA}}}-{A}_{i}^{{\mathrm{ref}}})]$$\end{document}A^ipred,corr=A^ipred-β1age⋅[AiCA-Airef)]where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\beta }_{{1}_{age}}$$\end{document}β1age is the regression coefficient corresponding to chronological age. Anchoring to the reference age means that the correction is centered on the cohort median, removing systematic over- or underestimation relative to this central point. After correcting for this bias, we calculate the PAGs for all organs in each subject within the test dataset, allowing us to gain insights into the aging patterns across multiple organs for each individual.
Survival analysis
To evaluate whether the predicted age gaps can effectively predict age-related outcomes, we employ Kaplan-Meier survival analysis. This statistical method is widely used in clinical research to assess the effectiveness of treatments by observing the survival rates over time. Typically, survival is defined by the absence of death; however, any health outcome with a distinct start date can serve as the criterion to differentiate between surviving and non-surviving cases. We exploit the Kaplan-Meier plot to analyze the differences in survival rates based on PAGs, thereby evaluating their potential as indicators for age-related health outcomes, like myocardial infarction. Instead of investigating the effects of a certain intervention, we group subjects according to their predicted age gap into accelerated (PAG > SD) and decelerated aging (PAG < − SD) and analyze the differences in survival.
First, we exclude participants who experienced the health event prior to their imaging, as our focus is on the prognostic capability of the predicted age gaps rather than retrospective detection. Consequently, our subcohort includes participants who either have not experienced the health event (event = 0) or have experienced the health outcome after their imaging scan with a clearly defined time to the event (event = 1). We use the age at imaging as the starting point for each subject and the age at our data retrieval as the study’s endpoint. Individuals who died from causes other than the targeted health event during the study period are marked as censored observations. We plot the cumulative probability of the event’s occurrence across chronological age, grouped by predicted age gaps. The rate at which the curve decreases reflects the survival duration within each group, offering insights into the influence of accelerated or decelerated aging on the likelihood of the event over time.
To quantitatively assess differences in survival between aging groups, we apply a univariate Cox proportional hazards regression model using aging group classification as the predictor variable. Aging groups are encoded as 0 (decelerated), 1 (typical), and 2 (accelerated). The analysis is performed on left-truncated and right-censored data, prepared consistently with the Kaplan-Meier analysis. In addition to categorical aging groups, we also perform univariate Cox regression using the continuous PAG as the predictor variable. This allows estimation of hazard ratios per 1 year increase in PAG, providing a more interpretable effect size for clinical risk assessment. Continuous PAG values are analyzed using the same left-truncated and right-censored setup as the Kaplan-Meier and group-based Cox analyses.
To test whether PAGs add predictive value beyond conventional clinical risk factors, we performed multivariable Cox regression adjusting for chronological age, sex, and BMI, focusing on type 2 diabetes due to its multi-organ impact. Likelihood ratio tests were used to compare models with and without PAGs, allowing assessment of their independent prognostic contribution.
Supplementary information
Supplementary Information