Systematic evaluation of DNA methylation age estimation with common preprocessing methods and the Infinium MethylationEPIC BeadChip array

Oct 18, 2018, Clinical Epigenetics

Evaluating DNA age estimates using common data processing methods and a popular methylation test

AI simplified

Abstract

DNAm age estimates from the EPIC array show a median absolute difference of 1.44–3.10 years compared to the 450K array.

  • DNAm age is highly correlated across raw and preprocessed data from both the 450K and EPIC platforms (r > 0.91).
  • The correlation between chronological age and DNAm age estimates is largely unaffected by differences in measurement platform and normalization method.
  • The choice of normalization method can introduce a systematic offset in DNAm age estimates, increasing median error.
  • The epigenetic clock remains effective despite missing 19 CpG sites on the EPIC array.
  • The measure of epigenetic age acceleration is robust when considering different normalization methods and measurement platforms.
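
The two summary statistics quoted above (a correlation coefficient and a median absolute difference between paired estimates) can be sketched as follows. This is a minimal illustration, not the study's analysis code: the function names and the five paired values are hypothetical and chosen only to show the arithmetic.

```python
from math import sqrt
from statistics import mean, median


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of numbers."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def median_abs_diff(x, y):
    """Median of the absolute pairwise differences between x and y."""
    return median(abs(a - b) for a, b in zip(x, y))


# Hypothetical paired DNAm age estimates (years) for the same five samples,
# one list per platform. These are NOT values from the study.
age_450k = [45.0, 52.0, 60.0, 68.0, 75.0]
age_epic = [47.0, 53.5, 62.5, 69.0, 78.0]

r = pearson_r(age_450k, age_epic)
mad = median_abs_diff(age_450k, age_epic)
print(f"r = {r:.3f}, median absolute difference = {mad:.2f} years")
```

Note that a high correlation and a non-zero median absolute difference can coexist: a roughly constant shift between platforms leaves the correlation close to 1 while still moving every estimate by a year or more.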

Key numbers

  • r > 0.91 (correlation coefficient): correlation between DNAm age estimates from the 450K and EPIC arrays.
  • 1.44–3.10 years (median absolute difference): difference in DNAm age estimates between the EPIC and 450K arrays, depending on the preprocessing method.

Full Text

What this is

  • This research evaluates the accuracy of DNAm age estimation using the EPIC array compared to the older 450K array.
  • It examines the impact of different data preprocessing methods on age estimates derived from DNA methylation data.
  • The study finds that despite missing probes on the EPIC array, age predictions remain reliable across various preprocessing techniques.

Essence

  • DNAm age can be accurately estimated using the EPIC array, even though 19 CpG sites present on the previous 450K array are missing. The choice of preprocessing method influences age estimates, but correlations with chronological age remain strong.

Key takeaways

  • The correlation between age estimates from 450K and EPIC arrays is high (r > 0.91), indicating consistency across platforms.
  • Median absolute differences in age estimates between the two arrays range from 1.44 to 3.10 years, influenced by preprocessing methods.
  • Using age acceleration residuals rather than age acceleration differences provides more reliable comparisons across studies, as it accounts for preprocessing effects.
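
The distinction in the last takeaway can be illustrated with a small sketch. It assumes the common definitions: the age acceleration difference is DNAm age minus chronological age, and the age acceleration residual is the residual from a linear regression of DNAm age on chronological age. The function names and all numbers are hypothetical; the 3-year shift simply stands in for a systematic offset introduced by a preprocessing method.

```python
from statistics import mean


def acceleration_difference(dnam_age, chron_age):
    """Age acceleration difference: DNAm age minus chronological age."""
    return [d - c for d, c in zip(dnam_age, chron_age)]


def acceleration_residual(dnam_age, chron_age):
    """Residuals from an ordinary least squares fit of DNAm age on chronological age."""
    mx, my = mean(chron_age), mean(dnam_age)
    slope = (
        sum((c - mx) * (d - my) for c, d in zip(chron_age, dnam_age))
        / sum((c - mx) ** 2 for c in chron_age)
    )
    intercept = my - slope * mx
    return [d - (intercept + slope * c) for d, c in zip(dnam_age, chron_age)]


# Hypothetical data (years); not values from the study.
chron = [40.0, 50.0, 60.0, 70.0, 80.0]
dnam = [42.0, 49.0, 63.0, 68.0, 83.0]
# Same samples with an assumed constant +3 year offset from preprocessing.
dnam_shifted = [d + 3.0 for d in dnam]

diff_a = acceleration_difference(dnam, chron)
diff_b = acceleration_difference(dnam_shifted, chron)
res_a = acceleration_residual(dnam, chron)
res_b = acceleration_residual(dnam_shifted, chron)

print("differences shift by:", [round(b - a, 6) for a, b in zip(diff_a, diff_b)])
print("residuals shift by:  ", [round(b - a, 6) for a, b in zip(res_a, res_b)])
```

In this sketch the constant offset is absorbed by the regression intercept, so the residuals are unchanged while every difference moves by the full offset.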

Caveats

  • The study primarily uses monocyte samples, which may not reflect the importance of missing CpGs in other tissues for age estimation.
  • The results are based on specific preprocessing methods; generalizing these findings to all datasets may not be appropriate.

Definitions

  • DNA methylation (DNAm): Covalent addition of a methyl group to DNA, primarily at cytosine-guanine dinucleotides (CpGs), affecting gene expression.
  • Epigenetic clock: A predictive model estimating biological age based on DNA methylation patterns across multiple CpG sites.
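
As a rough illustration of the epigenetic clock definition, the sketch below treats a clock as an intercept plus a weighted sum of methylation values (beta values between 0 and 1). Everything in it is an assumption made for illustration: the site names, weights, intercept, and the fallback values used for sites absent from an array are invented, and published clocks typically use hundreds of sites and may apply additional calibration steps. The source does not describe how missing sites are handled, so the fallback here is just one possible approach.

```python
def predict_dnam_age(betas, weights, intercept, fallback=None):
    """Toy linear clock: intercept + sum(weight * beta) over the clock's CpG sites.

    betas    -- measured methylation values, keyed by CpG site name
    weights  -- hypothetical clock coefficients, keyed by CpG site name
    fallback -- optional reference values used for sites missing from `betas`
    Returns the predicted age and the list of clock sites that were not measured.
    """
    fallback = fallback or {}
    missing = [cpg for cpg in weights if cpg not in betas]
    total = intercept
    for cpg, weight in weights.items():
        value = betas.get(cpg, fallback.get(cpg))
        if value is None:
            continue  # site absent and no fallback given: it contributes nothing
        total += weight * value
    return total, missing


# Invented three-site clock; real clocks are far larger.
weights = {"cpg_A": 40.0, "cpg_B": -20.0, "cpg_C": 10.0}
intercept = 30.0

full_array = {"cpg_A": 0.5, "cpg_B": 0.25, "cpg_C": 0.5}
reduced_array = {"cpg_A": 0.5, "cpg_B": 0.25}  # cpg_C not on this array
reference = {"cpg_C": 0.4}  # assumed reference value for the missing site

age_full, missing_full = predict_dnam_age(full_array, weights, intercept)
age_reduced, missing_reduced = predict_dnam_age(
    reduced_array, weights, intercept, fallback=reference
)
print(age_full, missing_full)
print(age_reduced, missing_reduced)
```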
