Quantification of the pace of biological aging in humans through a blood test: the DunedinPACE DNA methylation algorithm

Measures to quantify changes in the pace of biological aging in response to intervention are needed to evaluate geroprotective interventions for humans. Here, we report an advance on our original method (Belsky et al. 2020). We used data from the Dunedin Study 1972-3 birth cohort tracking within-individual decline in 19 organ-system integrity indicators across four timepoints spanning two decades to model Pace of Aging. We distilled two-decade Pace of Aging into a single-time-point DNA-methylation blood-test using elastic-net regression and DNA-methylation data restricted to exclude probes with low test-retest reliability. The resulting measure, DunedinPACE, showed high test-retest reliability, was associated with functional decline, morbidity, and mortality, and indicated accelerated Pace of Aging in young adults with childhood adversity across five datasets. DunedinPACE effect-sizes were similar to GrimAge-clock effect-sizes and larger than those for other benchmark DNA-methylation-clocks. DunedinPACE is a novel blood biomarker of the pace of aging for gerontology and geroscience


INTRODUCTION
Guided by research on the cellular hallmarks of aging, model-organism studies are yielding treatments that slow the accumulation of deterioration in cells and organs (Barzilai et al., 2018;Bellantuono, 2018;Campisi et al., 2019). However, human geroprotector trials face a barrier when it comes to outcome measures: humans' long lifespan makes it prohibitively timeconsuming to test whether a treatment extends healthspan (Justice et al., 2018). What is needed is a way to measure each clinical trial participant's personal pace of biological aging, before, during, after, and at long-term follow-up, to test whether a gero-protective therapy slows that pace, and whether benefits fade out (Moffitt et al., 2017;Sierra et al., 2021).
An ideal research design for isolating biological patterns that differentiate faster and slower aging is longitudinal follow-up of a cohort of individuals who are all the same chronological age. We originally applied this design to test the geroscience-derived hypothesis that the process of biological aging was ongoing in healthy young adults and, critically, that it was already variable decades before most chronic diseases onset Elliott et al., 2021). We studied a single-year birth cohort of all 1,037 babies born in the city of Dunedin, New Zealand, during 1972-73 and followed-up through midlife. Because aging can be understood as gradual, progressive deterioration simultaneously affecting different organ systems, we tracked declines in the cardiovascular, metabolic, renal, hepatic, immune, periodontal, and pulmonary systems of each participant using multiple biomarker measurements of each organ system, first at ages 26, 32, and 38 years, and subsequently including a fourth assessment at age 45 years. Modelling this one-of-a-kind dataset yielded a metric which quantified how slowly or rapidly each participant had been aging. We called this metric the Pace of Aging.
People with faster Pace of Aging tend to experience more rapid aging-related decline in physical and cognitive functions and develop early signs of brain aging Elliott et al., , 2021. This evidence suggests that intervention to slow Pace of Aging could preserve functions lost with aging and extend healthspan. However, Pace of Aging takes many years to measure and requires data from several types of clinical and biological assays, limiting its utility in clinical trials. To address this limitation, we applied machine-learning tools to genome-wide DNA methylation data from Dunedin cohort members' blood samples to distill the multi-timepoint, multi-assay Pace of Aging measure into a singletimepoint, single-assay blood biomarker: DunedinPoAm (for Dunedin Study Pace of Aging from methylation) (Belsky et al., 2020). DunedinPoAm was designed to measure Pace of Aging biological change over time from a single blood sample. Like the original Pace of Aging, people with faster DunedinPoAm scores more often experienced declines in cognitive and physical functioning by midlife and showed more rapid facial aging (Belsky et al., 2020); in older adults, faster DunedinPoAm predicted increased risk of disease and death (Belsky et al., 2020;Graf et al., In Press); in young people, experiences of early-life adversity were linked to faster DunedinPoAm (Belsky et al., 2020;Raffington et al., 2021).
DunedinPoAm provided proof of concept that Pace of Aging could be measured from a single blood test. But DunedinPoAm was based on biological change observed over just twelve years spanning the transition from young adulthood to midlife, limiting the scope of biological change that could be observed. And this biological change was measured from just three assessment waves, limiting the precision. In addition, similar to other DNA methylation composites measured from microarray data, the measure had only moderate test-retest reliability (Higgins-Chen et al., 2021), limiting its value to clinical trials seeking to test within-participant changes from pre-treatment baseline to post-treatment follow-up. To address these limitations, we incorporated new data from the Dunedin cohort extending Pace of Aging follow-up to include a fourth measurement occasion in the fifth decade of life (Elliott et al., 2021) and restricted DNA-methylation data to exclude probes with poor reliability (Sugden et al., 2020) to refine our DNA-methylation biomarker of Pace of Aging. The longer follow-up period, which allowed us to observe more aging-related change in system integrity, and the additional measurement occasion included in analysis improve the power and precision of our modeling of longitudinal change in system integrity (Brandmaier et al., 2018;Rast and Hofer, 2014). The refined DNA-methylation dataset used for machine-learning improves reliability of measurement. We name the novel algorithm DunedinPACE, for Dunedin (P)ace of (A)ging (C)alculated from the (E)pigenome.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

RESULTS
We developed the DunedinPACE measure reported here by analyzing the pace of biological aging in a 1972-73 birth cohort (N=1037), the Dunedin Study. This analysis consisted of two parts. The first part of analysis followed our original method  to quantify Pace of Aging from two decades of longitudinal organ-system integrity data (Elliott et al., 2021). First, we measured longitudinal changes in 19 biomarkers assessing cardiovascular, metabolic, renal, hepatic, immune, dental, and pulmonary systems, measured at ages 26,32,38 and, most recently in 2019, 45 years (Supplementary Table S1 provides measurement details about the variables). Second, linear mixed-effects modelling was used to fit growth models to estimate each study member's personal rate of change for each of the 19 biomarkers. Third, these personal rates of change were combined across biomarkers to calculate each Study member's personal Pace of Aging. In line with the geroscience hypothesis (Kaeberlein, 2013;Kennedy et al., 2014), which states that aging represents progressive decline across organ systems, we calculated each Study member's Pace of Aging as the sum of age-dependent annual changes across all biomarkers: Pace of Agingi = ∑ !"# #$ μ1iB. The resulting Pace of Aging was then scaled to a mean of 1, so that it could be interpreted with reference to an average rate of 1 year of biological aging per year of chronological aging. Study members showed wide variation in their Pace of Aging (Mean=1 biological year per chronological year, SD=0.29). Over the two decades that we measured biological aging, the Study member with the slowest Pace of Aging aged by just 0.40 biological years per chronological year, while the Study member with the fastest Pace of Aging accrued 2.44 biological years per chronological year (Elliott et al., 2021).
The second part of the analysis followed our original method to distill Pace of Aging into a blood test (Belsky et al., 2020). Using Illumina EPIC array DNA methylation data from blood collected at age 45, we developed DunedinPACE to provide a surrogate for the 20-year Pace of Aging. Briefly, we used elastic-net-regression (Zou and Hastie, 2005) to develop a DNA methylation algorithm to predict the 20-year Pace of Aging. Analysis included the subset of probes included on both the Illumina 450k and EPIC arrays which we previously determined to have acceptable test-retest reliability (ICC>0.4, n=81,2392) (Sugden et al., 2020). Following the method established by Horvath (Horvath, 2013) and used in our prior analysis, we fixed the alpha parameter of the elastic net at 0.5. The final lamda value selected by the analysis was . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi. org/10.1101org/10. /2021 0.042. The resulting algorithm included 173 CpG sites. The CpGs included in DunedinPACE are listed in Supplemental Table S2  We also repeated analysis of decline in physical and cognitive functions included in our original report of DunedinPoAm. This analysis tested associations between DunedinPACE and worsening in physical functions and subjective signs of aging between the age-38 and age-45 assessments and change in cognitive functioning from adolescent baseline to the age-45 assessment. Effect-sizes are graphed in Figure 1c.

DunedinPACE shows exceptional test-retest reliability.
A Pace of Aging measure should have high test-retest reliability in order to be able to quantify change from pre-treatment baseline to post-treatment follow-up in intervention studies or exposure-related changes in longitudinal observational studies. We first evaluated test-retest reliability of DunedinPACE using a publicly . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted September 2, 2021.

DunedinPACE indicates a faster Pace of Aging in chronologically older individuals.
In demography, the pace of biological aging in a population can be estimated from the rate of increase in mortality risk from younger to older chronological ages. In humans and many other species, mortality risk increases exponentially with advancing chronological age, suggesting that the pace of biological aging accelerates as we grow older (Gompertz, 1825;Kirkwood, 2015;Olshansky and Carnes, 1997). We tested if DunedinPACE indicated faster Pace of Aging in chronologically older as compared to younger individuals in data from the Understanding Society Study (n=1175, age range 28-95). Chronologically older Understanding Society participants had faster DunedinPACE Pace of Aging as compared to younger ones (r=0.32,

Figure 3 Panel A).
This correlation was threefold larger as compared to the correlation for the original DunedinPoAm (r=0.11).
Between-individual correlation of older chronological age with faster DunedinPACE could be confounded by differences in exposure histories across birth cohorts, i.e. not an effect of within-individual aging (Moffitt, 2020). To rule out such confounding, we conducted analysis of within-individual change in DunedinPACE in the US Veterans Administration Normative Aging Study (NAS). Among NAS participants with repeated-measures DNA methylation data (n=536), . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted September 2, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 DunedinPACE increased by b=0.021 (SE=0.003) per 5 years of follow-up, or approximately 0.2 standard deviation units, similar to what we previously reported for the original DunedinPoAm (Belsky et al., 2020).
DunedinPACE shows faster Pace of Aging in individuals measured to be biologically older using prior measures derived from DNA methylation and physiological data, and who report poorer self-rated health. We compared DunedinPACE with the epigenetic clocks proposed by Horvath, Hannum et al., Levine et al., and Lu et al. (Hannum et al., 2013;Horvath, 2013;Levine et al., 2018;Lu et al., 2019), with measurements of biological age derived from physiological data, and with participants' subjective perceptions of their own health status in data from the UK-based Understanding Society Study (n=1175, mean age=58, SD=15, 42% male).
Epigenetic Clock Analysis. For each epigenetic clock, we calculated the measure referred to as epigenetic "age acceleration" by regressing participants' clock-estimated ages on their chronological ages and computing residual values. This measure is often used as an estimate of the aging rate. It quantifies the difference between the amount of aging expected based on a person's chronological age and the amount of aging observed based on DNA methylation. DunedinPACE was weakly correlated with age acceleration computed from the Horvath clock (r=0.13) and somewhat more strongly correlated with age acceleration computed from the Hannum and Levine clocks (Hannum r=0.26;PhenoAge r=0.36). These correlations were similar to what we observed for the original DunedinPoAm measure. For the GrimAge clock proposed by Lu et al., the correlation with the age-acceleration residuals was stronger (r=0.58).
Correlations are plotted in Figure 3 Panel B.
Physiology-based Measurements of Biological Age. We used published methods to measure participants' biological ages from blood-chemistry, blood-pressure, and lung function data collected at the same time as the DNA methylation data. We computed indices of biological age using three different methods: the Klemera and Doubal method (KDM) Biological Age estimates the age at which a person's physiology would appear normal in a reference population (Klemera and Doubal, 2006); the Phenotypic Age method estimates the age at which a person's physiology-predicted mortality risk would be approximately normal in a reference population (Levine et al., 2018); the homeostatic dysregulation method estimates the extent to which a person's physiology deviates from a state of health (Cohen et al., 2013). These algorithms . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted September 2, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 generate correlated but distinct estimates of biological aging (Hastings et al., 2019;Parker et al., 2019). Algorithms were parameterized using a set of biomarkers available in Understanding Society and included in Levine's original implementation of the Klemera-Doubal Biological Age (Levine, 2013) (albumin, alkaline phosphatase, blood urea nitrogen, creatinine, C-reactive protein, HbA1C, systolic blood pressure, forced expiratory volume in one second) and training data from the US National Health and Nutrition Examination Surveys (NHANES). Consistent with analysis of epigenetic age acceleration, participants with more advanced biological aging DunedinPACE is associated with morbidity and mortality. We conducted analysis of morbidity and mortality in older men in the VA Normative Aging Study (NAS) and in the Framingham Heart Study Offspring cohort.
NAS analysis was conducted using the same methods reported in our original article (Belsky et al., 2020). Briefly, analysis included n=771 older men with average age of 77 years (SD=7) at DNA methylation measurement. Over follow-up from DNA collection in 1999-2013, 46% died and 23% were diagnosed with a new chronic disease, including any of hypertension, type-2 diabetes, cardiovascular disease, chronic obstructive pulmonary disease, chronic kidney disease, and cancer. Participants with faster DunedinPACE were at increased risk for morbidity and mortality (for incident chronic disease morbidity, DunedinPACE HR=1.23 [1.07-1.42]; for . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021.  Figure 4A). Study. The E-Risk Study follows a 1994-95 birth cohort of same-sex twins. Blood DNA methylation data were collected when participants were aged 18 years. We analyzed two exposures associated with shorter healthy lifespan, childhood low socioeconomic status and childhood victimization. Socioeconomic status was measured from data on parents' education, occupation, and income (Trzesniewski et al., 2006). Victimization was measured from exposure dossiers compiled from interviews with the children's mothers and home-visit assessments conducted when the children were aged 5, 7, 10, and 12 (Fisher et al., 2015). The dossiers recorded children's exposure to domestic violence, peer bullying, physical and sexual harm by an adult, and neglect. 72% of the analysis sample had no victimization exposure, 21% had one type of victimization exposure, 4% had two types of exposure, and 2% had three or more types of exposure. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint 0.77]; Figure 5). These associations with early-life adversity were robust to covariate adjustment for cell counts estimated from the DNA methylation data, to covariate adjustment for smoking behavior (smoking status and quantity), and to a model specification that excluded smokers (Supplemental Table 4). differences between un-victimized and those with one type of victimization were attenuated below the alpha=0.05 threshold in E-Risk Study analysis including smoking as a covariate; differences between high-and middle-SES groups were attenuated below the alpha=0.05 threshold in E-Risk Study analysis excluding smokers). Compared to the original DunedinPoAm, covariate adjustment for smoking tended to have a smaller impact on effect-size estimates for DunedinPACE. In follow-up analysis of the Understanding Society Study, we found that participant-reported smoking history explained 30% of variance in DunedinPoAm, but only 13% of variance in DunedinPACE. Results were similar in analysis of the Framingham Heart Study Offspring Cohort (R 2 =23% for DunedinPoAm vs. 12% for DunedinPACE).

DISCUSSION
We developed a novel DNA methylation measure of the pace of biological aging from analysis of two decades of longitudinal data from members of the Dunedin Study 1972-3 birth cohort. As in our original report, we utilized a two-step approach that first modeled the Pace of Aging from . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. As a result of these advances, DunedinPACE provides a more precise measurement of the Pace of Aging as compared to the original DunedinPoAm and shows higher test-retest reliability.
Analysis of this more precise and reliable measurement revealed stronger associations with signs of aging. DunedinPACE showed a stronger correlation with chronological age as compared to the original DunedinPoAm, consistent with demographic evidence that biological processes of aging accelerate in later life (Finch and Crimmins, 2016). In analysis of construct and criterion validity, DunedinPACE was associated with functional decline in midlife adults; it was correlated with measures of biological age derived from blood chemistry and DNA methylation data, and with research participants' subjective perceptions of their own health; it was predictive of morbidity and mortality in midlife and older adults; and it indicated faster Pace of Aging in young adults with histories of exposure to poverty and victimization. DunedinPACE effect-sizes were similar to or larger than those for the original DunedinPoAm measure even as DunedinPACE was less influenced by participant smoking history. In analysis that took smoking history into account, DunedinPACE effect-sizes were more robust than those for DunedinPoAm, most notably in analysis of mortality in the Framingham Heart Study and of early-life adversity in the E-Risk cohort.
DunedinPACE effect-sizes in analysis of construct and criterion validity were also larger than those for benchmark DNA methylation clocks proposed by Horvath, Hannum et al., and Levine et al. (PhenoAge), and similar to those for the clock proposed by Lu et al. (GrimAge), and were robust to covariate adjustment for smoking and DNA methylation estimates of cell counts. DunedinPACE therefore represents an advance on the original DunedinPoAm measure . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint and provides a novel tool for geroscience research that can complement existing DNA methylation measures of aging in analysis of observational studies and randomized trials.
We acknowledge limitations. Foremost, the Dunedin Study sample we analyzed to develop DunedinPACE is a relatively modestly sized cohort and is drawn from a single country.
As additional cohorts develop the multi-time-point organ-system-function datasets to model Pace of Aging, integrating these data into DNA methylation algorithms to measure Pace of Aging may enhance precision. Nevertheless, DunedinPACE does show consistent evidence of criterion and construct validity in several additional datasets, boosting confidence in its external validity. In addition, our work on DunedinPACE thus far has not addressed population diversity in biological aging. The Dunedin cohort and the Understanding Society, NAS, Framingham, and E-Risk cohorts were mostly of white European descent. Follow-up of DunedinPACE in more diverse samples is needed to establish cross-population validity. Importantly, our original DunedinPoAm measure has been followed-up in diverse cohorts and has showed evidence of consistent validity across race/ethnic subgroups (Crimmins et al., 2021;Graf et al., In Press;Raffington et al., 2021;Schmitz et al., 2021). As with all proposed measures of biological aging, ultimately establishing DunedinPACE as a surrogate endpoint for healthspan will require evidence that it is modifiable by intervention and that intervention-induced changes result in changes in healthy-lifespan phenotypes (Justice et al., 2018;Prentice, 1989).
Within the bounds of these limitations, our analysis establishes DunedinPACE as a novel single-time-point measure that quantifies Pace of Aging from a blood test. It can be implemented in Illumina 450k and EPIC array data, making it immediately available for testing in a wide range of existing datasets as a complement to existing methylation measures of aging. Because of its high test-retest reliability, DunedinPACE offers a unique measurement for intervention trials and natural experiment studies investigating how the rate of aging may be changed by behavioral or drug therapy, or by environmental modification.

Data Sources
Data were used from five studies: Dunedin Study, Understanding Society Study, the Normative Aging Study (NAS), the Framingham Heart Study, and the Environmental Risk (E-. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint Risk) Longitudinal Twin Study. In addition, we accessed the Gene Expression Omnibus dataset GSE55763. The datasets and measures analyzed within each of them are described below.
The Dunedin Study is a longitudinal investigation of health and behavior in a complete birth cohort. Study members (N=1,037; 91% of eligible births; 52% male) were all individuals born between April 1972 and March 1973 in Dunedin, New Zealand (NZ), who were eligible based on residence in the province and who participated in the first assessment at age 3. The cohort represents the full range of socioeconomic status on NZ's South Island and matches the NZ National Health and Nutrition Survey on key health indicators (e.g., BMI, smoking, physical activity, GP visits) . The cohort is primarily white (93%) . Assessments were carried out at birth and ages 3,5,7,9,11,13,15,18,21,26,32,38 and, most recently, 45 years, when 94% of the 997 study members still alive took part. At each assessment, each study member is brought to the research unit for a full day of interviews and examinations. Study data may be accessed through agreement with the Study investigators (https://moffittcaspi.trinity.duke.edu/research-topics/dunedin). Dunedin Study measures of physical and cognitive functioning and subjective signs of aging are described in detail in Supplemental Table 3. GSE55763 is a publicly available dataset including technical replicate Illumina 450k DNA methylation array data for 36 adult human samples (Lehne et al., 2015). Data are available from the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE55763).
Understanding Society is an ongoing panel study of the United Kingdom population (https://www.understandingsociety.ac.uk/). During 2010-12, participants were invited to take part in a nurse's exam involving a blood draw. Of the roughly 20,000 participants who provided clinical data in this exam, methylation data have been generated for just under 1,200. We analyzed data from 1,175 participants with available methylation and blood chemistry data.
Documentation of the methylation (University of Essex) and blood chemistry (University of Essex) data resource is available online . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
Physiology-based Biological Age Measures. We measured biological aging from blood chemistry, systolic blood pressure, and lung-function data using the algorithms proposed by Klemera and Doubal, Levine, and Cohen (Cohen et al., 2013;Klemera and Doubal, 2006;Levine, 2013;Levine et al., 2018), trained in data from US National Health and Nutrition Examination Surveys (NHANES; https://wwwn.cdc.gov/nchs/nhanes/Default.aspx) following the methods described by Hastings (Hastings et al., 2019) and using the software developed by Kwon (https://github.com/dayoonkwon/BioAge). Following the procedure in our original eLife article (Belsky et al., 2020), we included 8 of 10 biomarkers included in Levine's original implementation of the Klemera-Doubal method Biological Age (Levine, 2013): albumin, alkaline phosphatase (log), blood urea nitrogen, creatinine (log), C-reactive protein (log), HbA1C, systolic blood pressure, and forced expiratory volume in 1 second (FEV1). We omitted total cholesterol because of evidence this biomarker shows different directions of association with aging in younger and older adults (Arbeev et al., 2016). Cytomegalovirus optical density was not available in the Understanding Society database.
Self-rated Health. Understanding Society participants rated their health as excellent, verygood, good, fair, or poor. We standardized this measure to have Mean=0, Standard Deviation=1 for analysis.

The Normative Aging Study (NAS)
is an ongoing longitudinal study on aging established by the US Department of Veterans Affairs in 1963. Details of the study have been published previously (Bell et al., 1972). Briefly, the NAS is a closed cohort of 2,280 male veterans from the Greater Boston area enrolled after an initial health screening to determine that they were free of known chronic medical conditions. Participants have been re-evaluated every 3-5 years on a continuous rolling basis using detailed on-site physical examinations and questionnaires. DNA from blood samples was collected from 771 participants beginning in 1999. We analyzed blood DNA methylation data from up to four repeated assessments conducted through 2013 (Gao et al., 2019;Panni et al., 2016). Of the 771 participants with DNA methylation data, n=536 (70%) had data from 2 repeated assessments and n=178 (23%) had data from three or four repeated assessments. We restricted the current analysis to participants with at least one DNA methylation . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint data point. The NAS was approved by the Department of Veterans Affairs Boston Healthcare System and written informed consent was obtained from each subject before participation.
Mortality. Regular mailings to study participants have been used to acquire vital-status information and official death certificates were obtained from the appropriate state health department to be reviewed by a physician. Participant deaths are routinely updated by the research team and the last available update was on 31 December 2013. During follow-up, n=354 (46%) of the 771 NAS participants died.
Chronic Disease Morbidity. We measured chronic disease morbidity from participants medical histories and prior diagnoses (Gao et al., 2021;Lepeule et al., 2018). We counted the number of chronic diseases to compose an ordinal index with categories of 0, 1, 2, 3, or 4+ of the following comorbidities: hypertension, type-2 diabetes, cardiovascular disease, chronic obstructive pulmonary disease, chronic kidney disease, and cancer. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint assessments. This sample comprised 56% monozygotic (MZ) and 44% dizygotic (DZ) twin pairs; sex was evenly distributed within zygosity (49% male). Families were recruited to represent the UK population of families with newborns in the 1990s, on the basis of residential location throughout England and Wales and mother's age. Teenaged mothers with twins were over-selected to replace high-risk families who were selectively lost to the register through nonresponse. Older mothers having twins via assisted reproduction were under-selected to avoid an excess of well-educated older mothers. The study sample represents the full range of socioeconomic conditions in the UK, as reflected in the families' distribution on a neighborhoodlevel socioeconomic index (ACORN [A Classification of Residential Neighborhoods], developed by CACI Inc. for commercial use): 25.6% of E-Risk families lived in "wealthy achiever" neighborhoods compared to 25.3% nationwide; 5.3% vs. 11.6% lived in "urban prosperity" neighborhoods; 29.6% vs. 26.9% lived in "comfortably off" neighborhoods; 13.4% vs. 13.9% lived in "moderate means" neighborhoods, and 26.1% vs. 20.7% lived in "hard-pressed" neighborhoods. E-Risk underrepresents "urban prosperity" neighborhoods because such households are likely to be childless.
Home-visits assessments took place when participants were aged 5, 7, 10, 12, and 18 years, when 93% of the participants took part. At ages 5, 7, 10, and 12 years, assessments were carried out with participants as well as their mothers (or primary caretakers); the home visit at age 18 included interviews only with participants. Each twin was assessed by a different interviewer. These data are supplemented by searches of official records and by questionnaires that are mailed, as developmentally appropriate, to teachers, and co-informants nominated by participants themselves. The Joint South London and Maudsley and the Institute of Psychiatry Research Ethics Committee approved each phase of the study. Parents gave informed consent and twins gave assent between 5-12 years and then informed consent at age 18. Study data may be accessed through agreement with the Study investigators (https://moffittcaspi.trinity.duke.edu/research-topics/erisk).
Childhood Socioeconomic Status (SES) was defined through a standardized composite of parental income, education, and occupation (Trzesniewski et al., 2006). The three SES indicators were highly correlated (r=0.57-0.67) and loaded significantly onto one factor. The populationwide distribution of the resulting factor was divided in tertiles for analyses.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint Childhood Victimization. As previously described (Danese et al., 2016), we assessed exposure to six types of childhood victimization between birth to age 12: exposure to domestic violence between the mother and her partner, frequent bullying by peers, physical and sexual harm by an adult, and neglect. The new Dunedin Pace of Aging methylation algorithm (DunedinPACE) was developed using elastic-net regression analysis carried out in the Dunedin Study, as described in detail in the Results. The criterion variable was Pace of Aging. Development of the Pace of Aging is described in detail elsewhere Elliott et al., 2021). Briefly, we conducted mixed-effects growth modeling of longitudinal change in 19 biomarkers measuring integrity of . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021.

Statistical Analysis
We conducted analysis of Dunedin, GSE55763, Understanding Society, NAS, Framingham, and E-Risk, data using regression models. We analyzed continuous outcome data using linear regression. We analyzed count outcome data using Poisson regression. We analyzed time-to-event outcome data using Cox proportional hazard regression. To estimate intraclass correlation coefficients for technical replicates in GSE55763, we used mixed-effects regression models. For analysis of repeated-measures longitudinal DNA methylation data in the NAS, we used generalized estimating equations to account for non-independence of repeated observations of individuals (Ballinger, 2004), following the method in previous analysis of those data (Gao et . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. Code for analysis. Code used for analysis and to prepare figures will be accessible via GitHub upon acceptance of the article for publication.
Code to calculate DunedinPACE from Illumina 450k or Epic Array Data. R code to implement DunedinPACE will be available via GitHub upon publication of the article.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint of approximately 20 Study members. Regression slopes are estimated from the raw, un-binned data. In Panel C, y-axis scales are denominated in units of standard deviations computed from the baseline measurement for all outcomes except self-rated health, for which the y-axis shows probability of incident fair/poor self-rated health. Changes were calculated over the interval between age-38 and age-45 assessments for all measures except cognition, for which change was calculated over the interval between the age-13 and age-45 assessments.   is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint of the two cohorts: those with DunedinPACE 1 SD or more below the mean ('slow' DunedinPACE, blue line); those with DunedinPACE within 1 SD of the mean ('average' DunedinPACE, gray line); and those with DunedinPACE 1 SD or more above the mean ('fast' DunedinPACE, red line). Censoring of participants prior to death is indicated with a gold hash marks. The table below the figure details the number of participants at risk per 3 year interval and, in parentheses, the number who died during the interval. Panels B and D show plots of effect-sizes for comparisons of risk groups. In Panel B, the reference category is the group who grew up in high-SES households. In Panel D, the reference category is the group for whom no victimization was recorded during childhood. The plotted points and error bars correspond to Cohen's d effect-sizes and 95% confidence intervals for four models: a base model including covariate adjustment for sex; a model including additional covariate adjustment for estimated cell counts; a model including additional covariate adjustment for smoking status and quantity at age 18, and a model including data only for E-Risk Study members who had never smoked. The figure shows that childhood adversity associations with DunedinPACE are mostly independent of variation in blood sample cell composition and smoking.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. Effect-sizes are reported as standardized regression coefficients interpretable as Pearson r values. Error bars show 95% confidence intervals. DNA methylation clocks were residualized for chronological age prior to analysis. Models included covariates for chronological age and sex. Physiology-based measures of biological age were computed from Understanding Society biomarker data (albumin, alkaline phosphatase, creatinine, C-reactive protein, blood urea nitrogen, glycated hemoglobin, systolic blood pressure, and forced expiratory volume in 1 second) based on algorithms derived in data from the US NHANES according is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint from Cox proportional hazard regression. For both panels, error bars show 95% confidence intervals. DNA methylation clocks were residualized for chronological age prior to analysis. All models included covariates for chronological age and sex. Figure 5. Effect-sizes for associations of childhood socioeconomic conditions and childhood victimization history with DunedinPACE, DunedinPoAm, and DNA methylation clocks at age 18. Panel A plots effect-sizes for low socioeconomic status (SES) in childhood (vs. high SES in childhood). Panel B plots effect-sizes for childhood polyvictimization (vs. no childhood victimization). Effect-sizes are reported as Cohen's d. For both panels, error bars show 95% confidence intervals. DNA methylation clocks were residualized for chronological age prior to analysis. All models included sex as a covariate. (All E-Risk participants are the same chronological age; no age covariate is needed.)

Supplemental
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
Gao, X., Coull, B., Lin, X., Vokonas, P., Spiro, A., Hou, L., Schwartz, J., and Baccarelli, A.A. (2021). Short-term air pollution, cognitive performance and nonsteroidal anti-inflammatory drug use in the Veterans Affairs Normative Aging Study. Nat. Aging 1-8. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
Lepeule, J., Litonjua, A.A., Gasparrini, A., Koutrakis, P., Sparrow, D., Vokonas, P.S., and Schwartz, J. (2018). Lung function association with outdoor temperature and relative humidity and its interaction with air pollution in the elderly. Environ. Res. 165, 110-117. Levine, M.E. (2013). Modeling the rate of senescence: Can estimated biological age predict mortality more accurately than chronological age? J. Gerontol . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; Figure 1. DunedinPACE provides a proxy measure of the 20-year Pace of Aging. Panel A shows the association of DunedinPACE with the 20-year Pace of Aging in the Dunedin Study cohort (r=0.78). Panel B shows effect-sizes for associations of DunedinPACE with measures of physical and cognitive functioning and subjective signs of aging in Dunedin Study members at age 45 years. Colors indicate groupings of measures (physical functions in shades of orange, cognitive functions in shades of yellow, subjective signs of aging in shades of green). Panel C shows binned scatterplots of associations of DunedinPACE with declines in physical and cognitive function and subjective signs of aging. Plotted points show average values for "bins" of approximately 20 Study members. Regression slopes are estimated from the raw, un-binned data. In Panel C, y-axis scales are denominated in units of standard deviations computed from the baseline measurement for all outcomes except self-rated health, for which the y-axis shows probability of incident fair/poor self-rated health. Changes were calculated over the interval between age-38 and age-45 assessments for all measures except cognition, for which change was calculated over the interval between the age-13 and age-45 assessments. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;  CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; Figure 3. Associations of DunedinPACE with chronological age, epigenetic clocks, Physiology-based measures of biological age, and self-rated health in Understanding Society. Panel A shows a scatterplot and fitted slopes illustrating the association between chronological age (x-axis) and DunedinPACE (y-axis) in women and men in the Understanding Society sample. Data for women are plotted with yellow dots (orange slope) and for men with blue crosses (navy slope). The figure illustrates a positive association between chronological age and DunedinPACE (Pearson r = 0.32). Panel B shows a matrix of correlations and association plots among DunedinPACE and age-acceleration residuals of Horvath, Hannum, Levine-PhenoAge and Lu-GrimAge epigenetic clocks. The diagonal cells of the matrix list the DNA methylation measures. The half of the matrix below the diagonal shows scatter plots of associations. For each scatter-plot cell, the y-axis corresponds to the variable named along the matrix diagonal to the right of the plot and the x-axis corresponds to the variable named along the matrix diagonal above the plot. The half of the matrix above the diagonal lists Pearson correlations between the DNA methylation measures. For each correlation cell, the value reflects the correlation of the variables named along the matrix diagonal to the left of the cell and below the cell.  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;

Figure 4. Association of DunedinPACE with mortality. Panel A shows mortality in the Normative Aging Study (NAS). Panel B shows mortality in the Framingham Heart Study
Offspring Cohort. The figure plots Kaplan-Meier curves for three groups of participants in each of the two cohorts: those with DunedinPACE 1 SD or more below the mean ('slow' DunedinPACE, blue line); those with DunedinPACE within 1 SD of the mean ('average' DunedinPACE, gray line); and those with DunedinPACE 1 SD or more above the mean ('fast' DunedinPACE, red line). Censoring of participants prior to death is indicated with a gold hash marks. The table below the figure details the number of participants at risk per 3 year interval and, in parentheses, the number who died during the interval. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;

Figure 5. DunedinPACE levels by strata of childhood socioeconomic status (SES) and victimization in the E-Risk Study. Panel A (left side) plots means and 95% CIs for
DunedinPACE measured at age 18 among E-Risk participants who grew up low, middle, and high socioeconomic status households. Panel B (right side) plots means and 95% CIs for DunedinPACE measured at age 18 among E-Risk participants who experienced 0, 1, 2, or three or more types of victimization through age 12 years. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021.   ICC . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Supplemental Figure 3. Effect-sizes for associations of DunedinPACE, DunedinPoAm, and DNA methylation clocks with physiology-based measures of biological age and self-rated health. Figure shows effect-sizes estimated from the Understanding Society data (n=1,175). Effect-sizes are reported as standardized regression coefficients interpretable as Pearson r values. Error bars show 95% confidence intervals. DNA methylation clocks were residualized for chronological age prior to analysis. Models included covariates for chronological age and sex. Physiology-based measures of biological age were computed from Understanding Society biomarker data (albumin, alkaline phosphatase, creatinine, C-reactive protein, blood urea nitrogen, glycated hemoglobin, systolic blood pressure, and forced expiratory volume in 1 second) based on algorithms derived in data from the US NHANES according to the methods developed by Levine et al. (Levine et al., 2018), Klemera andDoubal (Klemera andDoubal, 2006), andCohen et al. (Cohen et al., 2013). Effect-size -Pearson r . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Supplemental Figure 4. Effect-sizes for associations of DunedinPACE, DunedinPoAm, and DNA methylation clocks with morbidity and mortality. Panel A plots effect-sizes for analysis of incident and prevalent chronic disease morbidity in the Normative Aging Study. Effect-sizes are reported as relative risks (RR) per standard deviation increment of the aging measures estimated from Poisson regression. Panel B plots effect-sizes for analysis of mortality in the Normative Aging Study and the Framingham Heart Study Offspring cohort. Effect-sizes are reported as hazard ratios (HR) per standard deviation increment of the aging measures estimated from Cox proportional hazard regression. For both panels, error bars show 95% confidence intervals. DNA methylation clocks were residualized for chronological age prior to analysis. All models included covariates for chronological age and sex. A. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Supplemental Figure 5. Effect-sizes for associations of childhood socioeconomic conditions and childhood victimization history with DunedinPACE, DunedinPoAm, and DNA methylation clocks at age 18. Panel A plots effect-sizes for low socioeconomic status (SES) in childhood (vs. high SES in childhood). Panel B plots effect-sizes for childhood polyvictimization (vs. no childhood victimization). Effect-sizes are reported as Cohen's d. For both panels, error bars show 95% confidence intervals. DNA methylation clocks were residualized for chronological age prior to analysis. All models included sex as a covariate. (All E-Risk participants are the same chronological age; no age covariate is needed.) . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ;

Supplemental Table 1. Measurements included in modeling Pace of Aging in the Dunedin
Study Birth Cohort. We measured Pace of Aging from repeated assessments of a panel of 19 biomarkers: Body mass index (BMI), Waist-hip ratio, Glycated hemoglobin, Leptin, Blood pressure (mean arterial pressure), Cardiorespiratory fitness (VO2Max), Forced vital capacity ratio (FEV1/FVC), Forced expiratory volume in one second (FEV1), Total cholesterol, Triglycerides, High density lipoprotein (HDL), Lipoprotein(a), Apolipoprotein B100/A1 ratio, estimated Glomerular Filtration Rate (eGFR), Blood Urea Nitrogen (BUN), High Sensitivity C-reactive Protein (hs-CRP), White blood cell count, mean periodontal attachment loss (AL), and the number of dental-caries-affected tooth surfaces (tooth decay). This list includes two new biomarkers, leptin and caries, not included in the original Pace of Aging , both of which have now been assessed at multiple waves, allowing growth curve modeling. Telomere length was dropped because of an emerging and yet-unresolved field-wide debate about its measurement (Nettle et al., 2021). All other biomarkers were the same. The 20-year Pace of Aging measure is described in detail elsewhere (Elliott et al., 2021).

Body mass index
Height was measured to the nearest millimeter using a Seca 264 Wireless Stadiometer. Weight was measured to the nearest 0.1 kg using calibrated scales. Individuals were weighed in light clothing. Body mass index (BMI) was calculated using the standard formula weight in kilograms divided by height in meters squared.

Waist-hip ratio
Waist girth was the perimeter at the level of the noticeable waist narrowing located between the costal border and the iliac crest. Hip girth was taken as the perimeter at the level of the greatest protuberance and at about the symphysion pubic level anteriorly. Measurements were repeated and the average used to calculate waisthip ratio.

Glycated hemoglobin (HbA1C)
Whole blood glycated hemoglobin concentration (expressed as a percentage of total hemoglobin) was measured by ion exchange high performance liquid chromatography (BioRad D-100, Hercules, Calif.), a method certified by the US National Glycohemoglobin Standardization Program (http://www.ngsp.org/).

Cardiorespiratory fitness (VO2Max)
Cardiorespiratory fitness was assessed by measuring heart rate in response to a submaximal exercise test on a friction-braked cycle . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; ergometer. Dependent on the extent to which heart rate increased during a 2-min 50 W warm-up, the workload was adjusted to elicit a steady heart-rate in the range 130-170 beats per minute. After a further 6-min constant power output stage, the maximum heart rate was recorded and used to calculate predicted maximum oxygen uptake adjusted for body weight in milliliters per minute per kilogram (VO2max) according to standard protocols.

Lung function (FEV1 and FEV1/FVC)
We calculated post-albuterol forced expiratory volume in one second (FEV1) and the ratio of FEV1 to forced vital capacity (FVC; FEV1/FVC) using measurements from spirometry conducted with a Sensormedics body plethysmograph (Sensormedics Corporation, Yorba Linda, CA, USA).

High sensitivity Creactive protein (hs-CRP)
Serum C-reactive protein (mg/L) was measured by high sensitivity immunoturbidimetric assay on a Hitachi 917 analyzer (age 32), a Modular P analyzer (age 38), and a Cobas c702 (age 45). Values were log-transformed for analysis.

Mean periodontal attachment loss (AL)
Calibrated dentists used a PCP-2 periodontal probe (Hu-Friedy; Chicago) to measure gingival recession (the distance from the cementoenamel junction to the gingival margin) and probing depth (the distance from the probe tip to the gingival margin) in millimeters . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10. 1101 at three sites (mesiobuccal, buccal, and distolingual) per tooth, excluding third molar teeth. Periodontal attachment loss for each site was computed by summing gingival recession and probing depth. and then averaged across all periodontally examined teeth. Periodontal examinations were conducted with half-mouth examinations at age 26 and full-mouth examinations at ages 32, 38, and 45 years.

Caries-affected tooth surfaces
Calibrated dentists examined the teeth for caries and restorations following the World Health Organization Oral Health Surveys methodology. Four surfaces were considered for anterior teeth (canines and incisors): buccal, lingual, distal, and mesial; a fifth surface, occlusal, was considered for premolar and molar teeth. Tooth surfaces were classified as having untreated caries (DS) if a cavitated carious lesion was present, as filled (FS) if a dental restoration was present (including crowns), and missing due to caries (MS) if the participant indicated that a given tooth had been removed due to decay or failed dental restorative work. DS, MS, and FS counts were summed to obtain a DMFS score (ranging from 0 to 148 surfaces). Caries experience was expressed as the % of tooth surfaces that had been caries-affected, excluding surfaces of teeth that were unerupted, lost due to trauma, extracted for reasons other than caries (impaction, orthodontic treatment, or periodontal disease), or could not be visualized by the examiner.

Supplemental Table 3. Physical and cognitive functioning and subjective signs of aging measures in the Dunedin Study
Physical Functioning (N=817 with DNAm data) Balance Balance was measured using the Unipedal Stance Test as the maximum time achieved across three trials of the test with eyes closed Springer et al., 2007;Vereeck et al., 2008). Gait Speed Gait speed (meters per second) was assessed with the 6-m-long GAITRite Electronic Walkway (CIR Systems, Inc) with 2-m acceleration and 2-m deceleration before and after the walkway, respectively. Gait speed was assessed under 3 walking conditions: usual gait speed (walk at normal pace from a standing start, measured as a mean of 2 walks) and 2 challenge paradigms, dual-task gait speed (walk at normal pace while reciting alternate letters of the alphabet out loud, starting with the letter "A," measured as a mean of 2 walks) and maximum gait speed (walk as fast as safely possible, measured as a mean of 3 walks). We calculated the mean of the 3 individual walk conditions to generate our primary measure of composite gait speed .

Steps in Place
The 2-min step test was measured as the number of times a participant lifted their right knee to mid-thigh height (measured as the height half-way between the knee cap and the iliac crest) in 2 minutes at a self-directed pace . Chair Stands Chair rises were measured as the number of stands a participant completed in 30 seconds from a seated position .

Grip Strength
Handgrip strength was measured for the dominant hand (elbow held at 90°, upper arm held tight against the trunk) as the maximum value achieved across three trials using a Jamar digital dynamometer (Mathiowetz et al., 1985;.

Motor Coordination
At ages 38 and 45, we measured motor functioning as the time to completion of the Grooved Pegboard Test with the dominant hand. Physical Limitations Physical limitations were measured with the 10-item RAND 36-Item Health Survey 1.0 physical functioning scale (Ware and Sherbourne, 1992). Participant responses ("limited a lot", "limited a little", "not limited at all") assessed their difficulty with completing various activities, e.g., climbing several flights of stairs, walking more than 1 km, participating in strenuous sports, etc. Scores were reversed to reflect physical limitations so that a high score indicates more limitations.

Decline in Physical Functioning
Tests of balance and grip strength and interviews about physical limitations were completed at both the age-38 and age-45 Dunedin Study assessments. We measured decline across the 7-year measurement interval by subtracting the age-38 test score from the age-45 test score. Cognitive Functioning (N=814 with DNAm data) Cognitive Functioning The Wechsler Adult Intelligence Scale-IV (WAIS-IV) (Wechsler, 2008) was administered to the participants at age 45 years, yielding the IQ. In addition to full scale IQ, the WAIS-IV measures four specific domains of cognitive function: Processing Speed, Working Memory, Perceptual Reasoning, and Verbal Comprehension.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint Supplemental Table 4. Effect-sizes for analysis of DunedinPACE, DunedinPoAm, and DNA methylation clocks in models with covariate adjustment for age and sex only and for models with additional covariate adjustment for DNA methylation estimated cell counts and for smoking. Effect-sizes are reported for a one standard deviation increment of DunedinPACE. Outcome variables in Dunedin Study, Understanding Society Study, and E-Risk Study (Panels A, B, C, and E) were z-transformed for analysis; effect-sizes are interpretable as Pearson r or Cohen's d. For Normative Aging Study and Framingham Analysis (Panel C), effectsizes for time-to event outcomes are reported as hazard ratios (HR) and effect-sizes for prevalent chronic disease are reported as Risk Ratios.
Panel A. Effect-sizes for associations of aging measures with functional assessments at age 45 in the Dunedin Study. Effect-sizes are standardized coefficients from linear regressions of the outcomes on the aging measures. The models adjusted for cell counts (middle column of results) were adjusted for cell counts measured from complete blood count data (white blood cell count and percentages of lymphocytes, monocytes, neutrophils, eosinophils, and basophils). The models adjusted for smoking (left-most column of results) included a covariate for number of pack-years smoked.
Panel B. Effect-sizes for associations of aging measures with functional decline in the Dunedin Study. Decline was measured as change between the age-38 and age-45 assessments for all measures except cognition, for which change was computed over the interval beginning with an adolescent baseline test at age 13 and ending with the most recent cognitive assessment at age 45. Effect-sizes are coefficients from linear regressions of change scores on aging measures for all outcomes except incident fair/poor health, for which effect-sizes are risk ratios (RR) estimated from Poisson regression of incidence on the aging measures. Change scores were computed by standardizing baseline and follow-up measurements using the mean and standard deviation (SD) of the baseline assessment and computing the difference in standardized scores. A single unit of change therefore corresponds to 1 SD of variation at baseline. Effect-sizes are reported for a one SD difference in the aging measures. The models adjusted for smoking (leftmost column of results) included a covariate for number of pack-years smoked.
Panel C. Effect-sizes for associations of aging measures with clinical-biomarker measures of biological age, and self-rated health in the Understanding Society Study. Effect-sizes are standardized coefficients from linear regressions of the outcomes on the DNA methylation measures of aging. The models adjusted for smoking included covariates for smoking status and current smoking quantity (cigarettes per day).

Panel D. Effect-sizes for associations of aging measures with mortality and morbidity in the
Normative Aging Study and the Framingham Heart Study Offspring Cohort. Effect-sizes are standardized coefficients from linear regressions of the outcomes on the DNA methylation measures of aging. Analysis of mortality was conducted in both cohorts. Analysis of incident and prevalent chronic disease morbidity was conducted only in the Normative Aging Study. The models adjusted for smoking included covariates for smoking history (pack-years smoked) in the Normative Aging Study and for smoking status and current smoking quantity (cigarettes per day) in the Framingham Heart Study. Effect-sizes are hazard ratios (HR) from Cox proportional . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10.1101/2021.08.30.21262858 doi: medRxiv preprint hazard models for analysis of mortality and incident morbidity and risk ratios (RR) from mixedeffects Poisson regression models for prevalent morbidity.

Panel E. Effect-sizes for associations of childhood socioeconomic status (SES) and
childhood victimization with aging measures in the E-Risk Study. Effect-sizes are standardized coefficients (Cohen's d) from linear regressions of the aging measures on exposure groups (low and middle SES vs. high SES; one, two, and three or more types of victimization vs. no victimization). Models included covariate adjustment for sex. Standard errors were clustered at the family level to account for non-independence of sibling data.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted September 2, 2021. ; https://doi.org/10. 1101