Abstract
Objectives. We investigated the quality of 162 variables, focusing on the contribution of genetic markers, used solely or in combination with other characteristics, when predicting mortality.
Methods. In 5974 participants from the Rotterdam Study, followed for a median of 15.1 years, 7 groups of factors including age and gender, genetics, socioeconomics, lifestyle, physiological characteristics, prevalent diseases, and indicators of general health were related to all-cause mortality. Genetic variables were identified from 8 genome-wide association scans (n = 19 033) and literature review.
Results. We observed 3174 deaths during follow-up. The fully adjusted model (C-statistic for 15-year follow-up [C15y] = 0.80; 95% confidence interval [CI] = 0.75, 0.77) predicted mortality well. Most of the additional information apart from age and sex stemmed from physiological markers, prevalent diseases, and general health. Socioeconomic factors and lifestyle contributed meaningfully to mortality risk prediction with longer prediction horizon. Although specific genetic factors were independently associated with mortality, jointly they contributed little to mortality prediction (C15y = 0.56; 95% CI = 0.55, 0.57).
Conclusions. Mortality can be predicted reasonably well over a long period. Genetic factors independently predict mortality, but only modestly more than other risk indicators.
In the 20th century, life expectancy at birth increased from 50 years to over 80 years in Western countries.1 Demographers repeatedly predicted that it had reached a ceiling, but life expectancy in record countries continues to rise by an average of 3 months each year.2 Although epidemiological research has identified numerous predictors of mortality, information about their comparative effect sizes and long-term predictive power is sparse. Prior research has often been limited by a short period of follow-up, a limited set of covariates, or a focus on cause-specific mortality. Only a few studies have evaluated the potential for explaining mortality from a broader perspective by jointly analyzing demographic characteristics, lifestyle factors, and indicators of health and disease.3–7 It is still unclear whether genetic information can be used to predict mortality, but recent advances in genomic technology allow for the inclusion of genetic markers in the prediction of mortality.
We combined traditional indicators of mortality risk with genetic factors, derived from a meta-analysis of 8 genome-wide association studies and the literature, and associated them with mortality over 15 years of follow-up. Our aims were twofold: first, to identify independent determinants of mortality by analyzing 162 a priori identified risk factors; second, to provide information on the independent and combined potential of genetic markers in predicting mortality.
METHODS
The Rotterdam Study is a population-based prospective cohort initiated in 1990. It was designed to investigate risk factors for diseases in 7983 participants aged 55 years or older in the Ommoord district of Rotterdam, The Netherlands.8,9 In the initial and subsequent investigation waves, trained research assistants collected data on health, medication use, medical and family history, and lifestyle factors in extensive home interviews. Participants subsequently visited the research center for clinical examinations, with a special emphasis on imaging and on collecting and storing biospecimens to facilitate in-depth molecular and genetic analyses. Data analyzed in this study concern 5974 participants with genetic information available from the first wave of the Rotterdam Study.
Predictors
We organized baseline data into related groups: age and gender, genetics, socioeconomics, lifestyle, physiology, diseases, and general health. We hypothesized a priori that genetics, socioeconomics, and lifestyle were associated with long-term health effects, whereas physiology, disease, and general health were more likely associated with short-term mortality.10 Overall, we analyzed 162 risk indicators in this study: 69 previously studied risk factors for mortality and 93 single nucleotide polymorphisms (SNPs).
To study possible genetic risk factors of mortality, we used the genetic data of 19 033 participants (women = 55%) aged 55 years and older from 8 discovery cohorts of European ancestry. These people were participants not of the Rotterdam Study but of the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE), of which the Rotterdam Study is a part.11 We identified the top 50 loci from the meta-analysis of genome-wide association studies on time to death. In addition, we used 43 SNPs that mapped to genes from a seminal review.1 For the analysis, we extracted all SNPs from the imputed gene information of the Rotterdam Study, except for apolipoprotein E (APOE), which we genotyped directly.12 Information about cohort characteristics, genotyping, and imputation of the discovery set is presented in Tables A and B (available as a supplement to the online version of this article at http://www.ajph.org).
We included the following variables describing the socioeconomic status of the study population: education, employment status, monthly income, social class (derived from occupation of head of the household), health insurance status, number of children, whether living independently or in a nursing home, whether living with a partner, and death of spouse.8,9 We included the following indicators of lifestyle: riding a bike, alcohol consumption, smoking, energy intake, and fruit and vegetable consumption.13,14
We assessed physiological characteristics in terms of body weight, body mass index, waist circumference, hip circumference, waist-to-hip ratio, sitting systolic and diastolic blood pressures, leukocyte count, erythrocyte sedimentation rate, albumin level, total cholesterol level, high-density lipoprotein cholesterol, creatinine, uric acid, serum C-reactive protein, postload insulin, bone mineral density of the femoral neck and lumbar spine, and atherosclerotic plaques. We assessed all physiological variables using standard medical, laboratory, and imaging procedures as described previously.15–20
On the basis of self-report, investigations at the baseline center visit, medical record information, and drug utilization, we defined the following prevalent diseases: diabetes, left ventricular hypertrophy, atrial fibrillation, hypertension, hip fracture, peripheral artery disease, myocardial infarction, heart failure, dementia, gout, Parkinson’s disease, stroke, transient ischemic attack, cancer, cognitive function (Mini-Mental State Examination), and coronary operation.21–27
General health included the following: activities of daily living28 and instrumental activities of daily living,29 health care utilization, self-perceived comparative health, accidental falls, shortness of breath, past serious illness and hospitalization, unintentional weight loss, and self-reported memory complaints.
Outcome
The outcome measure was time to death from any cause. All participants of the Rotterdam Study were under continuous surveillance; general practitioners’ and hospital records, as well as death certificates from the municipality, were used to identify participants who died before January 1, 2009. The median follow-up was 15.1 years (range = 0.05–19.50).
Statistical Analysis
We used SAS 9.2 (SAS Institute Inc, Cary, NC) and the PROC MI procedure to impute to 5 complete data sets. We set the maximum missingness for analysis of the data at 30% a priori. The percentage of missing information is reported in Table C (available as a supplement to the online version of this article at http://www.ajph.org). Other than age, all continuous variables were standardized to facilitate the comparison of effect sizes. The estimates represent the effect of a change of 1 standard deviation.
We analyzed the risk indicators and their association with mortality with Cox proportional hazard models. We calculated unadjusted hazard ratios and the confidence intervals of each, and subsequently optimized the predefined groups separately, adjusting for age and gender within each group, by means of stacked imputed backward regression, with P = .2 as the exclusion threshold.30 Finally, we combined all remaining variables in a final model using backward regression. We also used least absolute shrinkage and selection operator (LASSO) penalized regression as implemented in R (R Development Core Team, Vienna, Austria) in the package “penalized” to validate the results from backward regressions.31,32 We evaluated the proportional hazard assumption using Schoenfeld residuals.33 Variables that did not fulfill the proportional hazards assumption in the imputed datasets were modeled with piecewise constant Heaviside functions.34 We considered P < .05 from 2-sided tests as statistically significant.
We used time-dependent receiver operating characteristic (ROC) curves to compare the predictive performance of the different variable groups over time.35 ROC curves describe the relationship between sensitivity (true positive rate) and 1 − specificity (false positive rate) for all possible cutoffs of a marker to distinguish between high-risk participants and low-risk participants. We also computed the C-index, the probability that a participant who dies on any given day during a specified time interval has a higher predictive score than one who survives beyond that day. For this part of the analysis, we accounted for residual time dependency using Schoenfeld smoothing.35 We estimated confidence intervals using cross-validation in 500 bootstrap samples.
RESULTS
At baseline, participants were on average 69 years of age (range = 55–99 years; Table 1) and 59% were female. Of the 5974 participants, 3174 died (mortality rate = 4.2 per 100 person-years) during the follow-up period. From 162 a priori identified risk factors of mortality (supplemental Table C), backward regression retained 108 variables independently in the final model. Of these, 36 were significantly related to mortality (P < .05) as independent risk factors (Table 1). Age (hazard ratio [HR] = 1.09; 95% confidence interval [CI] = 1.08, 1.10) and female gender (HR = 0.71; 95% CI = 0.62, 0.81) were strongly associated with mortality (Table 1).
TABLE 1—
Descriptive Statistics (Unadjusted and Completely Adjusted Models) and Association to 15-Year Mortality: The Rotterdam Study, 1990–2009
Baseline, Mean ±SD or No. (%) | Univariate |
Final Model |
||||
Variable | RR (95% CI) | P | RR (95% CI) | P | ||
Age, y | 69.43 ±9.10 | 1.12 (1.11, 1.12) | < .001 | 1.09 (1.08, 1.10) | < . 001 | |
Female gender | 3547 ±59.38 | 0.79 (0.73, 0.85) | < .001 | 0.71 (0.62, 0.81) | < . 001 | |
Genetics (rs, gene, Chr, allele 1/2)a | ||||||
Candidate genes from literature | ||||||
APOE ε 4 allele, chr 19 | 0.28 | 1.01 (0.95, 1.08) | .71 | 1.10 (1.03, 1.19) | .007 | |
rs6997892, WRN, chr 8 (G/A) | 0.88 | 0.97 (0.91, 1.05) | .48 | 0.92 (0.86, 1.00) | .04 | |
rs2684766, IGF1R, chr 15 (T/C) | 0.97 | 1.03 (0.88, 1.19) | .75 | 1.19 (1.00, 1.40) | .05 | |
rs11630259, IGF1R, chr 15 (T/C) | 0.73 | 1.04 (0.99, 1.10) | .15 | 1.09 (1.02, 1.17) | .02 | |
GWA continuous mortality selection | ||||||
rs10817931, TRIM32, chr 9(A/C) | 0.38 | 1.02 (0.97, 1.07) | .54 | 1.07 (1.01, 1.13) | .01 | |
rs1421783, MAT2B, chr 5(C/G) | 0.93 | 0.94 (0.85, 1.04) | .24 | 0.89 (0.80, 1.00) | .05 | |
Socioeconomic and lifestyle characteristics | ||||||
Social class (min = 1, max = 4) | 2.59 ±1.19 | 0.87 (0.84, 0.90) | < .001 | 0.95 (0.91, 0.99) | .03 | |
Living situation | < .001 | .05 | ||||
Independent | 4941 (82.71) | 1.00 (Ref) | 1.00 (Ref) | |||
Service flat | 610 (10.21) | 2.88 (2.61, 3.18) | 1.04 (0.93, 1.16) | |||
Home for elderly | 423 (7.08) | 9.06 (8.12, 10.12) | 1.26 (1.03, 1.47) | |||
Smoking | .53 | < .001 | ||||
Never | 2101 (35.17) | 1.00 (Ref) | 1.00 (Ref) | |||
Former | 2491 (41.70) | 0.94 (0.87, 1.02) | 1.07 (0.96, 1.19) | |||
Current | 1382 (23.13) | 1.12 (1.02, 1.23) | 1.45 (1.27, 1.66) | |||
Pack-years | 16.58 ±23.15 | 1.12 (1.08, 1.16) | < .001 | 1.07 (1.03, 1.12) | < .001 | |
Nutrition: energy intake, kJ | 8280.21 ± 2133 | 1.05 (0.99, 1.11) | .11 | 1.08 (1.02, 1.13) | .006 | |
Physiological characteristics | ||||||
Diastolic blood pressure, mmHg | 73.71 ±11.50 | 0.99 (0.95, 1.02) | .49 | 1.05 (1.00, 1.11) | .04 | |
Systolic blood pressure, mmHg | 139.37 ±22.30 | 1.37 (1.32, 1.41) | < .001 | 1.06 (1.00, 1.13) | .03 | |
Body mass index, kg/m2 | 26.30 ±3.71 | 0.93 (0.89, 0.97) | .05 | 0.86 (0.80, 0.92) | < .001 | |
Body mass index squared, (kg/m2)2 | 705.33 ±205.78 | 1.04 (1.02, 1.06) | .05 | 1.03 (1.01, 1.05) | < .001 | |
Waist circumference, cm (SD) | 90.57 ±11.17 | 1.21 (1.16, 1.25) | < .001 | 1.10 (1.04, 1.17) | .002 | |
Erythrocyte sedimentation, mm/h | 13.53 ±11.89 | 1.32 (1.28, 1.37) | < .001 | 1.08 (1.02, 1.14) | .006 | |
Leukocytes, × 10 9/L | 6.70 ±1.92 | 1.18 (1.15, 1.21) | < .001 | 1.11 (1.07, 1.15) | < .001 | |
Creatinine, μmol/L | 83.18 ±20.52 | 1.20 (1.18, 1.22) | < .001 | 1.06 (1.01, 1.12) | .03 | |
C-reactive protein | 3.36 ±6.61 | 1.21 (1.19, 1.23) | < .001 | 1.07 (1.03, 1.10) | < .001 | |
Total cholesterol, mmol/L | 6.60 ±1.22 | 0.81 (0.78, 0.84) | < .001 | 0.92 (0.89, 0.96) | < .001 | |
Bone mineral density of femoral neck | 0.83 ±0.14 | 0.77 (0.71, 0.83) | < .001 | 0.93 (0.88, 0.99) | .01 | |
Aortic calcification | 1.80 ±1.49 | 1660 (1.55, 1.76) | < .001 | 1.08 (1.01, 1.16) | .03 | |
Disease characteristics | ||||||
Diabetes mellitus (yes vs no) | 618 (10.35) | 2.11 (1.92, 2.33) | < .001 | 1.39 (1.25, 1.55) | < .001 | |
Left ventricular hypertrophy (yes vs no) | 258 (4.32) | 2.35 (2.04, 2.70) | < .001 | 1.33 (1.13, 1.55) | < .001 | |
Atrial fibrillation (yes vs no) | 318 (5.32) | 3.28 (2.89, 3.73) | < .001 | 1.32 (1.15, 1.51) | < .001 | |
Peripheral artery disease (yes vs no) | 1133 (18.97) | 2.63 (2.42, 2.86) | < .001 | 1.16 (1.03, 1.30) | .01 | |
Myocardial infarction (yes vs no) | 754 (12.62) | 2.06 (1.87, 2.26) | < .001 | 1.39 (1.25, 1.55) | < .001 | |
Disease (yes vs no) | 64 (1.07) | 4.15 (3.26, 5.28) | < .001 | 1.54 (1.16, 2.05) | .003 | |
Prevalent cancer | < .001 | < .001 | ||||
Time, 0–5 y (yes vs no) | 282 (4.72) | 2.58 (2.05, 3.24) | 2.03 (1.60, 2.58) | |||
Time, > 5 y (yes vs no) | 200 (3.87) | 1.44 (1.18, 1.76) | 1.08 (0.88, 1.30) | |||
Mini-Mental State Examination | 27.26 ±2.84 | 0.59 (0.58, 0.61) | < .001 | 0.86 (0.82, 0.90) | < .001 | |
General health | ||||||
Serious illness in the last 5 y? (yes vs no) | 621 (10.40) | 1.48 (1.33, 1.66) | < .001 | 1.13 (1.00, 1.28) | .05 | |
Unintentional weight loss? (yes vs no) | 675 (11.30) | 2.12 (1.92, 2.33) | < .001 | 1.22 (1.09, 1.36) | < .001 | |
How is your general health compared with members of your age group? | < .001 | < .001 | ||||
Better | 3083 (51.61) | 1.00 (Ref) | 1.00 (Ref) | |||
Same | 2299 (38.48) | 0.97 (0.90, 1.05) | 1.06 (0.97, 1.15) | |||
Worse | 592 (9.91) | 1.59 (1.42, 1.77) | 1.32 (1.14, 1.53) | |||
Prevalent memory complaints | ||||||
Time, 0–5 y | < .001 | .003 | ||||
No memory complaints | 5559 (93.06) | 1.00 (Ref) | 1.00 (Ref) | |||
Mild memory complaints | 370 (6.19) | 4.86 (4.05, 5.82) | 1.15 (0.94, 1.40) | |||
Severe memory complaints | 45 (0.75) | 10.36 (7.31, 14.67) | 1.02 (0.64, 1.63) | |||
Time, > 5 y | < .001 | .003 | ||||
No memory complaints | 4928 (95.41) | 1.00 (Ref) | 1.00 (Ref) | |||
Mild memory complaints | 207 (4.01) | 2.61 (2.19, 3.1) | 1.07 (0.90, 1.27) | |||
Severe memory complaints | 15 (0.29) | 17.04 (10.25, 28.33) | 3.34 (1.82, 6.11) |
Note. Chr = chromosome; CI = confidence interval; CVA = cardiovascular accident; GWA = genome-wide association; RR = relative risk. Total number of participants was 5974. The table shows all variables significant in the final model. Variables included in the full model but not included in this table are the following: socioeconomics: education, social class, occupation, insurance, living circumstance, death of spouse; lifestyle: alcohol consumption in g/day, fruit consumption in g/day, vegetable consumption in g/day; physiology: waist circumference, high density lipoprotein cholesterol, bone mineral density of lumbar spine, bone mineral density of femoral neck; general health: specialist visit within the last month, number of specialist visits in the last year, general practitioner visit within the last month, number of general practitioner visits in the last year, hospitalization within the last year, falls in the previous month, activities of daily living; disease: gout, vertebral fracture, cardiovascular accident, transient ischemic attack, hip fracture, coronary operation.
Values represent the frequency of allele 1.
Of the candidate genes, a priori identified in the literature, APOE, insulin-like growth factor 1 receptor (IGF1R), and Werner syndrome, RecQ helicase-like (WRN) were significant and independent predictors of mortality. From the 50 independent loci, identified from the meta-analysis of 8 discovery genome-wide association studies on time to death, 2 SNPs in the neighborhood of tripartite motif-containing 32 (TRIM32) and methionine adenosyltransferase II, β (MAT2B) were associated with mortality.
Social class and living in serviced housing were independently associated with risk of death. Smoking status and pack-years as well as energy intake were also associated with mortality. The physiological measures blood pressure, body mass index, and waist circumference, and, in particular, the risk indicators assessed in blood (such as erythrocyte sedimentation rate, leukocytes, creatinine, C-reactive protein, and total cholesterol) or with imaging (such as bone mineral density of the femoral neck and aortic calcification) were all independently related to mortality. Diabetes, cardiac diseases, Parkinson’s disease, cancer, and cognitive function remained independently associated with death. Self-perceived comparative health was a good indicator of mortality risk, as were unintentional weight loss and serious illness in the past 5 years.
The predictive power of the variable groups is best explained in 2 ways. First, Figures 1 and 2 show the development of predictive quality over time. For each point during follow-up, the graphs depict the respective time-dependent area under the curve of a given variable group. Next, Table 2 quantifies the predictive quality for 5 specific prediction intervals (1, 3, 5, 10, and 15 years).
FIGURE 1—
Time-dependent receiver-operating characteristic curves for prediction of mortality for different groups of variables: The Rotterdam Study, 1990–2009.
Note. AUC = area under the curve.
FIGURE 2—
Time-dependent receiver-operating characteristic curves for prediction of mortality for age and gender and differently combined models: The Rotterdam Study, 1990–2009.
Note. AUC = area under the curve.
TABLE 2—
C-Index for Different Combinations of Risk Factors at Different Time Points During 15-Year Follow-Up: The Rotterdam Study, 1990–2009
0–1 Years, C-Index (95% CI) | 0–3 Years, C-Index (95% CI) | 0–5 Years, C-Index (95% CI) | 0–10 Years, C-Index (95% CI) | 0–15 Years, C-Index (95% CI) | |
Age and gender | 0.80 (0.78, 0.82) | 0.80 (0.78, 0.81) | 0.79 (0.77, 0.8) | 0.77 (0.76, 0.78) | 0.76 (0.75, 0.77) |
Genetics | 0.55 (0.53, 0.58) | 0.55 (0.53, 0.58) | 0.55 (0.54, 0.57) | 0.56 (0.54, 0.57) | 0.56 (0.55, 0.57) |
Socioeconomics | 0.79 (0.77, 0.81) | 0.78 (0.76, 0.8) | 0.76 (0.75, 0.78) | 0.73 (0.72, 0.74) | 0.72 (0.71, 0.73) |
Lifestyle | 0.64 (0.62, 0.66) | 0.63 (0.62, 0.65) | 0.62 (0.61, 0.64) | 0.60 (0.59, 0.61) | 0.59 (0.58, 0.6) |
General health | 0.79 (0.76, 0.82) | 0.77 (0.75, 0.8) | 0.75 (0.73, 0.77) | 0.71 (0.7, 0.72) | 0.68 (0.67, 0.69) |
Disease | 0.78 (0.74, 0.82) | 0.76 (0.73, 0.79) | 0.75 (0.72, 0.77) | 0.71 (0.7, 0.72) | 0.69 (0.68, 0.7) |
Physiology | 0.79 (0.75, 0.82) | 0.77 (0.74, 0.8) | 0.76 (0.73, 0.78) | 0.73 (0.72, 0.75) | 0.72 (0.71, 0.73) |
Age and gender + socioeconomics and lifestyle | 0.82 (0.79, 0.84) | 0.81 (0.79, 0.83) | 0.80 (0.79, 0.82) | 0.78 (0.78, 0.79) | 0.77 (0.77, 0.78) |
Age and gender + general health + disease + physiology | 0.85 (0.82, 0.87) | 0.84 (0.82, 0.86) | 0.83 (0.81, 0.84) | 0.81 (0.8, 0.82) | 0.79 (0.78, 0.8) |
Full model: age and gender + socioeconomics + lifestyle + general health + disease + physiology + genetics | 0.85 (0.83, 0.88) | 0.84 (0.82, 0.86) | 0.83 (0.82, 0.85) | 0.81 (0.81, 0.82) | 0.80 (0.79, 0.81) |
Note. CI = confidence interval. The C-indices in this table are the areas under the curve of the graphs in Figures 1 and 2 for 5 different time intervals (1, 3, 5, 10, and 15 y). A C-index of 0.50 indicates a prediction of mortality that is no better than chance, whereas a C-index of 1.0 reflects perfect predictive quality.
Figure 1 shows that over time, all variable groups except genetic risk markers exhibited decreasing ability to predict death. Figure 1 and Table 2 also demonstrate that prediction based solely on age and gender consistently outperformed all other groups of variables (C-statistic for 15-year follow-up [C15y] = 0.76; 95% CI = 0.75, 0.77; Table 2). Physiological risk and socioeconomic characteristics each predicted mortality equally well over 15 years (each C15y = 0.72; 95% CI = 0.71, 0.73). Although of significantly less predictive quality, the C-index of genetic risk markers was still better than chance (C15y = 0.56; 95% CI = 0.55, 0.57).
Figure 2 shows the performance of the age- and gender-adjusted model compared with the fully adjusted model (C15y = 0.80; 95% CI = 0.79, 0.81). Whereas adding socioeconomic and lifestyle information to age and gender (C15y = 0.77; 95% CI = 0.77, 0.78) only slightly improved the predictive quality, the combination of age, gender, general health, disease, and physiology (C15y = 0.79; 95% CI = 0.78, 0.80) predicted mortality almost as well as all covariates that remained after backward regression.
To allow comparison with other studies of cause-specific mortality and different population health status, we report the associations of the final model stratified by prevalent, baseline disease status and for cardiovascular disease mortality in Table D (available as a supplement to the online version of this article at http://www.ajph.org).
DISCUSSION
From a set of 162 established risk factors and candidate SNPs for mortality, we identified 36 (31 nongenetic, 5 genetic) independent and significant predictors of mortality. Specific genetic factors were independently associated with mortality and jointly predicted mortality better than chance. However, genetic information added little to age, gender, and other traditional predictors of mortality.
This analysis confirms prior findings that multiple diseases, as well as socioeconomics and lifestyle, jointly influence mortality in the aging adult population.3,6 Numerous predictors remained independently and significantly associated with mortality. Although several markers of prevalent disease remained associated with mortality, their risk ratios were attenuated. Others, such as prevalent dementia, cerebrovascular accidents, and transient ischemic attacks, were not more effective at predicting mortality than indicators of disease severity and subclinical disease such as the Mini-Mental State Examination and serum C-reactive protein.
Several specific SNPs were independently associated with mortality in our analysis. In accordance with the literature, each additional copy of the APOE ε4 allele increased mortality in our study cohort.1,36,37 Similarly, the IGF1R and WRN genes have been described before as being associated with longevity and aging, respectively, via improving stress resistance, innate immunity, metabolic maintenance and repair of DNA.1 This study also confirmed 2 novel loci in the vicinity of the TRIM32 and MAT2B genes as being associated with death. These 2 loci were identified from the pool of SNPs identified by the meta-analysis of the discovery cohorts in the genome-wide association studies. These genes have recently been associated with cancer proliferation.38,39
Interestingly, unlike the predictive ability of all other domains, the ability of genetic markers to predict death remained constant during the course of follow-up. The stability of the predictive power of the SNPs observed in this study is probably due to the permanent nature of the genetic makeup. Other variables showed decreasing predictive power over time that can be explained by changes in the value of these variables during the course of follow-up.
Our results support the view that specific SNPs can be identified that are associated with mortality and might be used for risk prediction.40,41 The results also show, however, that these common SNPs have very limited predictive power and that, especially when used in combination with traditional risk factors, they contribute very little, if anything at all, to improve the prediction of death in the general population aged 55 years and older.
Another finding relates to age and gender as predictors of mortality. In our study, the relative risk for mortality per year of age in the univariate model was only reduced by 25% in the fully adjusted model. This is compatible with the idea that aging is not merely the clinical manifestation of disease but an underlying, disease-independent accumulation of pathophysiological changes that favor mortality over time.42–44
The gender differences in this study were not due to differences in prevalent diseases at the onset of the study. Females exhibited even stronger reduced risks after adjustment for other risk factors. This strongly suggests that the gender difference in survival cannot be explained by differences in health behavior and disease at baseline.45 We can only speculate that the survival benefits of females can be found in gender effects or different genetic origins not accounted for in this analysis. One of the potential genetic candidates that could contribute to the female survival advantage is the X chromosome.46 Another genetic candidate is mitochondrial DNA (mtDNA). It has been suggested that, because mtDNA is inherited from the maternal line, a possible intergenomic conflict between mtDNA and nuclear DNA favors female survival.47 We could not include markers on the X chromosome and mtDNA because the X chromosome is not commonly analyzed in all the discovery cohorts and mtDNA is also not available on all genotyping platforms.
Prediction of mortality by age and gender improved by only 5% upon inclusion of all independent mortality predictors over the entire 15 years of follow-up. This is particularly interesting considering the multitude of independent risk factors identified in this study, and because all groups contributed significant variables. From the time-dependent area-under-the-curve analyses, 2 observations are particularly noteworthy: first, most of the additional information beyond age and gender stemmed from indicators of physiology, disease, and general health; second, although socioeconomic factors were equally good at predicting mortality as indicators of disease, the combination of socioeconomics, lifestyle, and genetic markers contributed visibly to the explanation of mortality risk only after 10 years of follow-up and beyond. Thus, although socioeconomics and lifestyle were associated to mortality, they seemingly exerted their effects on mortality through physiological risk indicators and disease rather than by acting independently on mortality.48 This underscores the importance of socioeconomics and living conditions for public policy aimed at reducing health inequalities.
To summarize the findings reported here and in other studies, one can insinuate a cascade from gene to individual health to death, in which every step is accompanied by environmental influences, some of which are controlled by the individual (such as physical activity and obesity) whereas others are defined by the individual’s living circumstances, cultural heritage and surroundings. Figure A (available as a supplement to the online version of this article at http://www.ajph.org) illustrates at which stage during the course of aging different interventions (e.g., improvements in living circumstances or the introduction of a new therapeutic drug) can feasibly act and which health gains could be expected.
Limitations
Caution is needed in interpreting this study. It was not our aim to evaluate the size of the mortality risk associated with single risk factors. Several of the markers in this analysis describe the same underlying construct (e.g., body composition). Therefore, the specific relative risks must be interpreted cautiously. Other important aspects of health such as physical activity and mental health are barely represented among the risk factors analyzed, as only “riding a bike” and “self-perceived comparative health” were available to approximate these important dimensions of health. Another limitation concerns the genetic markers used in this study. We included only autosomal SNPs. Genetic risk is transferred through several other mechanisms, including DNA methylation, copy number variations, and mitochondrial DNA. Furthermore, because this study has not been replicated externally, it probably cannot be used for constructing a risk score. We did not seek external replication because of the multitude of specific risk factors and instead relied on bootstrapping and cross-validation for guiding the LASSO analysis and the estimation of the C-index. At the same time, the selection of SNPs is among the strengths of this study, as the SNPs were identified from 2 sources, independent of the population under study. Other strengths are related to the multitude of risk factors and the prospective design with long follow-up.
Conclusions
We found 36 variables that independently and significantly predicted mortality in the Rotterdam Study population. Adding further risk indicators to age and gender improved our ability to predict death, but the gain in predictive quality was modest, particularly in the long run. Surprisingly, specific genetic risk factors, independently and as a group, predicted mortality, but their added value to conventional predictors of mortality was low. Our findings also support the importance of primary prevention in the areas of socioeconomics and lifestyle, as we could illustrate how these risk factors continuously influence mortality risk independently and through their impact on physiological risk status and disease.
Acknowledgments
The Rotterdam Study is supported by the Erasmus Medical Center and Erasmus University Rotterdam, The Netherlands Organization for Scientific Research, The Netherlands Organization for Health Research and Development, the Research Institute for Diseases in the Elderly, the Ministry of Education, Culture and Science, the Ministry of Health, Welfare and Sports, and the European Commission and the Municipality of Rotterdam. This research was supported in part by the Intramural Research Program, NIH, National Institute on Aging. Stefan Walter is financed by Netspar and the Netspar theme “Living Longer in Good Health.” The work of Henning Tiemeier is supported by Vidi (grant 017.106.370).
Human Participant Protection
The Rotterdam Study has been approved by the Medical Ethics Committee of the Erasmus Medical Center. All participants provided written informed consent.
References
- 1.Christensen K, Johnson TE, Vaupel JW. The quest for genetic determinants of human longevity: challenges and insights. Nat Rev Genet. 2006;7(6):436–448 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Oeppen J, Vaupel Demography JW. Broken limits to life expectancy. Science. 2002;296(5570):1029–1031 [DOI] [PubMed] [Google Scholar]
- 3.Fried LP, Kronmal RA, Newman ABet al. Risk factors for 5-year mortality in older adults: The Cardiovascular Health Study. JAMA. 1998;279(8):585–592 [DOI] [PubMed] [Google Scholar]
- 4.Noale M, Minicuci N, Bardage Cet al. Predictors of mortality: an international comparison of socio-demographic and health characteristics from six longitudinal studies on aging: the CLESA project. Exp Gerontol. 2005;40(1–2):89–99 [DOI] [PubMed] [Google Scholar]
- 5.Lee SJ, Lindquist K, Segal MR, Covinsky KE. Development and validation of a prognostic index for 4-year mortality in older adults. JAMA. 2006;295(7):801–808 [DOI] [PubMed] [Google Scholar]
- 6.Newman AB, Sachs MC, Arnold AMet al. Total and cause-specific mortality in the cardiovascular health study. J Gerontol A Biol Sci Med Sci. 2009;64(12):1251–1261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Baer HJ, Glynn RJ, Hu FBet al. Risk factors for mortality in the nurses’ health study: a competing risks analysis. Am J Epidemiol. 2011;173(3):319–329 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hofman A, Grobbee DE, de Jong PT, van den Ouweland FA. Determinants of disease and disability in the elderly: the Rotterdam Elderly Study. Eur J Epidemiol. 1991;7(4):403–422 [DOI] [PubMed] [Google Scholar]
- 9.Hofman A, Breteler MM, van Duijn CMet al. The Rotterdam Study: 2010 objectives and design update. Eur J Epidemiol. 2009;24(9):553–572 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Link BG, Phelan J. Social conditions as fundamental causes of disease. J. Health Soc Behav. 1995;(spec no.):80–94 [PubMed] [Google Scholar]
- 11.Psaty BM, O’Donnell CJ, Gudnason Vet al. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium: design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ Cardiovasc Genet. 2009;2(1):73–80 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Slooter AJC, Cruts M, Kalmijn Set al. Risk estimates of dementia by apolipoprotein E genotypes from a population-based incidence study: The Rotterdam Study. Arch Neurol. 1998;55(7):964–968 [DOI] [PubMed] [Google Scholar]
- 13.Klipstein-Grobusch K, den Breeijen JH, Goldbohm RAet al. Dietary assessment in the elderly: validation of a semiquantitative food frequency questionnaire. Eur J Clin Nutr. 1998;52(8):588–596 [DOI] [PubMed] [Google Scholar]
- 14. Nevo Food Composition Table From the Netherlands Nutrient Databank. Den Haag, The Netherlands: Voorlichtingsbureau voor de Voeding; 1993. [Google Scholar]
- 15.Visscher TL, Seidell JC, Molarius A, van der Kuip D, Hofman A, Witteman JC. A comparison of body mass index, waist-hip ratio and waist circumference as predictors of all-cause mortality among the elderly: The Rotterdam Study. Int J Obes Relat Metab Disord. 2001;25(11):1730–1735 [DOI] [PubMed] [Google Scholar]
- 16.Hak AE, Pols HA, Visser TJ, Drexhage HA, Hofman A, Witteman JC. Subclinical hypothyroidism is an independent risk factor for atherosclerosis and myocardial infarction in elderly women: The Rotterdam Study. Ann Intern Med. 2000;132(4):270–278 [DOI] [PubMed] [Google Scholar]
- 17.Ikram MK, de Jong FJ, Vingerling JRet al. Are retinal arteriolar or venular diameters associated with markers for cardiovascular disorders? The Rotterdam Study. Invest Ophthalmol Vis Sci. 2004;45(7):2129–2134 [DOI] [PubMed] [Google Scholar]
- 18.Hak AE, Pols HAP, Stehouwer CDAet al. Markers of inflammation and cellular adhesion molecules in relation to insulin resistance in nondiabetic elderly: The Rotterdam Study. J Clin Endocrinol Metab. 2001;86(9):4398–4405 [DOI] [PubMed] [Google Scholar]
- 19.Bos MJ, Koudstaal PJ, Hofman A, Witteman JCM, Breteler MMB. Uric acid is a risk factor for myocardial infarction and stroke: The Rotterdam Study. Stroke. 2006;37(6):1503–1507 [DOI] [PubMed] [Google Scholar]
- 20.Burger H, van Daele PL, Algra Det al. The association between age and bone mineral density in men and women aged 55 years and over: The Rotterdam Study. Bone Miner. 1994;25(1):1–13 [DOI] [PubMed] [Google Scholar]
- 21.Ikram MK, Witteman JCM, Vingerling JR, Breteler MMB, Hofman A, de Jong PTVM. Retinal vessel diameters and risk of hypertension: The Rotterdam Study. Hypertension. 2006;47(2):189–194 [DOI] [PubMed] [Google Scholar]
- 22.Ott A, Stolk RP, Hofman A, van Harskamp F, Grobbee DE, Breteler MM. Association of diabetes mellitus and dementia: The Rotterdam Study. Diabetologia. 1996;39(11):1392–1397 [DOI] [PubMed] [Google Scholar]
- 23.Mosterd A, Hoes AW, de Bruyne MCet al. Prevalence of heart failure and left ventricular dysfunction in the general population: The Rotterdam Study. Eur Heart J. 1999;20(6):447–455 [PubMed] [Google Scholar]
- 24.Hofman A, van Duijn CM, Franco OHet al. The Rotterdam Study: 2012 objectives and design update. Eur J Epidemiol. 2011;26:657–686 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bots ML, Looman SJ, Koudstaal PJ, Hofman A, Hoes AW, Grobbee DE. Prevalence of stroke in the general population: The Rotterdam Study. Stroke. 1996;27(9):1499–1501 [DOI] [PubMed] [Google Scholar]
- 26.van der Klift M, De Laet CEDH, McCloskey EV, Hofman A, Pols HAP. The incidence of vertebral fractures in men and women: The Rotterdam Study. J Bone Miner Res. 2002;17(6):1051–1056 [DOI] [PubMed] [Google Scholar]
- 27.van der Klift M, de Laet CE, Coebergh JW, Hofman A, Pols HA. Bone mineral density and the risk of breast cancer: The Rotterdam Study. Bone. 2003;32(3):211–216 [DOI] [PubMed] [Google Scholar]
- 28.Fries JF, Spitz PW, Young DY. The dimensions of health outcomes: the health assessment questionnaire, disability and pain scales. J Rheumatol. 1982;9(5):789–793 [PubMed] [Google Scholar]
- 29.Lawton MP, Moss M, Fulcomer M, Kleban MH. A research and service oriented multilevel assessment instrument. J Gerontol. 1982;37(1):91–99 [DOI] [PubMed] [Google Scholar]
- 30.Wood AM, White IR, Royston P. How should variable selection be performed with multiply imputed data? Stat Med. 2008;27(17):3227–3246 [DOI] [PubMed] [Google Scholar]
- 31.Tibshirani R. The lasso method for variable selection in the Cox model. Stat Med. 1997;16(4):385–395 [DOI] [PubMed] [Google Scholar]
- 32.Goeman J. L1 penalized estimation in the Cox proportional hazards model. Biome J. 2010;52(1)70–84 [DOI] [PubMed] [Google Scholar]
- 33.Schoenfeld D. Partial residuals for the proportional hazards regression model. Biometrika. 1982;69(1):239–241 [Google Scholar]
- 34.Kleinbaum DG, Klein M. Survival Analysis: A Self-Learning Text. Vol 2 New York, NY: Springer; 2005 [Google Scholar]
- 35.Heagerty PJ, Zheng Y. Survival model predictive accuracy and ROC curves. Biometrics. 2005;61(1):92–105 [DOI] [PubMed] [Google Scholar]
- 36.Ewbank DC. Differences in the association between apolipoprotein E genotype and mortality across populations. J Gerontol A Biol Sci Med Sci. 2007;62(8):899–907 [DOI] [PubMed] [Google Scholar]
- 37.Vaupel JW. Biodemography of human ageing. Nature. 2010;464(7288):536–542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kano S, Miyajima N, Fukuda S, Hatakeyama S. Tripartite motif protein 32 facilitates cell growth and migration via degradation of Abl-interactor 2. Cancer Res. 2008;68(14):5572–5580 [DOI] [PubMed] [Google Scholar]
- 39.Lu SC, Mato JM. S-Adenosylmethionine in cell growth, apoptosis and liver cancer. J Gastroenterol Hepatol. 2008;23(suppl 1):S73–S77 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lunetta KL, D’Agostino RB Sr, Karasik Det al. Genetic correlates of longevity and selected age-related phenotypes: a genome-wide association study in the Framingham Study. BMC Med Genet. 2007;8(suppl 1):S13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kraft P, Hunter DJ. Genetic risk prediction—are we there yet? N Engl J Med. 2009;360(17):1701–1703 [DOI] [PubMed] [Google Scholar]
- 42.Franco OH, Karnik K, Osborne G, Ordovas JM, Catt M, van der Ouderaa F. Changing course in ageing research: The Healthy Ageing Phenotype. Maturitas. 2009;63(1):13–19 [DOI] [PubMed] [Google Scholar]
- 43.Franceschi C, Capri M, Monti Det al. Inflammaging and anti-inflammaging: a systemic perspective on aging and longevity emerged from studies in humans. Mech Ageing Dev. 2007;128(1):92–105 [DOI] [PubMed] [Google Scholar]
- 44.Hayflick L. The future of ageing. Nature. 2000;408(6809):267–269 [DOI] [PubMed] [Google Scholar]
- 45.Liang J, Bennett JM, Sugisawa H, Kobayashi E, Fukaya T. Gender differences in old age mortality: roles of health behavior and baseline health status. J Clin Epidemiol. 2003;56(6):572–582 [DOI] [PubMed] [Google Scholar]
- 46.Christensen K, Orstavik KH, Vaupel JW. The X chromosome and the female survival advantage: an example of the intersection between genetics, epidemiology and demography. Ann N Y Acad Sci. 2001;954:175–183 [PubMed] [Google Scholar]
- 47.Rand DM. Mitochondrial genetics of aging: intergenomic conflict resolution. Sci Aging Knowledge Environ. 2005;2005(45):re5 [DOI] [PubMed] [Google Scholar]
- 48.Steenland K, Henley J, Calle E, Thun M. Individual- and area-level socioeconomic status variables as predictors of mortality in a cohort of 179,383 persons. Am J Epidemiol. 2004;159(11):1047–1056 [DOI] [PubMed] [Google Scholar]