Introduction

Hypertension, as the leading risk factor for cardiovascular morbidity and mortality, already affects one billion people worldwide.1 Complications of hypertension account for 9.4 million global deaths every year.2 To reduce the burden of cardiovascular disease, guidelines have stressed on the importance of preventing hypertension.3 While China, one of the world’s most populous and fastest developing countries, data reported that increased prevalence of hypertension partly attributed to increased incidence of hypertension.4, 5 A latest survey demonstrated that the prevalence rate of hypertension in China has increased to 29.6% and the awareness is just 42.6%, which is much lower than developed countries (66.5%).6 Hundreds of millions Chinese have not realized the silent, invisible killer.

Consequently, it is very necessary to establish prediction model to identify people at high risk for inclusion in clinical trials for hypertension prevention. So far many countries and regions have developed or validated risk models that predict the onset of hypertension. Among those, Framingham hypertension risk score is widely used to predict the risk for near-term incident hypertension.7 While Framingham Heart Study was based on Caucasian, its prediction models have been validated not so suitable for Chinese.8, 9, 10 With respect to Chinese, a 15-year risk of incident hypertension model was established on a cohort which set up since 1992 from 11 provinces and cities of China.11 Yet, the 15-year risk model of hypertension has not considered so many biochemical risk factors in the beginning of constructing model. Another famous Chinese hypertension risk model was established by the Chin-Shan Community Cardiovascular Cohort Study in Taiwan.12 However, lifestyle risk factors were lacking in the model of this community study.

Therefore, we aimed to investigate the incidence of hypertension in the northern urban Han Chinese health check-up population and construct a hypertension prediction model through the prospective cohort. We included many biochemical and lifestyle predictors that were confirmed by other investigations and available in the routine health check-up.

Subjects and Methods

Study population and cohort design

Subjects who participated in routine health check-up system in Center for Health Management of Shandong Provincial Qianfoshan Hospital and Shandong Provincial Hospital between 2005 and 2010 composed the longitudinal cohort representative of the northern urban Han Chinese population (n=95 908). For our research, participants were eligible for inclusion if they possessed at least three year’s health check-up records between 2005 and 2010 (n=18 665). We excluded participants at the first check-up who had hypertension with antihypertensive medication, or systolic blood pressure (SBP) ⩾140 mm Hg, or diastolic blood pressure (DBP) ⩾90 mm Hg, or cardiovascular disease (coronary heart disease, cardiomyopathy, atrial fibrillation), or stroke (ischemic stroke and hemorrhagic stroke); were younger than 18 years and had missing covariates. Finally, 7 537 men and 4 960 women remained available for analysis. The median follow-up was 4.0 years.

Measurements

At every health check-up, participants underwent a routine questionnaire, anthropometric, clinical and laboratory tests after at least a 12-h fast. The questionnaire contained lifestyle behaviors of smoking, drinking and physical activity and history of cardiovascular disease, hypertension, contents of medicine use and other background information. The anthropometric measurement covered weight, height and blood pressure. Weight and height were measured on subjects wearing light clothes without shoes using a calibrated scale. Body mass index (BMI) was calculated as weight/height2 (kg m2), as an estimate of obesity. Blood pressure measurement was made on the right arm with Omron HEM-907 (Omron Healthcare Co., Ltd., Matsusaka-City, Mie, Japan) by a cuff-oscillometric method. Participants had rested for 5 min in the sitting position, and the average of two measurements was recorded as the examination blood pressure. The laboratory test included fasting blood glucose (FBG), total cholesterol (CHOL), triglycerides (TG), high-density lipoprotein (HDL), gamma-glutamyl transferase (GGT), serum albumin (ALB), serum globulins (GLO), hemoglobin (Hb), hematocrit (HCT), white blood cell count (WBC), lymphocyte (LC), neutrophil granulocyte (NGC) and monocyte (MONC).

Definition of hypertension

The outcome of interest was defined as first occurrence at any follow-up medical check-up of SBP ⩾140 mm Hg or DBP ⩾90 mm Hg or of the person taking antihypertensive medication.

Statistical analysis

For baseline characteristics, continuous variables were presented as mean (s.d.), and categorical variables were presented as frequency (percentage). Difference in baseline characteristics between individuals who developed hypertension and those who did not were compared by Student's t-test or t’-test for continuous variables and by χ2 test for categorical variables. We calculated the number of person-year as the sum of the follow-up times from the baseline to the onset of hypertension or the last health check-up. Cox proportional hazard models were used to estimate the unadjusted hazard ratios. Then, a set of risk factors were selected via backward cox proportional model, and we also detected the interaction term for DBP, SBP and BMI by age. To apply the multivariable model to predict the onset of hypertension, we evaluated the area under receiver operator characteristics curve (AUC) and the AUC after 10-fold cross-validation. Finally, based on the predictive probability, we divided study sample into four risk levels. We did sex-specific analysis for all the investigations. All statistical analyses were performed using SAS 9.2 (SAS Institute, Inc., Cary, NC, USA), and a two-sided P<0.05 was set as statistically significant.

Results

The baseline characteristics of study sample were shown in Table 1.The average age at baseline was 41.73 in men and 39.49 in women. The mean or frequency of most indexes in men was higher than those women except for GLO and HDL. Most women did not smoke or drink.

Table 1 Baseline characteristics of study sample

During the 38 958 person-years of follow-up, 2785 participants developed hypertension between 2005 and 2010. Figure 1 demonstrated the constructed prospective cohort and the occurrence of hypertension in detail. The incidence density of hypertension in this prospective cohort was 71.48 per 1000 person-year. Diagnosis of new-onset hypertension was based on increased SBP alone in 1199 (43.05%) participants, DBP alone in 534 (19.17%), and both SBP and DBP in 498 (17.88%). Among the participants with incident hypertension, the treatment rate was 19.89% (n=554) and the blood pressure control rate was 10.99% (n=306).

Figure 1
figure 1

Number of participants eligible for the prospective cohort at baseline and number of participants with onset hypertension during follow-up.

Table 2 presented that the incidence of hypertension during follow-up in men (2021 cases, 26.81%) was higher than in women (764 cases, 15.40%). The statistically-significant differences between non-hypertension and hypertension for the baseline characteristics were observed apart from HDL, Hb, current smoking in men, and HDL in women. Most listed baseline variables in hypertension were higher than non-hypertension other than ALB and HDL in both men and women.

Table 2 Baseline characteristics of men and women according to the occurrence of hypertension during follow-up

Gender differences were found in the incidence of hypertension, and the HR (95% confidence interval (CI)) of 1.857 (1.708–2.018) for men vs women. Further subgroup analyses in men and women subpopulation were structured. Table 3 illustrated the results of the association of related predictors with hypertension in unadjusted cox model. No statistically-significant differences were found in HDL, Hb, HCT and current smoking for men and HDL, LC and MONC for women.

Table 3 Associated risk factors for incident hypertension in men and women in unadjusted cox model

Furthermore, multivariable cox models were constructed with all the predictors using backward elimination. We finally got seven variables in men and nine in women, shown in Table 4. These factors were independent predictors of hypertension. The interactions of BMI by age and DBP by age were statistically significant for men. Increasing levels of BMI and DBP were associated with an increased risk for incident hypertension across all ages, but the slope of the association was steeper in younger than in older. We also considered the interactions of SBP by age in men and all above interactions in women which did not enter the models.

Table 4 Estimated coefficient, s.e., HR, 95% CI and significant levels in the multivariable cox model via backward elimination for the prediction of incident hypertension

In addition, ROC curve for the multivariable models was presented in Figure 2. The AUC (95% CI) was 0.761 (0.752–0.771) for men and 0.753 (0.741–0.765) for women. While after 10-fold cross-validation, the AUC (95% CI) was 0.760 (0.751–0.770) for men and 0.749 (0.737–0.761) for women.

Figure 2
figure 2

ROC curve for incident hypertension prediction models in men and women. (a, b) Presents the predictive power for men and women, respectively. (a) AUC (95% CI): 0.761 (0.752–0.771), z-statistic: 38.892; significance level P (area=0.5): 0.0001; cutoff point 0.3470 with sensitivity 0.724, specificity 0.672 and accuracy 68.6%; (b) AUC (95% CI): 0.753 (0.741–0.765), z-statistic: 23.741; significance level P (area=0.5): 0.0001; cutoff point 0.2262 with sensitivity 0.644, specificity 0.754 and accuracy 73.6%.

Finally, we divided study sample into four risk levels sex-specifically. As the distribution of predictive probability was not normal, we used quartiles as the dividing values. The first 25% participants were at low risk, and the following three parts were respectively at moderate, high and very high risk. The absolute risk and actual probability of incident hypertension of each risk level were shown in Table 5. The absolute risk of incident hypertension in 5 years at moderate, high and very high risk level was 2.13, 3.84 and 6.14 times of those who were at low risk in men, and 1.30, 2.56 and 6.01 times of those who were at low risk in women.

Table 5 Risk stratification and its predicted probability, absolute risk and actual probability

Discussion

With this prospective cohort study, we investigated the incidence of hypertension in northern urban Han Chinese population, identified the multi-marker of new-onset hypertension in this population and constructed multivariable prediction hypertension models sex-specifically resulting from 20 risk factors concerning different pathophysiological mechanisms. To our knowledge, seldom investigations examined so many risk factors to predict the onset of hypertension in northern urban Han Chinese population. The validity of our study was stressed by the prospective cohort analysis; sample size was very large; the blood pressure was obtained in a standardized manner in routine health check-up system by physician; the candidate predictors, which were available in routine health check-up were both biologically plausible and confirmed by previous works.

On the basis of previous national survey data in China, the prevalence of hypertension in adults has increased from 5.1% in 1959 to 11.3% in 1991 and 29.6% in 2009.6, 13, 14 Furthermore, an analysis of hypertension incidence trends demonstrated that high prevalence was partly owing to the increased incidence of hypertension. Results from a Chinese cohort study stated incidence of hypertension (per 1000 person-year) significantly increased from 29 in 1991–1997 to 53 in 2004–2009.4 In our study, the incidence of hypertension was 71.48 per 1 000 person-year, higher than that of previous study. This may be explained by the following reasons: first, our sample was got from the northern urban Chinese population. The northern Chinese population preferred to eat salty food and the prevalence and incidence were always higher than southern Chinese population and overall Chinese population.15 Second, our sample was based on health-check databases, which included a set of individuals who actively participated in health examination by personal expense. Therefore, our sample may be different from population who recruited from nine provinces through multistage, randomized, cluster sampling process in various aspects including health consciousness. Third, from the time point of view, the incidence may be higher in accordance with the trend of hypertension onset.4

Whether the incidence of hypertension is associated with gender is controversial. In our study, the future risk of incident hypertension was higher in men than in women, which was similar to a set of studies.16, 17 However, some investigations have suggested that women had a higher risk7 or sex was not a vital determinant.18 As expected, hypertension risk increased with aging and elevated blood pressure. The increasing future risk of hypertension with aging is due primarily to increased peripheral vascular resistance and vascular stiffness.19, 20 Previous investigations have demonstrated that baseline SBP and DBP are predominant determinants of incident hypertension, even each unit increase in the blood pressure above the optimal value of 119/79 mm Hg elevates the risk of onset hypertension.7, 12, 21 Of our study outcome, sole elevated SBP and sole elevated DBP accounted for 43.05% and 19.17%. Additionally, the interaction of DBP by age in men was statistically significant, which meant that the effect of DBP was more obvious at younger ages. These phenomena can be explained that SBP invariably rises with aging, while DBP tends to decline after fifth or sixth decade.20 Furthermore, the mechanism behind it may be resulted from large artery stiffness dominating vascular resistance and DBP falling.

BMI as general obesity index was higher in participant with hypertension than without hypertension at baseline in our study. Many investigations demonstrate that obesity is closely related to incident hypertension, and obesity is one of the most important independent risk factors of hypertension,7, 16, 22 which is consistent with our results. In addition, we discovered the interaction between BMI and age in men significantly. Just as DBP, increasing levels of BMI were associated with an increased risk for incident hypertension across all ages in men, but the slope of the association was steeper in young than in old. Unfortunately, measurements of abdominal obesity such as waist circumference and waist–hip ratio were absent in the present study, which were superior to BMI in predicting cardiovascular risk.23 Yet, a study has demonstrate that BMI was as strong as abdominal indicators in the relation of development of hypertension.22

Our prospective finding stated that current drinking was associated with the development of hypertension in both genders, and current smoking was only hazardous in women. This was determined by the baseline characteristics of our sample, which indicated no significant distribution difference of current smoking in men with or without future hypertension, so we may not get whether current smoking was harmful for men according to our results. While in many investigations, the risk of incident hypertension of current smokers has been validated much higher than non-smokers.7, 21, 24

Our study indicated that current drinking increased the risk of future hypertension, which was also suggested by other investigations.25, 26 Of note, another lifestyle factor, regular exercise was presented as a risk factor in unadjusted model, this may be caused by potential measurement misclassification in assessing exercise levels.

We can get many biomedical factors in routine health check-up whose effects are much originated from age, gender, smoking, drinking and other lifestyle factors. After backward elimination, GGT, FBG in men and FBG, CHOL, NGC in women were remained as secondary prevention biomedical factors. Quite a few investigators considered diabetes or FBG as a predictor of incident hypertension.12, 24 In our multivariable models, each unit increased in FBG, the risk of incident hypertension increased 11.6% for men and 12.3% for women. Potential relationships between glucose metabolism and hypertension seem to be multidirectional mediators involving impairing endothelium-dependent vasodilation, oxidative stress and insulin resistance.27, 28, 29 GGT has been reported to be a stronger predictor of incident hypertension, even among non-drinkers.30 Similarly, we found that GGT was significant for men in the multivariate model. Considering known functionality of GGT, this would be accounted for producing oxidative stress.30, 31 Among lipoproteins, HDL is a protective factor of incident hypertension, responsible for reversing cholesterol transport. Inversely, elevated CHOL and TG refer to atherosclerosis, endothelial dysfunction, damage of the renal microvasculature, which all infect the development of hypertension.32 With respect to our results, one unit increase in CHOL leads to 14.3% risk increment of incident hypertension for women in multivariable cox model. Our prospective finding demonstrated that leukocyte parameters were associated with incident hypertension in both men and women in unadjusted models. Only NGC was left in multivariable model for women and this was consistent with a Japanese study.33 The linkage between NGC and hypertension may be accounted for increased tendency to adhere to vascular endothelium and the release of ROS, which refers to vascular resistance and oxidative stress.34

Our final multivariable models based on northern urban Han Chinese population consisted of demographic, lifestyle and biomedical factors. Meanwhile, the performance of the final models was acceptable, whose AUC was 0.761 (95% CI, 0.752–0.771) for men and 0.753 (95% CI, 0.741–0.765) for women. Furthermore, AUC after 10-fold cross-validation was approaching to values of the original models. Despite Framingham hypertension risk score was most widely used to predict the risk for near-term incident hypertension, the source population was Caucasian. Investigators have presented that prediction models developed using certain ethnic groups tend to overestimate or underestimate the incidence in other ethnic groups.12, 24 Even in ethnic Chinese, the 15-year risk model of incident hypertension constructed by samples from 11 provinces and cities of China did not include so many biochemical predictors in the beginning.11 Conversely, the prediction model based on Chin-Shan Community Cardiovascular Cohort Study in Taiwan was lacking of lifestyle risk factors.12 As a whole, our final multivariate model was much rational. Other detailed information about our prediction model was available in the Supplementary Appendix.

According to the current health-care system in China,35 much have been invested in the treatment for non-communicable chronic diseases, especially cardiovascular disease. It was urgent to take actions on prevention of cardiovascular disease. Prevention of hypertension onset may represent primordial prevention of cardiovascular disease. Hypertension prediction models can be used to select people at high risk for personalized intervention, to estimate future burden of hypertension and to assist in projecting hypertension prevention strategies and allocation of resources in the light of hypertension risk at a population level.

Nevertheless, several limitations of this study must be considered. First, we selected the predictors that available in routine health check-up, yet did not include many predictors such as family history, serum creatinine, uric acid, C-reactive protein, education, sodium intake, psychosocial factors and genetic markers. As a large set of predictors were associated with others, variables failed to be retained in the final model resulting from non-significant relation with the onset of hypertension. Second, most of the participants who had regular check-ups were younger than 50 years, and they may be a healthier subset of the general population.

In conclusion, we identified the multiple predictors of hypertension and established hypertension prediction models sex-specifically with a prospective cohort study. Our results were consistent with the comprehensive mechanisms of onset of hypertension. The prediction model may give a rough risk of hypertension and facilitate prevention of hypertension and other cardiovascular diseases.