Introduction

Atrial fibrillation (AF), one of the most common cardiac arrhythmia in clinical practice, has emerged as a major public health problem and is closely related to stroke, mortality, decreased quality of life and a high healthcare cost burden.1 In the last 20 years, the prevalence of AF was increasing worldwide1, 2 and affected ~5–8% of those older than 65 years in western countries.3, 4, 5, 6 It was estimated that the number of patients with AF in 2030 in Europe would be 14 to 17 million and the number of incident AF for 1 year would be 120 000 to 215 000.1 In China, the updated information on the prevalence of AF is 3.5% for those above 65 years old and the age-standardized prevalence of AF in China (⩾30 years) is 0.65%.7 Although the prevalence of AF is lower in China than in western countries, the prevalence of AF has increased 20-fold over an 11-year period in China,8 and AF burden as well as the risk of AF-related stroke have increased significantly over the past decade.8 The increasing trend of AF challenges the prevention and control of cardiovascular disease, and identifying high-risk individuals for targeted prevention of AF is useful in the primary prevention.

Various prediction models have been constructed to assist clinicians and epidemiologists in assessing individual’s risk of AF. The most widely used model is the 10-year risk score in the Framingham Heart Study9 and the more recent one for 5-year risk of AF was the Cohorts for Heart and Aging Research in Genomic Epidemiology-AF scores.10 All these prediction models were derived from theUS population and validations in minorities in the United States have shown poor calibration and moderate discriminatory accuracy.11 To our knowledge, no studies have evaluated the applicability of these prediction models in Asia population and there is no AF prediction model derived from Asia population. As both the incidence and risk burden of AF vary by regions, races and social economic status,2 it is necessary to investigate an applicable AF prediction model in Chinese population, to help estimate AF risks and identify high-risk individuals for primary prevention of AF in clinical and public health practice.

Therefore, in this study, we aimed to investigate the incidence of AF in the Chinese health check-up population, as well as to construct an AF prediction model using behavioral, anthropometric, biochemical and electrocardiogram (ECG) markers available in the routine health check-up.

Subjects and Methods

Study population, cohort design and diagnosis of AF

The study was based on Shandong multi-center health check-up longitudinal study. The database of Shandong multi-center health check-up longitudinal study include individuals participated in routine health check-up at several hospitals from 2004 to 2014. The characteristics of participants in this cohort included Han Nationality, occupational or retired (having jobs or retirement) and living in urban area. A total of 95 874 participants who visited the health check-up for at least two times with ECG examination in the health check-up from 2004 to 2014 were included in this study. Among these participants, 252 persons were excluded for having AF or atrial flutter at baseline; therefore, 95 622 participants were included as original cohort to get age-specific incidence. As the incidence of AF was very low before 45 years (see Figure 1), 33 186 participants aged 45–85 years were included in the final analysis (see Figure 2 for the framework of this cohort). Incident AF cases were ascertained from a rest 12-lead ECG in an annually regular clinical and laboratory examination, with atrial flutter and AF diagnosed by ECG as AF cases.

Figure 1
figure 1

The incidence of atrial fibrillation by age in the original cohort (n=95 622).

Figure 2
figure 2

The framework of cohort including 33 186 participants aged 45–85 years for final analysis.

The study protocol was approved by ethics committee of School of Public Health, Shandong University. Written informed consent was obtained from all participants.

Measurements

All participants in the health check-up underwent a standardized routine health questionnaire, anthropometric measurements, laboratory tests and a rest 12-lead ECG. Behavioral factors such as smoking, drinking and disease history were obtained from self-reported questionnaire. Anthropometric measurements included weight, height and blood pressures. Body mass index was calculated by dividing weight in kilograms by height in meters squared. Systolic blood pressure (SBP), diastolic blood pressure (DBP) and heart rate were measured twice on the right arm after 5 min of rest in sitting position. We defined visit-to-visit variability (VVV) in blood pressure as the s.d. of blood pressure measurements during follow-up. Blood samples were taken after at least 12 h overnight fasting for laboratory tests. Red blood cell count, white blood cell count, hemoglobin, hematocrit, platelet count, fasting blood glucose, total cholesterol, triglycerides, high-density lipoprotein, low-density lipoprotein, serum creatinine, alanine transaminase, total bilirubin, uric acid, serum albumin, serum globulin, blood urea nitrogen were included in this analysis.

Smoking was coded as ever smoker and never smoker, and drinking was coded as current drinker and current non-drinker (never drinker and ex-drinker). History of coronary heart disease (CHD) was defined as having history of physician-diagnosed CHD. Hypertension was defined as having history of physician-diagnosed hypertension or a mean SBP ⩾140 mm Hg, or a mean DBP ⩾90 mm Hg.12 Diabetes was defined as a history of physician-diagnosed diabetes or having a fasting plasma glucose ⩾7.0 mmol/l (126 mg dl−1).13

A rest 12-lead ECG was performed on all participates in the supine position with a paper speed of 25 mm s−1 and amplification of 0.1 mV mm−1. All ECG reports were interpreted and coded according to Minnesota Code Classification system by two independent cardiologists (see Supplementary Information for details).14 If there was any disagreement between two readers, a third expert would be involved to obtain a consensus interpretation.

Statistical analysis

The age-specific incidence was based on original cohort (n=95 622). For baseline characteristics of final analysis data (n=33 186), continuous variables were presented as mean (s.d.) and categorical variables were presented as frequency (percentage). We calculated the number of person-year as the sum of the follow-up times from the baseline to the onset of AF or the last health check-up. Differences in baseline characteristics between incident AF and non-AF were compared by t-test for continuous variables and χ2-test for categorical variables.

Selection of eligible predictors of AF was based on prior reports. Information on self-reported questionnaire (smoking, drinking and history of CHD), anthropometric markers (body mass index, SBP, DBP, heart rate, VVV in SBP and VVV in DBP), biochemical examinations (fasting blood glucose, total cholesterol, triglyceride, high-density lipoprotein, low-density lipoprotein, uric acid, total bilirubin, serum creatinine, serum albumin, serum globulin, blood urea nitrogen, hemoglobin, white blood cell, red blood cell and platelet count), ECG markers and diagnosis of hypertension and diabetes were included as candidate predictors of AF. First, Cox proportional hazard models were used to estimate the age- and sex-adjusted hazard ratios of each risk factor. Then, significant risk factors were selected into multivariable cox proportional model and backward elimination method was used to deal with highly collinearities of covariates. For the validation of prediction model, receiver operating characteristic curve analysis was applied. The area under receiver operating characteristic curve (AUC) was used to estimate the discriminative accuracy of AF prediction model. The predicted 2-year risk of each individual was estimated and the ratio of observed risk with predicted risk as well as Hosmer Lemeshow test were used to evaluate the calibration.

P-values <0.05 were considered statistically significant. All data analyses were performed by SAS statistical software version 9.4 (SAS Institute, Inc., Cary, NC, USA).

Results

Figure 1 demonstrates the incidence of AF by different age groups for original cohort (n=95 622). As the incidence of AF was in a very low level before 45 years (<0.40 per 1000 person-years), the final analysis included participants aged 45–85 years for developing AF prediction model (n=33 186, see Supplementary Table S1 for the characteristics of original health check-up database).

Table 1 presents baseline characteristics of the cohort (n=33 186). Among the 33 186 participants aged 45–85 years, 66.96% were men and the mean age was 56.69±9.85 years. A total of 2.73% had a history of CHD, 44.87% had hypertension, 12.61% had diabetes, 34.03% had dyslipidemia and 50.17% were ever smokers, whereas 45.48% were current drinkers. The means and s.d. of biomedical biomarkers are presented in Table 1. Age, sex, SBP, VVV in SBP, VVV in DBP, alanine transaminase, uric acid, serum creatinine, blood urea nitrogen, platelet count, history of CHD, hypertension, left high-amplitude waves, AV conduction defect and arrhythmias including premature beats were significantly different between incident AF and non-AF groups (P<0.05).

Table 1 Baseline characteristics of the cohort

Among these 33 186 participants, during the 83 945 person-years of follow-up (median=2.6 years, see Figure 2), 134 AF cases were observed and the crude incidence density of AF was 1.60 per 1000 person-years for individuals aged 45–85 years.

Table 2 shows the age- and sex-adjusted hazard ratios of AF. Several factors were associated with incident AF at P<0.05 level after adjusting age and sex, including age, sex, history of CHD, hypertension, SBP, DBP, VVV in SBP, VVV in DBP, left high-amplitude R waves and premature beats. We observed a higher risk of incident AF in men, older subjects, those with a history of CHD, higher baseline blood pressure, higher VVV in blood pressures and individuals with ECG features of left high-amplitude R waves and premature beats.

Table 2 Age- and sex-adjusted Cox model for prediction of atrial fibrillation

The hazard ratios and P-values corresponding to the prediction models of AF were presented in Table 3. We first developed a simple prediction model by using age, sex, history of CHD and hypertension as predictors (Simple model). Then, by using backward elimination of multivariable Cox regression, with age and sex, history of CHD, hypertension, SBP, DBP, left high-amplitude R waves and premature beats as candidate predictors, hypertension was selected into the model, together with age, sex, history of CHD, left high-amplitude R waves and premature beats (ECG model). Then, we added VVV in SBP and VVV in DBP as candidate predictors and also using backward elimination, age, sex, history of CHD, VVV in SBP, VVV in DBP, left high-amplitude R waves and premature beats were independent predictors of AF and finally entered into the AF prediction model (VVV model). In the VVV model, the hazard ratios of age, male, history of CHD, VVV in SBP, VVV in DBP, left high-amplitude R waves and premature beats were 1.069, 2.065, 1.844, 1.033, 1.069, 2.338 and 2.378, respectively. Supplementary Figures S1–S3 present the score system of the simple, ECG and VVV models, respectively.

Table 3 Prediction models of atrial fibrillation

The AUC of three prediction models were 78%, 80% and 82%, respectively, and after 10-fold cross-validation, the AUC became 77%, 78% and 79% for predicting risk of AF. The AUC showed good discriminatory accuracy for the prediction of AF risk. The calibration (ratio of observed risk with predicted risk) of these three models were 1.001 (P for Hosmer Lemeshow test=0.8032) for simple model, 1.002 (P=0.8673) for ECG model and 1.004 (P=0.5396) for VVV model.

Discussion

Our analysis of the Shandong multi-center health check-up longitudinal cohort showed the AF incidence density of 1.60 per 1000 person-years. For incidence of AF in China, previously, only one cohort study was conducted in China to explore the incidence of AF. It revealed that the incidence of AF in China was 0.5 and 0.7 per 1000 person-years for participants aged ⩾20 and ⩾50 years, respectively, by using 2001 medical insurance database.8 In our study, we observed 134 incident AF cases during 83 945 person-years follow-up in 2004–2014, with AF incidence density of 1.60 per 1000 person-years. As data on incidence of AF in China was limited, this study was valuable for providing updated AF incidence data for urban Han Chinese population based on health check-up database.

A series of behavioral, clinical, biochemical and ECG markers had been identified as risk factors of incident AF, such as markers associated with atrial stress, inflammation, kidney function, CHD and so on.15, 16, 17, 18 In this study, we tested all behavioral, anthropometric, biochemical and ECG factors available in the routine health check-up. After adjusting age and sex, history of CHD, hypertension, SBP, DBP, VVV in SBP, VVV in DBP, left high-amplitude R waves and premature beats were significantly associated with incident AF. The associations of age, sex, CHD and hypertension with risk of AF were well established and age, sex, history of CHD and hypertension had been included in various prediction models to estimate risk of incident AF.19, 20 Beyond hypertension, the VVV in blood pressure has emerged as a significant predictor of overall mortality and cardiovascular events, and has proven to be correlated with subclinical markers of vascular dysfunction.21, 22 For ECG markers, left high-amplitude R waves indicating left ventricular hypertrophy was a predictor for risk of several adverse cardiovascular events.23 Atherosclerosis Risk in Communities study also included ECG-defined left ventricular hypertrophy as one predictor of incident AF.24 Premature beats tested by ECG was a new predictor of AF found by our study. Premature beat was a common type of arrhythmias in clinical practice and had been identified to be an independent predictor of AF in patients with cryptogenic stroke.25 By adopting multivariable Cox regression, we developed three prediction models, with age, sex, history of CHD and hypertension as independent predictors (simple model); with age, sex, history of CHD, hypertension, left high-amplitude R waves and premature beats as independent predictors (ECG model); and with age, sex, history of CHD, left high-amplitude R waves and premature beats, VVV in SBP and VVV in DBP as independent predictors (VVV model). All the predictors were independently associated with the risk of AF and finally included in the prediction models of AF.

To the best of our knowledge, this study was the first large prospective cohort study conducted in Asian countries, which aimed to develop prediction model of incident AF. Three predictive models had been developed to predict AF risk derived from US population. The first prediction model was Framingham Heart Study score, which was derived from White Americans, getting an AUC of 0.78 for predicting AF.9 Then, it had been evaluated in White and non-White US population and developed the second score.24 In 2013, three cohort studies in the United States were pooled and got the Cohorts for Heart and Aging Research in Genomic Epidemiology-AF model, with AUC of 0.747.26 These prediction models were all based on US population and none had been validated in Asian population. It was well-known that prediction models developed in certain ethnic groups tend to overestimate or underestimate the risk in other ethnic groups.27, 28, 29, 30 As the epidemic of AF in China is quite different from the United States, we have reasons to suspect the applicability of these three AF prediction models in Chinese population. As there was no risk prediction model derived from Chinese population, in this study we aimed to develop risk prediction models by using Chinese health check-up database. The calibration of our models were good, with the ratio of observed risk with predicted risk range from 1.001 to 1.004 (P>0.05). The AUC of our models were 78, 80, and 82%, indicating that the discriminatory accuracy was good in predicting incident AF.

There were several limitations to be considered. First, the participants of this cohort were urban inhabitants occupational or retired in Shandong province and only participants having at least two health check-ups were included in the cohort; the applicability of our results should be validated by further studies. Second, the follow-up duration was relatively short, so we could not predict long-term risk of incident AF. Third, lifestyle habits such as physical activity and novel biomarkers such as B-type natriuretic peptide were not measured in routine health check-up; therefore, we could not explore the predictive ability of these predictors in this study. Fourth, information on antihypertensive treatments was not available in our study, so we could not compare the estimated AF risks of our prediction model with previous developed models.

The strengths of this study included its large sample size, prospective design and abundant potential markers for predicting AF risk. In addiition, our study was the first cohort study conducted in China aimed to develop prediction model to estimate risk of AF and also provided updated AF incidence data for urban Han Chinese population based on health check-up database.

In conclusion, our findings identified predictors of incident AF and developed risk prediction models for AF, which adequately predicted incident AF in health check-up population. The models could help to identify high-risk individuals and define targeted population for primary prevention of AF in clinical and public health practice.