Introduction

Diabetes mellitus (DM) is a well-established independent risk factor for cardiovascular disease (CVD) [1,2,3]. To reduce CVD morbidity and mortality among DM patients, multifactorial preventive strategies should be adopted according to one’s specific risk profile. These facts yield a need for reliable and accurate CVD risk assessment to inform patients about their risk status and guide effective and cost-saving preventive interventions. The “coronary heart disease (CHD) risk equivalent” approach [4], or the idea of taking DM as a universally high-risk group as those who already have CHD, is the most simplistic way to classify CVD risk for patients with DM. Today, contradictory evidence suggests an overall lower CHD risk among patients with DM compared to those without DM but with a prior CHD [5]. Even within the diabetic population, heterogeneous CVD risk exists due to comorbidities or severity of the DM itself [6]. This has made the risk scoring tools an appropriate approach in CVD risk assessment in persons with DM.

Over the past decades, various CVD risk scoring systems have been developed to help CVD risk assessment for persons with DM [7]. In this review, we briefly introduce the concept of CVD risk and its use in the clinic and in research. We then thoroughly review the methodological advances in the development and validation of CVD risk scores for persons with DM. We also present the latest CVD risk scores specifically for the DM population from different countries and regions.

Cardiovascular Disease Risk Scores

CVD risk scores, sometimes called CVD risk engines or CVD risk calculators, are continuous scores that integrate CVD-related risk factors with different weights and are used to estimate the risk of CVD. The Framingham risk score (FRS) was the first developed CVD risk score, and since then, many other CVD risk scores have been developed to predict CVD risk for different populations [8,9,10,11]. The Reynolds Risk Score was originally designed for CVD-free women [12] and the UK Prospective Diabetes Study (UKPDS) risk engine was specifically created for the diabetic population [13]. Other scores were developed for those with prior CVD [14]. Some countries usually use their own population to develop a CVD risk score instead of borrowing an outside calculator [15, 16].

A well-constructed CVD risk score may be referenced in CVD management guidelines. In the clinic, CVD risk scores are often used by health care providers to help identify the high-risk population and further guide the preventive treatment [17]. For instance, the 2013 ACC/AHA cholesterol management guideline recommends initiation of statin treatment among those 10-year atherosclerotic CVD risk ≥ 7.5% [18]. Most recently, these risk scores have been made as online risk calculators or mobile apps that allow non-professional individuals to evaluate risk. Such self-awareness of disease risk may be a good impetus to promote risk-reduction behaviors. In research, existing risk scores are usually used as the reference model to be compared when new CVD risk model is being developed. Another important use of CVD risk scores is similar to exposure-based propensity scores to reduce the dimension of individual covariates to control confounding [19].

Development and Validation of CVD Risk Score for DM Patients

There are important components and criteria for developing CVD risk scores [20, 21]. D’Agostino et al. summarized the complete process of development and validation for a CVD risk score into the following 10 steps: (1) endpoint (event/outcome), (2) at-risk population, (3) follow-up time, (4) risk factors, (5) mathematical model, (6) estimation (relative and absolute risks), (7) performance (discrimination, calibration), (7) internal validation, (8) external validation (performance and recalibration), (9) new markers, and (10) long-term prediction [21]. With regard to CVD risk scores for DM patients, we often observe variable methods when carrying out these steps and the choice of selection often involves different considerations.

Outcome

For current risk scoring systems, various endpoints have been used such as total CVD, hard CHD, and hard atherosclerotic CVD. It seems that there is no best choice on whether we should include soft events in CVD endpoints among DM patients. DM patients are featured with longer asymptomatic or preclinical period for CVD and suffer more silent coronary heart events than their non-diabetic counterparts. Therefore, only counting the hard events may not catch all the true events and lead to the underestimation of the risk score and thus bad calibration. On the other hand, the softer endpoints such as percutaneous coronary interventions, bypass surgery, and coronary revascularization are more prone to be misclassified and lead to poor discrimination, and the time of occurrence of certain endpoints such as angina, angiographic disease, or coronary calcium can be quite uncertain. The FRS/UKPDS validation study in the ADVANCE cohort did find that when any CVD event was used as endpoint instead of hard CVD event, calibration was better while c-statistics (discrimination) were poorer [22]. In addition to these commonly seen endpoints, CVD mortality, CHF, and even microvascular complications have been used in risk models for diabetes patients.

At Risk Population

It seems that the choice of target population is obvious here, which should be patients with diabetes. However, the most commonly used CVD risk scores in clinic are usually those designed for the general population that includes both those with and without DM. Such risk scores usually include DM as a binary factor, ignoring its heterogeneity in risks and interaction with other risk factors. In contrast, CVD risk scores specifically developed for DM patients include more DM-related variables such as HbA1c and DM duration. A study that directly compared the performance of general population vs diabetes-specific CVD risk models in DM patients showed discriminatory advantage of diabetes-specific over general population-based models for CVD risk stratification in diabetes [23].

The type of DM also matters. In fact, there are many more CVD risk scores for T2DM patients than for all DM patients or T1DM patients. One reason is that many of these risk scores were developed using large sample-sized clinical trial cohorts, which specifically targets T2DM [13, 24••, 25]. Such clinical trial data includes information on many cardiometabolic variables, thus providing great pool of candidate predictors when developing a risk score. Although observational studies include different types of DM patients, the small proportion of DM patients, missing information on DM type, and fewer DM-related variables limits the use of such data to develop risk scores for all DM patients. Given that T1DM are much fewer than T2DM in the population, CVD risk scores for T1DM usually use data from medical record, or national health registry [26, 27].

Candidate Risk Factors

In addition to traditional risk factors seen in general population, DM severity indicators, i.e., HbA1c, retinopathy, and kidney functions, i.e., urinary albumin creatinine ratio (UACR) are commonly found in CVD risk scores for DM patients. Such measures may be relatively normal among non-DM subjects and do not have as strong a predictive value as in DM patients. On the contrary, various measures that indicate the severity and complications of DM may play important role in CVD risk assessment for those with DM.

Novel risk predictors include biomarkers and subclinical atherosclerosis measures. We previously reviewed the advances of these novel risk factors in risk assessment, including C-reactive protein (inflammatory biomarker), lipoprotein (a), low density lipoprotein and high density lipoprotein particles, lipoprotein-associated phospholipase A2 (lipid biomarkers), coronary artery calcium, carotid intima media thickness (subclinical atherosclerosis measures), etc. [7]. Biomarkers representing pathophysiological processes of atherosclerosis, such as growth differentiation factor 15, N-terminal pro B-type natriuretic peptide, and high-sensitive troponin T, were also found to enhance CVD risk prediction among diabetics [28]. All these potential new risk factors were independently associated with future CVD risk; however, whether or not they can improve a risk model needs further examination. Yeboah et al. investigated the additional predictive ability of six novel risk factors in the general population using C statistics and found CAC to be the single strongest predictors for CVD events in addition to traditional risk factors [29]. Fewer studies attempted to develop CVD risk scores to formally include these novel risk factors. The MESA-HNR Diabetes CHD Risk Score explored multiple subclinical atherosclerosis measures and CAC measure was added in the final risk score [30].

Unlike certain types of cancer, CVD may have multiple genetic predispositions. Therefore, single gene is unlikely to have predictive ability for CVD events. However, other forms of genetic information may be included. With the help of genome-wide association studies (GWAS), different single-nucleotide polymorphisms (SNPs) can be identified and then summarized into a weighted genetic risk score (GRS) [31]. Such GRS are found to be correlated with CVD independent of traditional risk factors in various studies. When incorporated with traditional risk factors, prediction (assessed by c-statistics) was found to be improved [32].

Risk Estimation

Risk estimation involves the selection of risk factors and calculation of risk scores. A wide variety of methods have been used to select the risk factors to be included. The purpose of risk factor selection is not only to include the risk factors that are associated with CVD events and can improve the accuracy of risk scores, but also to control the total number of risk factors to avoid over-complex equation or high cost related to the tests. The most commonly used selection method is a stepwise selection based on a prespecified p value. More recently, machine learning methods have gained popularity and were increasingly used to select risk factors. The Pooled Cohort Equation adopted by current AHA/ACC guidelines used random survival forest method to determine the “importance” parameter of potential risk factors [17].

Once the risk factors are selected, they need to be integrated into a single-risk score using certain mathematical models. The early Framingham Risk Score for CHD events used logistic regression model to generate the score [33]. Logistic regression model treated event as binary outcome and neglects the timing of event. Poisson regression models have also been used alternatively to estimate incidence rates assuming homogenous risk over time. Currently, the most widely used model is the Cox proportional hazard regression model, which makes no assumption about baseline risk and time. In the Cox model, a baseline survival at the target time frame, i.e., 10 years, is needed. Other models involving event time such as Poisson model, Weibull model, and accelerated failure time model are sometimes used instead of the Cox model.

Evaluation of Risk Score Performance

Binary outcome and time-to-event outcome have a number of evaluation methods regarding prediction performance such as R2, goodness of fit test, mean squared error, C-statistics, etc. C-statistics are routinely reported in almost all risk scores’ performance evaluation stage. The Harrell’s C-statistic is specially designed for time-to-event outcome and is similar to area under the ROC curve for binary outcome which lies between the value of 0.5–1. The C-statistic enables the quantification of predictive ability of risk scores but is also found to be conservative in pair-wise comparison when the two compared models have large overlap of covariates or when the old model is already good enough [34]. The net reclassification index (NRI) has been frequently used since its introduction in 2008. Compared to C-statistics, the categorical NRI has greater clinical relevance and importance when two risk scores are compared. However, NRI, especially category-free NRI, sometimes suffers a false positive problem in reporting significant results [35].

Before a CVD risk score can be used in diabetic patients, the score also needs to be tested in other external DM cohorts. Performance of a risk score is more heterogeneous and generally poorer in external validation. Since many of the CVD risk scores for the general population can also be used in DM patients, they usually serve as a reference model in head-to-head comparisons with the DM-specific CVD risk models. More importantly, all these risk scores were derived from older cohorts dated back years ago during which the baseline risk factors, disease incidence and preventive management were notably different from the contemporary population. This temporal disparity may also have a negative impact on the performance of risk scores.

An Update: New CVD Risk Scores for DM Patients in Different Countries

In 2012, Van Dieren et al. systematically reviewed 45 CVD risk scores that can be used for T2DM population, among which 12 were T2DM-specific CVD risk scores [36]. Here, we summarize the newly emerged risk scores for both T1DM and T2DM as well as updates of the old DM-specific CVD risk scores (see Table 1). These new CVD risk scores have used new types of endpoints, new dataset from contemporary cohorts, or elongated follow-up from original derivation cohort.

Table 1 Cardiovascular risk prediction scores in diabetes

United Kingdom

The UKPDS risk engine is the earliest and the most well-known CVD risk scoring system for patients with diabetes. The first UKPDS risk engine was dated back to 2001 and was developed to separately predict 10-year CHD and stroke instead of composite CVD events [13, 45]. Post-trial follow-up of UKPDS cohort continues contributing to a new set of CVD risk scores called UKPDS Outcomes Model (UKPDS-OM). The current version of OM is OM2, which has significant distinction from the old UKPDS risk engine [37]. The OM2 predicts lifetime risk of seven primary complications of myocardial infarction, ischemic heart disease, stroke, congestive heart failure, amputation, blindness, and renal failure. Additional models were also developed for total mortality, diabetic ulcer, and some second events. OM2 does not provide estimate of risk for composite CVD events. The study directly compared observed vs. predicted event for internal validation. External validation showed tendency of underestimation of risk [46].

England researchers developed a risk score as part of QDiabetes risk score for 10-year heart failure risk from 437,806 diabetic patients in general practice [38]. Unlike the OM2 for heart failure, the model can be used in both T1DM and T2DM and in both with or without prior CVD. The risk score included common risk predictors of age, BMI, SBP, cholesterol/HDL ratio, HbA1c, material deprivation, ethnicity, smoking, duration and type of DM, atrial fibrillation, cardiovascular disease, and chronic renal disease. External validation on two large cohorts from England showed satisfactory AUC (ranged 0.76–0.78) and excellent calibration.

United States and Canada

To date, only two CVD risk scores were developed for DM populations in the USA and both have used the Atherosclerosis Risk in Communities Study (ARIC) cohort. The old one predicted CHD risk with a basic model of 8 predictors and a full model of 17 predictors [39]. The new ARIC CVD risk score for DM estimated 10-year CVD risk and successively explored four clusters of predictors with four models [47]. Model 1 with 13 self-reported risk factors had C-statistics of 0.667 and model 4 with self-report risk factors, clinical measured risk factors, HbA1c and 12 novel biomarkers reached a C-statistic of 0.714. Each inclusion of an additional set of predictors significantly improved C-statistics.

Basu et al. developed a comprehensive risk scoring system named RECODe (Risk Equations for Complications Of type 2 Diabetes) for microvascular and macrovascular complications using the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial cohort recruited from USA and Canada [24••]. Multiple risk scores were developed to predict 10-year risk of each microvascular complications including nephropathy, retinopathy and neuropathy, and macrovascular endpoints of myocardial infarction, stroke, congestive heart failure, and cardiovascular mortality as well as composite endpoints of ASCVD and total mortality. Internal and external validation achieved moderate discrimination and good calibration. When compared with UKPDS and PCE, statistically significant improvement in NRI was observed [48]. The RECODe risk score for ASCVD events includes CVD history as one significant predictor, therefore may be applicable to those with past history of CVD.

Another set of risk assessment models for DM was developed using the Diabetes Control and Complications Trial and the Epidemiology of Diabetes Interventions and Complications (DCCT/EDIC) cohort from 29 clinic centers in the USA and Canada [40]. Although the developers claimed the models for all DM patients, the whole derivation cohort is only composed of T1DM. The risk models predict micro- and macrovascular complications as well as adverse events of hypoglycemia and ketoacidosis that are commonly seen in T1DM. External validation showed general better performance in T1DM than T2DM with the exception of predicting long-term CVD events.

Sweden

Two CVD risk scores for T2DM and T1DM were developed in Sweden [27, 49]. The Swedish National Diabetes Registry (NDR) risk score for T2DM was featured with the wide age range from 18 to 70 years old and simple achievable risk profiles, with HbA1c as the only lab test [49] to predict first-time CVD in 5 years. An important difference of the T1DM NDR risk score is that it is applicable to T1DM patients both with and without prior CVD. In this model, CVD history was identified as an independent predictor with seven common predictors including DM onset age, diabetes duration total cholesterol/HDL-C ratio, HbA1c, SBP, smoker, and macroalbuminuria. External validation showed excellent discrimination and calibration [27].

Germany

In Germany, a CVD mortality risk score for DM patients, called VILDIA risk score, investigated 135 potential risk factors (mostly biomarkers) and eventually selected NT-proBNP, age, male sex, renin, diabetes duration, Lp-PLA2, and 25-OH vitamin D3 using bootstrapping stepwise selection [41]. The VILDIA CVD mortality risk score showed better discrimination for CVD mortality in an external German cohort than UKPDS risk engine. Since the UKPDS risk engine and the VILDIA risk score predicted different endpoints, it is usually not recommended to make such direct comparison.

Denmark

The Steno Type 1 Risk Engine, a 5-year CVD risk score for T1DM in Denmark, was developed with outpatient clinic data and has shown excellent performance [26]. It is so far the largest CVD-free T1DM cohort used to develop CVD risk score. The risk engine includes age, sex, diabetes duration, SBP, LDL-C, HbA1c, albuminuria, glomerular filtration rate, smoking, and exercise.

China

Three independent 5-year risk scores for CHD, HF, and stroke were previously developed for T2DM patients using population-based data in Hong Kong [36]. Recently, Hong Kong researchers developed new 5-year CVD risk scores for T2DM patients [43•]. The baseline derivation cohort was in 2010 and therefore is the most recent derivation cohort among all CVD risk scores. The set of risk scores had two versions with one using fewer risk factors. In both models, age, smoking, HbA1c, SBP, TC/HDL-C ratio, and eGFR were identified as predictors. The complete model additionally included diabetes duration, usage of anti-hypertensive drug and insulin, BMI, DBP, and UACR. The new model showed superior discrimination and calibration ability to other CVD risk scores for DM patients. The risk scores still need to be validated among DM patients in mainland China before being applied in the whole country.

ADVANCE Risk Score and AD-On Risk Score

The Action in Diabetes and Vascular Disease (ADVANCE) clinical trial enrolled high CVD risk participants with DM from across 20 countries in Asia, Australasia, Europe, and Canada. Despite the short follow-up of main trial, ADVANCE researchers managed to develop a 4-year CVD risk score from ADVANCE participants that may be applicable to T2DM in diverse countries [25]. Retinopathy was identified as one of the DM-related predictors in the final model. In external validation, the ADVANCE CVD risk score showed barely modest discrimination and a tendency to underestimate risk regarding calibration, partially due to the BP and glycemic intervention [50]. ADVANCE researchers also developed 5-year risk scores for early and late stage of renal disease [51]. C-statistics were 0.847 for major kidney-related events and 0.647 for new-onset albuminuria.

With the longer post-trial follow-up, ADVANCE-On Project further developed 10-year risk scores for CVD and major renal disease [44]. In the new risk score, age, sex, SBP, antihypertensive medication, duration of diabetes, HbA1c, UCAR, eGFR, age at completion of formal education, exercise, history of diabetic retinopathy, and atrial fibrillation were included. Discrimination of the 4-year and 10-year risk scores were similar for both 4-year and 10-year events. The risk scores for renal events performed better with C-statistics around 0.8 mainly due to the highly predictive renal function measures like eGFR and UACR.

Conclusion

In the era of personalized health care and evidence-based medicine, discussion of future CVD risk with DM patients with a quantitative, easy-to-understand risk score based on the patient’s own risk profile is important for informing much appreciated. Popularity of a risk score generally depends on the cost and convenience of those tests used in the risk score. For risk scores based on routine tests, calculation of the risk score brings no additional medical expenditure and is usually favored. When specific tests such as CAC scanning or genetic tests are needed for the purpose risk assessment, a clinical trial and cost-effectiveness assessment should be carried out, specifically for the DM population, to demonstrate the net benefit on outcome before the advocation of the risk score.