Introduction

The association between exaggerated blood pressure (BP) reactions to acute psychological stress and hypertension is well established. Supporting evidence comes from several independent epidemiological data sets that have shown exaggerated systolic (SBP) and/or diastolic (DBP) BP reactivity to acute psychological stress to be linked with increased resting BP at 6.5- and 12-year follow-up1, 2 and to predict hypertension diagnosis at 13-year follow-up.3 In addition, being ‘high risk’ for developing hypertension based on parental history or having elevated resting BP is associated with exaggerated BP stress reactivity.4 Importantly, a large meta-analysis also has established a positive association between BP stress reactivity and hypertension.5

In contrast, the relationship between stress-induced heart rate (HR) reactivity and hypertension remains equivocal. Relatively increased HR reactivity has been observed among individuals with parental history of hypertension6 and several small-scale studies have reported a positive association between HR stress reactivity and increased 1-year ambulatory SBP7 and incident mild hypertension.8 However, a relationship between HR reactivity and elevated BP has failed to emerge from epidemiological studies3 or meta-analysis.5 Further complexity is added by findings of negative associations between HR stress reactivity and hypertension risk factors such as obesity and the use of addicting substances such as alcohol and tobacco. In each case, the obese,9 smokers10 and those dependent on alcohol11 all exhibited blunted rather than exaggerated HR responses to acute psychological stress. Accordingly, it may be timely to take a more nuanced look at the relationship between cardiovascular stress reactivity and hypertension.

It has been suggested that focusing on a single cardiovascular reactivity variable may be limiting in scope, as evidence has shown that different patterns of end-organ responses carry differential risk for disease and that focusing on multivariate patterns of stress reactivity may be more informative.12 With regard to BP and HR this makes sense given that these variables are not independent but, in fact, profoundly influence each other; increases in cardiac output increase BP and changes in BP influence HR via baroreceptor mechanisms.13 However, the wide interindividual variation in normal patterns of HR and BP stress responses makes it challenging to define homogeneous groups of subjects. Cluster analysis offers a solution to this problem by assigning subjects from a single large cohort into clusters based on their statistical similarity on a set of variables defined a priori. This approach was undertaken with two goals: (1) to identify clusters of individuals who exhibit significantly different patterns of BP and HR stress reactivity, and (2) to assess whether membership to a particular cluster conferred increased/decreased risk of hypertension diagnosis at 5-year follow-up.

Materials and methods

Participants

Participants were from the Dutch Famine Birth Cohort, which comprised 2414 men and women born in Amsterdam during 1943–1947. The study was designed to examine the health consequences of prenatal famine exposure. Hence, it may be suggested that this population characteristic may limit the generalisability of the present study results. However, this is unlikely as, although, famine exposure early in gestation, defined as a 13-week period where daily caloric intake was below 1000 calories,14 was associated with poorer adult health,15 only 58 (8.6%) individuals in the present sample were exposed to famine during early gestation.

The selection procedures and loss to follow-up (e.g., unable to contact, death, emigration) have been described in detail elsewhere.16 All 1423 members of the cohort who lived in the Netherlands on 1 September 2002 were invited to the clinic to undergo stress testing from 2002 to 2004; 740 attended. Analyses comparing individuals who did not participate in the stress testing wave (n=683) with those who did participate showed that there were no differences in sex (P=0.49) or birth weight (P=0.42). There was a small significant difference in age (58.3 vs 59.2 years, respectively, P<0.01). From 2004 to the follow-up in 2008–2009, 31 persons had died, 6 had emigrated, 10 had an unknown address and 4 had requested their address to be removed from the database, leaving 1372 eligible cohort members of whom 601 participated. Participants self-reported whether or not they had ever received a diagnosis of hypertension from a physician. The mean (s.d., range) temporal lag between stress testing and the hypertension follow-up interview was 5.5 (0.60, 4.0–6.8) years. Of the 740 participants in 2002–2004 who participated in stress testing, 480 also participated in 2008–2009 follow-up; 121 were participants new to the cohort. Participants who dropped out (n=260) did not differ from those re-attending (n=480) in sex (P=0.80) or birth weight (P=0.18), but there was a small difference in age (58.2 vs 58.4 years, respectively, P=0.05). The study was approved by the local Medical Ethics Committee, carried out in accordance with the Declaration of Helsinki and informed consent was obtained from all participants.

General study parameters

In the 2002–2004 stress testing sessions, research nurses gathered anthropometric measurements and collected socioeconomic status (SES), education and lifestyle data during a standardized interview. Height was measured twice using a fixed or portable stadiometer and weight was measured twice using Seca and portable Tefal scales. Body mass index (BMI) was calculated as weight (kg)/height (m2) from the averages of the two height and weight measures. SES was defined according to the International Socio-Economic Index (ISEI)-92, which is based on the participant’s or their partner’s occupation, whichever has the higher status.17 Values on the ISEI-92 range from 16 (low status) to 87. The Hospital Anxiety and Depression Scale18 (HADS) was used to assess depression as this variable has been shown to relate to both reactivity and hypertension in this data set.19, 20 Education level was measured on a 10-point scale (1=primary education not completed, 10=university completed). Alcohol consumption was recorded as the number of units consumed per week; one unit was defined as one glass of an alcoholic beverage. On the basis of self-report, participants were characterised as current, ex or never smokers and also indicated whether or not they were currently taking antihypertensive medication.

Psychological stress protocol

Stress testing was carried out in the afternoon between the hours of 1200–1400 following a light lunch. A formal 20-min baseline was followed by three psychological stress exposures: Stroop, mirror-tracing and a speech task. Each task lasted 5 min and was separated by 6-min between-task intervals; a 30-min recovery phase followed the final stress task. The Stroop task was a computerised version of the classic Stroop colour-word conflict task. After instruction, participants were allowed to practise until they fully grasped the requirements of the task. During the task, a mistake or response over the time limit (5 s) triggered a beep. The mirror-tracing task required participants to trace a star that could only be seen in a mirror image (Lafayette, IN, USA). Participants were allowed to practice one circuit. They were told to give priority to accuracy over speed and that most people could perform five circuits without diverging from the line. Every divergence from the line induced a short beep. Prior to the speech task, participants listened to a prerecorded scenario in which they were told to imagine that they were falsely accused of pick-pocketing. Participants were instructed to give a 3-min response to the accusation and were given 2 min to prepare a response. The responses were recorded on video and participants were told that the number of repetitions, the eloquence and the persuasiveness of their performance would be marked by a team of communication experts and psychologists.

Continuous measures of BP and HR were made during the stress test protocol using a Finometer or Portapres Model-2 (Amsterdam, The Netherlands). There was no difference in reactivity as a function of the two different measurement instruments. Four 5-min blocks were defined as follows: baseline (final 5 min in baseline period), Stroop, mirror-tracing and speech task (including preparation time). Mean SBP, DBP and HR were calculated for each period.

Statistical analysis

Baseline SBP, DBP and HR were the averages of measures recorded during the 5-min period 15 min into the formal baseline. Initially, cardiovascular measures were averaged across the three tasks to obtain a stress period average for each variable, as this has been shown to create a more reliable marker of individual differences in trait stress reactivity.21 However, separate analyses were then run for each stress task to examine whether the results differed when reactivity to each task was considered separately. Stress reactivity was defined as the difference between stress and baseline averages for SBP, DBP and HR. A repeated-measures ANOVA, comparing baseline and stress task values, was carried out to confirm that the stress tasks perturbed cardiovascular activity. Partial eta squared and hazard ratios (HR) are reported as measures of effect size.

Cluster analysis was carried out using Ward’s method22 in SPSS version 22 (IBM Analytics, Chicago, IL, USA). Raw reactivity scores for SBP, DBP and HR were converted to z-scores to ensure that the cluster analysis was not influenced by the scale of individual variables. Ward’s method begins with the same number of clusters as cases. In each subsequent step, cases are combined, forming one less cluster than before. For each cluster, a within-cluster sum of the squared Euclidean distances between individual scores and the mean of each variable in that cluster is calculated; the smaller the sum of squares, the greater the similarity between individuals in the cluster. A total sum of squares is then calculated across all clusters. Ward’s method determines which two clusters will produce the smallest increase in the total sum of squares when they are merged. Eventually, the merger of two dissimilar clusters will cause a substantial increase in the total sum of squares. The state of the clusters just prior to this point is considered the ‘natural solution’ to the clustering process. Follow-up one-way ANOVAs were carried out to determine whether clusters differed significantly on mean SBP, DBP and HR reactivity. As data were normally distributed, between-cluster differences in general study parameters were tested with one-way ANOVAs and chi-squared analysis. The cluster with the lowest risk of hypertension was used as a reference cluster and was identified using chi-square analysis. First, an uncontrolled binary logistic model was used to assess whether cluster membership in 2002–2004 predicted reported physician diagnosis of hypertension at the 2008–2009 follow-up. Next, the same model was revisited, controlling for hypertension medication use at time of stress testing to control for those already hypertensive. Finally, a fully adjusted model was used to control for potential confounders: SES, BMI, sex, age, HADS-depression score, smoking status and alcohol consumption. These confounders were identified as being established risk factors for hypertension that were available for analysis and significantly differed across the clusters.23, 24 and 25 Exploratory cluster and binary logistic regression analyses were undertaken with cardiovascular measures from each individual task to determine if the cluster results or the relationship between-cluster membership and hypertension differed across stress tasks.

Results

Study population

Of the 740 cohort members, 721 completed the stress protocol. Cardiovascular data were unavailable for four participants. Incomplete cardiovascular data due to technical problems, participant exclusion, due to significant arrhythmia, determined during cardiovascular data processing, and removal of two statistical outliers (>5 s.d. above mean) left an effective sample size of 669, which is substantially above the suggested sample size of 2m needed for cluster analysis, where m is the number of clustering variables.26

Stress reactivity

The stress task battery significantly perturbed SBP, F(1, 668)=2511.21, P<0.001, η2=0.79; DBP, F(1, 688)=579.69, P<0.001, η2=0.47; and HR, F(1, 668)=165.48, P<0.001, η2=0.20; in all cases cardiovascular activity increased in response to stress. The overall magnitude of the cardiovascular perturbations is shown in Figure 1.

Figure 1
figure 1

Means of systolic (SBP), diastolic (DBP) and heart rate (HR) reactivity in mmHg or beats per minute for overall sample and individual clusters. HR reactivity is significantly different across all clusters, with the exception of clusters 2 and 4. SBP and DBP reactivity is significantly different across clusters with the exception of clusters 1 and 4. Error bars represent s.e. of the mean.

Cluster analysis

Based on the criterion discussed for selecting the appropriate number of clusters, SBP, DBP and HR reactions to the stress task battery were found to resolve to four distinct clusters. The means and standard errors for SBP, DBP and HR reactivity for each cluster can be found in Figure 1. Results of independent one-way ANOVAs and post hoc analyses showed that all the clusters were significantly different from each other on all cardiovascular variables (P<0.05) with a few exceptions: clusters 1 and 4 did not significantly differ in SBP or DBP reactivity (both P>0.45), and clusters 2 and 4 did not significantly differ in HR reactivity (P=0.56). Whereas cluster 2 was characterised by reactivity values mostly in line with the sample averages, the other clusters were different in several respects. Individuals in cluster 1 registered exaggerated HR and BP responses while individuals in cluster 3 exhibited an overall blunted reactivity profile. Finally, individuals in cluster 4 mounted an exaggerated BP response equal to that of cluster 1 but only a modest HR response statistically equal to that of cluster 2.

Analysis of general study parameters revealed several significant differences between the clusters (Table 1). Significant between-cluster differences (P<0.05) were found for education, SES, BMI, HADS-depression score, baseline DBP, sex and smoking status. There were no significant cluster differences in baseline SBP or HR, age, alcohol consumption, dropout, and hypertension medication use at the time of stress testing.

Table 1 General study parameters of clusters 2002–2004 wave (N=669)

Cluster risk for hypertension

Hypertension status was recorded for 438 participants in 2008–2009. There was no significant difference in HR or BP stress reactivity between those who participated in the follow-up and those who did not. Analysis of general 2002–2004 study parameters in the follow-up sample revealed significant differences between the clusters in education, SES, BMI, HADS-depression score, hypertension medication use at time of stress testing and smoking status; age, sex, baseline cardiovascular variables and alcohol consumption did not significantly vary across clusters (Table 2). In all, 211 (48%) reported having received a diagnosis of hypertension from a physician in the 2008–2009 follow-up. Binary logistic regression confirmed a relationship between 2002–2004 cluster 4 membership and increased risk of hypertension at 2008–2009 follow-up (Table 3). To assure that this relationship was not influenced by those already hypertensive at the 2002–2004 stress testing session, this analysis was revisited and adjustment for hypertension medication use at the time of stress testing; results survived adjustment (Table 3). Finally, to control for potential confounders education, SES, BMI, HADS-depression score and smoking status were inserted as covariates; cluster 4 membership was still significantly related to increased risk of hypertension at follow-up (Table 3).

Table 2 General study parameters of clusters 2008–2009 wave (N=438)
Table 3 Hazard ratio of physician diagnosis of hypertension by stress reactivity cluster

Exploratory analyses of task specificity

Given that the current study aimed to determine if stable individual differences in stress reactivity predicted individual differences in hypertension risk, we chose to aggregate reactivity measures across the tasks as task aggregation has been shown to result in a more reliable measure of individual differences in stress reactivity.20 However, stress tasks differ in their provoked responses and in their relevance to disease. Consequently, we undertook exploratory cluster and binary logistic regression analyses for each task individually. Individual cluster analyses for the speech and Stroop tasks resulted in the same clusters as the main analysis and in both cases the cluster characterised by exaggerated BP, but only modest HR reactivity had significantly increased risk of hypertension (both HRs >1.96 and both P<0.013). Cluster analysis of reactivity values to the mirror-tracing task also revealed four distinct groups that qualitatively were similar in pattern to the other tasks but cluster membership failed to predict hypertension.

Discussion

Using multivariate cluster analysis, four homogeneous clusters of individuals with statistically different SBP, DBP and HR stress reactivity patterns were identified. Further, cluster membership was found to predict increased risk of a physician diagnosis of hypertension at 5-year follow-up. Interestingly, a dichotomy emerged whereby clusters 1 and 4 garnered the smallest and greatest risk of hypertension, respectively, despite mounting statistically equal exaggerated BP stress responses; the between-cluster difference was in HR reactivity where cluster 1 mounted an exaggerated HR response and individuals in cluster 4 registered HR responses similar to the sample mean. This relationship withstood adjustment for various potential anthropometric and socio-demographic confounders and hypertension medication use at time of stress testing. By showing that only individuals characterised by an exaggerated BP reaction and relatively small HR reactions are at increased risk of hypertension, these results support the previously reported prospective relationship between exaggerated BP reactivity and hypertension, but show that this relationship is also dependent on the magnitude of the cardiac response. Lastly, these results critically emphasise the role of multivariate analyses in stress psychophysiology research.

That the cluster characterised by the largest SBP and DBP stress responses had the greatest risk of hypertension at 5-year follow-up was not unexpected. Moreover, this relationship withstood adjustment for hypertension medication use at stress testing and several potential anthropometric and socio-demographic confounders. Although mediation by some other unmeasured factor is possible, it is unlikely, as previous studies have shown the association between exaggerated BP stress reactivity and hypertension to withstand statistical adjustment for other variables such as age, gender and baseline BP.3 What is more likely is that repeated large magnitude surges in BP, induced by mental stress, engage local BP regulatory mechanisms and lead over time to upward structural resetting of the peripheral vasculature.27 Specifically, elevated resting BP results from a positive feedback cycle in which frequent acute surges in BP promote vascular hypertrophy which decreases lumen diameter and increases vessel stiffness, in turn, amplifying future BP fluctuations. Evidence of such processes lies in the reported association of exaggerated BP reactivity with increased carotid intima-media thickness in children,28 adolescents29 and adults,30 and with increased vascular stiffness,31 as well as the propensity for BP reactivity to increase with age.32 It is likely that such physiological processes underlie the development of hypertension in the individuals contained in the cluster that displayed exaggerated BP responses to mental.

An unexpected finding was that the cluster of individuals carrying the least risk of hypertension did not have reactivity values located at the mean but instead had the most exaggerated HR and BP reactions. Hence, compared with the cluster at highest risk of hypertension, which had an equally exaggerated BP response but only modest cardiac response, it would appear that the presence/absence of a robust HR response is, to some extent, a factor in determining hypertension risk. One possible interpretation relates to the early observation that similar BP reactions can result from significantly different changes in cardiac output and total peripheral resistance.12 A spectrum exists in which individuals at the extreme ends modulate BP by primarily augmenting either cardiac output (cardiac reactors) or total peripheral resistance (vascular reactors). Going further, it has been suggested that not only is the magnitude of reactivity significant in the context of disease but that different underlying mechanisms (i.e., relative degree of cardiac output/total peripheral resistance modulation) may carry differential hypertension risk.12 The present results accord with this framework as the individuals in the highest risk cluster registered an exaggerated BP reaction despite only a modest increase in HR, whereas the cluster carrying the least amount of risk mounted an equally exaggerated BP response but also recorded a HR reaction almost 3 × larger than the sample mean. With such differences in cardiac activity between the clusters, it may be that individuals in the cluster with the least risk increased BP by augmenting cardiac output through beta-adrenergic activation and/or vagal withdrawal mechanisms, while the high-risk cluster increased BP primarily through alpha-adrenergic vasoconstriction.33, 34 It is also possible that the reaction patterns exhibited by individuals in clusters 2 and 3 resulted from variations not only in the degree of mixed alpha/beta-adrenergic activation but also in overall magnitude of autonomic reactivity. Hence, these data suggest that not only is the magnitude with which an individual responds to mental stress significant in the context of disease but also underlying multivariate hemodynamic and autonomic mechanisms carry differential risk and should be considered.

Exploratory task-specific analyses revealed that for Stroop and speech tasks the same four clusters emerged that were revealed in the main analyses. Further, the cluster was characterised by exaggerated BP reactivity but only modest cardiac reactivity had significantly increased risk of hypertension. A four-cluster solution emerged for the mirror-tracing task as well although the clusters were slightly different in composition and cluster membership did not relate to hypertension risk. Presumably, because tasks have been shown to provoke different stress responses via different physiological mechanisms33 these results support the notion that stress tasks differ in their relevance to disease outcomes.

The current study is not without limitations. First, it could be argued that an element of subjectivity exists in choosing the clustering algorithm and the final number of reactivity profile clusters. These are issues with all forms of cluster analysis. We chose Ward’s method as it has been widely used in health psychology research35 and precedence for its use exists in stress psychophysiology; two previous studies have used Ward’s method to cluster stress reactivity patterns according to autonomic activity34 and stress task.33 Four clusters were selected for two reasons: a substantial increase in total sum of squares was observed during the iteration decreasing the sample from five clusters to four, and outputs with five or three clusters either had very small clusters with extreme individuals or large, heterogeneous clusters, respectively. Second, the effect sizes in the current study are small. However, they are consistent in magnitude with those observed in other studies,3 and this is not unexpected as hypertension is multiply determined, having aetiological roots in various physical, psychological and behavioural domains.23, 24, 25 Finally, the possibility exists that famine exposure in utero could influence the present results and limit generalisability. However, chi-square analysis revealed that famine exposure did not differ across the clusters (P=0.25) nor did it relate to hypertension diagnosis (P=0.17).

In conclusion, using multivariate cluster analysis, four distinct HR and BP reactions patterns were identified that differed in relative risk of hypertension diagnosis at 5-year follow-up. A profile characterised by exaggerated BP but only modest HR reactivity conferred the greatest risk, while individuals mounting relatively exaggerated BP and HR responses carried the least amount of risk. These results support, but more importantly, add specificity to the established relationship between BP stress responses and hypertension and provide positive reinforcement for the use of multivariate statistical approaches in psychophysiology research.