Introduction

Nearly half of patients thought to have heart failure (HF) have a normal left ventricular (LV) ejection fraction (EF). The term “heart failure with normal ejection fraction” (HeFNEF) is often used as a diagnostic label in these patients [1, 2]. In some studies [3, 4], morbidity and mortality of patients with HeFNEF are similar to those who have left ventricular (LV) systolic dysfunction. The mechanisms underlying HeFNEF are certainly heterogeneous and remain incompletely understood [5].

Recent studies suggest that abnormalities of both systolic and diastolic function co-exist and contribute to the pathophysiology of HeFNEF [6]. The commonest way of assessing LV systolic function is to measure LVEF, which often has a substantial subjective component, is highly dependent on image quality and may not identify small reductions in systolic function. Tissue Doppler is an alternative method for quantifying LV systolic and diastolic function [7] but is limited by angle dependency. Speckle tracking is a recent development based on frame-to-frame tracking of myocardial tissue movements in any direction that allows a more detailed and sophisticated analysis of LV function, including global longitudinal strain (GLS), which may be more useful for the detection of subtle impairment of LV systolic function than LVEF [810].

We investigated whether measuring GLS added clinical or prognostic information in patients referred to an out-patient clinic with symptoms suspicious of heart failure who were found to have a normal LVEF.

Study population

Out-patients with chronic heart failure (CHF) attending a clinic serving a local population of approximately 550,000 between October 2008 and May 2010 were screened. Patients were included in the analysis if their LVEF was ≥50 % and they had symptoms or signs suspicious of CHF.

Patients provided a detailed clinical history and blood tests (including haematology, biochemistry profile and amino-terminal pro-brain natriuretic peptide (NT-proBNP)). Electro- and echo-cardiograms were obtained on the same day. Ischaemic heart disease (IHD) was defined as a past history of myocardial infarction or angiographic evidence of significant (>70 %) coronary artery disease. Patients were classified as diabetic if receiving treatment for diabetes. A diagnosis of hypertension was based on prior medical history or systolic blood pressure >140 mmHg. A medical history of chronic obstructive pulmonary disease and smoking habits were also recorded. Patients in atrial fibrillation or atrial flutter were grouped as “AF”.

Informed consent was obtained from each patient and the study protocol conforms to the ethical guidelines of the 1975 Declaration of Helsinki as reflected in a priori approval by the institution’s human research committee.

Three different subgroups were defined:

  • Group A: Patients with no substantial cardiac dysfunction (LVEF ≥ 50 %, left atrial diameter (LAD) < 4 cm in parasternal long axis, NTproBNP < 400 ng/l): N = 76;

  • Group B: Patients with LVEF ≥ 50 %, but LAD ≥ 4 cm or NTproBNP ≥ 400 ng/l, defined as “possible HeFNEF”: N = 99;

  • Group C: Patients with LVEF ≥ 50 %, LAD ≥ 4 cm and NTproBNP ≥ 400 ng/L, defined as “definite HeFNEF”: N = 138.

We created a congestion score, based on lung auscultation (normal, presence of basal, mid zone or diffuse crepitations), jugular venous pressure (JVP; not visible, raised 1–4 cm, raised to earlobe), peripheral oedema (none, ankles, below or above knees) and liver examination (not palpable, palpable) with one point attributed for each degree of severity. Patients with a score of three or more out of a possible nine were defined as being congested.

We also invited some subjects aged >50 years who were identified from family doctor registers not to have a history of cardiovascular disease to act as a control group.

Patients with HeFNEF were managed with diuretics for symptom relief, and otherwise at their clinician’s discretion. Data regarding hospitalizations and death were collected from the hospital’s electronic systems supplemented by information from patients and their family doctors.

The primary outcome of interest was a composite of admission for worsening heart failure or cardiovascular death (due to terminal heart failure or sudden death out of hospital not explained by cancer or other non-cardiovascular disease). Admission for heart failure was defined as admission for symptoms of worsening heart failure requiring a substantial intensification of treatment (intra-venous diuretic, inotropic or vasodilator therapy, or an increase in oral furosemide dose by ≥40 mg/day). If there was an in-hospital death attributed to worsening or terminal heart failure or acute myocardial infarction during an admission, the end point of CV death was used in preference to the date of hospitalisation as most deaths occurred within 48 h of admission. Patients who died out of hospital who had no other obvious cause of death were considered to have died a sudden cardiac death. To avoid errors due to the mis-attribution of cause of death, a second end point of all-cause mortality was also considered.

Echocardiographic measurements

Echocardiograms were performed by experienced operators in accordance with the recommendations of the British Society of Echocardiography [11] using a Vivid-5 or -7 (GE Health care, UK) system operating at 3.4 MHz. Doppler tracings and two-dimensional images were obtained from parasternal long- and short-axis, apical and subcostal views. Echocardiograms were stored and retrospectively reviewed by a single operator (PP) blinded to other patient details using an EchoPAC station (GE Health care, UK). LVEF was measured using Simpson’s biplane method. LA volume was measured in the four chamber view and indexed to body surface area (LAVI). Tricuspid annular plane systolic excursion (TAPSE) was used to assess right ventricular (RV) systolic function. Pulmonary artery systolic pressure was estimated from the velocity of tricuspid regurgitation based on the modified Bernoulli equation, ΔP = TR velocity2 × 4. With the patient in the supine position, the maximum inferior vena cava (IVC) diameter during the respiratory cycle was measured approximately 3 cm before it merged with the right atrium.

A digital loop was acquired from apical 4-chamber, 2-chamber and 3-chamber views at frame rates between 40 and 80 frames/s for longitudinal strain analysis. Images were analyzed off-line. The strain curve was extracted using dedicated software (GE). Peak systolic strain was defined as the peak negative value on the strain curve during the entire cardiac cycle. Values obtained from a model based on all 18 LV segments were then averaged. GLS was calculated if more than 12 LV segments could be analyzed. GLS was estimated from a single cardiac cycle. However, because speckle tracking analysis is influenced by AF, a sub-analysis was also conducted for patients who were in sinus rhythm (SR) only.

Statistical methods

16 GLS measurements and 276 individual segments were randomly selected and measured separately on two different occasions by an experienced operator (PP). The reproducibility of the GLS measurements, as well as of each segment, was tested using Bland–Altman plots.

Categorical data are presented as percentages; normally distributed continuous data as mean ± SD; and non-normally distributed variables as median and interquartile range. One-way ANOVA and Kruskal–Wallis tests were used to compare continuous variables between groups depending on the normality of the distribution, and the Chi squared test was used for categorical variables.

Associations amongst demographic, clinical, echocardiographic and biochemical variables with prognosis were assessed using the Cox proportional hazards models. Because there were relatively few primary outcome events, two different multivariable models were tested to prevent over-fitting. In Model A, we chose, prospectively, eight candidate variables of interest in addition to GLS and diagnostic category; and for Model B, we selected the eight variables most strongly associated with prognosis in univariable analysis. Forward and backward procedures were used to determine which variables independently predicted the primary composite outcome. Treatment variables were not included in the model as these might be confounded by indication, might vary overtime and are not known to influence prognosis in HeFNEF. We did not include mitral and tricuspid regurgitation in the model, because their estimation was based on a semi-quantitative approach and because we included patients with AF, which might make the interpretation of valvular regurgitation difficult. For the second model, given the high correlation between IVC diameter and TR systolic gradient (R = 0.529, p < 0.001), we selected the IVC diameter, a strong predictor of an adverse outcome [12].

Kaplan–Meier curves with the log-rank statistic were used to illustrate outcome. Assumptions of the models were tested, such as multicolinearity and proportional hazards. All the analyses were performed using SPSS and Stata software. A 2-sided p value <0.05 was considered statistically significant.

Results

Patient characteristics

In the 20 control subjects, the mean age (SD) was 65 ± 11 years, 55 % were women and their echocardiograms were normal (Table 1). Mean (±standard deviation) LVEF was 60 ± 5 %, mean GLS by speckle tracking was −19.1 ± 2.1 % and median plasma NTproBNP was 107 pg/ml (IQR: 46–147).

Table 1 Characteristics of patients

Of the 780 consecutive patients assessed in clinic, 313 (40 %) had symptoms or signs suspicious of heart failure and an LVEF ≥ 50 %, thus fulfilling the study criteria (Table 1), the rest (467) had LVEF < 50 %. Of the 313 patients with LVEF ≥ 50 %, 76 were considered not to have HeFNEF, 99 to have possible HeFNEF and 138 to have definite HeFNEF. Compared to the control group, patients with definite HeFNEF were older, were more likely to have AF, had more symptoms and more signs of fluid retention, and were more likely to be treated with diuretics. The proportion of patients with a history of IHD, diabetes, hypertension and COPD were similar in all three groups of patients, but fewer patients in Group A (no substantial cardiac dysfunction) had AF. When the analysis was restricted to patients in sinus rhythm (Table 2), patients with definite HeFNEF still were older, had more symptoms, worse renal function and higher systolic blood pressure and NTproBNP levels than the other two subgroups.

Table 2 Characteristics of patients in sinus rhythm

Echocardiographic findings

LVEF was similar in all three sub-groups of patients and in control subjects. GLS was impaired in each group of patients compared to control subjects but most severely impaired in patients who fulfilled our criteria for definite HeFNEF. Patients with definite HeFNEF had greater LV volumes, more RV dysfunction (as estimated by TAPSE) and more substantial mitral and tricuspid regurgitation than the other two groups. They also had higher pulmonary pressure and IVC diameter. Similar results were seen when the analysis was restricted to patients in sinus rhythm (Table 2).

For 14 patients (3 with no heart disease, 4 with possible HeFNEF and 7 with definite HeFNEF) analysis of GLS was not possible due to poor quality images. For other patients, 85 % of LV segments could be analyzed. Internal consistency and reproducibility of measurements of GLS (mean difference = 0.12 %; 95 % Limits of agreement: −0.75, 0.99) and individual segments (mean difference = 0.12 %; 95 % Limits of agreement: −4.16, 4.40) were good.

Amongst the patient group as a whole, GLS was more negative (better function) in women v men [−15.8 (3.0) % vs. −13.8 (2.8) %, p < 0.001] and in those with no history of IHD [−15.2 (3.0) % vs. −14.0 (2.9)  %, p = 0.001]. GLS was higher (more impaired) in patients who were taking loop diuretics (−14.4 (3.0) % vs. −15.4 (3.0) %, p = 0.007). There was no relation between GLS and either NYHA class or congestion score. There was no relation between GLS and age (r: −0.06, p = 0.31) but there was a relation between GLS and log (NT-proBNP) (worsening long axis function associated with increasing NT-proBNP, r: 0.32, p < 0.001: see Fig. 1) and renal function (creatinine: r: 0.20, p = 0.001, urea: r: 0.16, p = 0.009).

Fig. 1
figure 1

Relation between GLS and log (NT-proBNP): LV long axis function worsens with higher plasma concentrations of NT-proBNP

Outcome

All patients were followed for at least one year. There were 62 primary outcome events during the median follow up of 572 (IQ range: 440–736) days. The first qualifying event was hospitalisation due to worsening HF in 30 patients and CV death in 32 patients. In univariable Cox regression analysis (Table 3), GLS, but not LVEF, predicted events. Left atrial diameter, TAPSE, TR systolic gradient as well as moderate mitral regurgitation or tricuspid regurgitation were also associated with a worse outcome.

Table 3 Univariable Cox regression models for the composite endpoint CV death or HF hospitalization (313 patients, 62 events)

In multivariable analysis, increasing urea and log [NTproBNP] were the only variables independently related to an adverse outcome in Model A (Table 4). In Model B (Table 5), increasing urea and logNTproBNP, as well as increasing LAVI, IVC diameter and the presence of AF were independently related to outcome. Patients with a definite diagnosis of HeFNEF had the highest rate of adverse events; 25 % at 1 year (Fig. 2).

Table 4 A multivariable cox regression for the primary endpoint of CV death and admission with heart failure
Table 5 A multivariable cox regression for the primary endpoint of CV death and admission with heart failure (left) and for the secondary endpoint of death for all causes (right)
Fig. 2
figure 2

Kaplan-Meier (KM) curves for the composite endpoint (HF hospitalization or CV death) in patients considered to have no evidence of significant cardiac disease (solid blue line, Group A), in patients with “possible HeFNEF” (red line, Group B) and patients with definite HeFNEF (green line, Group C). For patients with definite HeFNEF, the hazard ratio for this outcome was 7.86 (95 % CI: 2.82–21.87; p < 0.001) compared to those considered not to have evidence of significant cardiac disease

There were 48 deaths during the median follow of 598 (IQ range 471–756) days for all patients: AF, a greater LAVI and higher plasma NTproBNP were independent predictors of all-cause mortality (Table 5). Thus, a definite diagnosis of HeFNEF was associated with increased all-cause mortality.

Discussion

We have found that in patients referred with a possible diagnosis of heart failure but with a normal LVEF, a subtle impairment of LV longitudinal systolic function can be detected with advanced echocardiographic techniques, even in the absence of structural (dilated left atrium) or biochemical (increased NTproBNP) abnormalities. Worsening LV longitudinal systolic dysfunction was related to other indices of abnormal function. However, GLS provided no additional prognostic information to biochemical measures of cardiac dysfunction.

HeFNEF remains a controversial entity. The recently updated ESC guidelines on heart failure [13] have lowered the threshold levels of natriuretic peptides for the exclusion of chronic heart failure to 125 ng/L, recognising that previous guidance may have been too stringent. However, this increases the likelihood of including patients with no substantial cardiac pathology under the umbrella of “HeFNEF”, particularly older people in whom HeFNEF is especially common [14]. Indeed, 5 control subjects and 263 (84 %) of patients with suspected HeFNEF in the present study had values ≥125 ng/L. Although values <125 ng/L may exclude serious ventricular dysfunction or indicate that it is very well managed, higher thresholds are required when NT-proBNP is used to identify patients likely to have disease. It is unlikely that the single threshold of 400 ng/L used in our analysis will be optimal for clinical practice although it conformed to the guidelines contemporary to this study [5, 15].

We have previously shown that the velocity of long-axis systolic shortening may be reduced in patients with HeFNEF, indicating impairment of the longitudinal component of systolic contraction [16, 17]. Mild impairment of LV longitudinal function is present even in young patients with hypertension (GLS: −17.5 ± 2.8 %) compared with healthy subjects (−21.1 ± 2.0 %, p < 0.001) or athletes (−22.2 ± 2.7 %) but global circumferential strain, global radial strain and torsion are similar in these three groups [10]. Recently, Morris and colleagues demonstrated that GLS was abnormal in patients with HeFNEF [18]. Worse overall myocardial performance was associated with worse symptoms. Furthermore, in animal models, it has been shown that impairment of LV longitudinal function occurs in the early phase of LV hypertrophy, and continues to worsen until development of congestive heart failure [19].

A common underlying factor may be myocardial fibrosis [6, 20] leading to reduced LV longitudinal function, altered torsion and delayed untwisting, loss of LV suction and thus impairment of early diastolic LV filling. Ventricular hypertrophy, delayed myocardial relaxation, atrial dysfunction and abnormal electrical conduction could also contribute to disturbed systolic and diastolic function.

Our results are also supported by a recent analysis of the I-PRESERVE trial [21]: up to 40 % of patients enrolled in that trial did not have structural or functional alterations of the cardiac chambers on echocardiography, such as LV hypertrophy, LA dilatation or evidence of diastolic dysfunction. However, in common with analyses of both the CHARM-Preserved and PEP-CHF trials, NT-proBNP was found to be a stronger predictor of outcome than any echocardiographic measure [22, 23]. We found that a combination of both a dilated LA and raised NTproBNP dramatically increased the risk of an adverse outcome.

Although deterioration in LV longitudinal function might be an early manifestation in the developing pathophysiology of HeFNEF, our finding also supports the notion that raised NTproBNP plasma levels and other echocardiographic measurements of pressure or fluid overload, such as increased LA size or IVC diameter, might be more useful in characterising and identifying those patients at higher risk of adverse outcome [24].

Limitations

We chose to define cardiac dysfunction by LAD rather than LAVI but LAVI was well above the cut-off defined in guidelines as an indicator of HeFNEF [5]. Both LAD and LAVI appear to identify patients with HeFNEF [25] and both measurements are strongly related to prognosis.

Standard echocardiographic measurements of diastolic funtion (E/E’ and E/A ratio) were not routinely performed and were not available. Although E/E’ ratio predicts cardiac events in patients with systolic HF [26], it may add little prognostic information to LAD. Large studies show that E/E’ is not an independent predictor of adverse outcome in patients with HeFNEF [21, 27]. Moreover, we did not evaluate LV mass, which is also known to be associated with increasing cardiovascular risk in the general population [28].

Patients with symptoms but no conventional evidence of cardiac dysfunction may have had “latent” HeFNEF that might become overt on exercise, provoking symptoms [29, 30]. These patients had a similar prevalence of risk factors for cardiac dysfunction and heart failure, such as ischaemic heart disease, as patients with definite structural heart disease. It is also possible that some had symptoms reflecting reversible myocardial ischaemia. However, these patients had an excellent prognosis, in stark contrast to the patients we defined as definite HeFNEF.

Many patients had AF, which could cause problems in interpretation of echocardiographic data and might be a direct cause of atrial enlargement. AF could also cause a rise in natriuretic peptides [31]. Diagnosis of HeFNEF in patients with AF remains unsatisfactory.

Conclusions

GLS is impaired in patients who present with symptoms or signs of heart failure, even when there is no other conventional evidence of this diagnosis. Impaired GLS is associated with a worse prognosis but does not provide additional prognostic information to biochemical measures of cardiac dysfunction or LAVI.