Introduction

Both in patients with pulmonary hypertension (PH) and in patients with different types of congenital heart disease (CHD), the right ventricle (RV) performs under increased pressure loading. The RV adapts by hypertrophying, however at a certain point the RV is unable to cope with the increased pressures and RV failure will ensue. Consequently, RV function is an important determinant of prognosis and of therapeutic strategy in these patients. For instance, in patients with pulmonary valvular stenosis, timing of intervention is partly dependent on RV function [1]. In patients with PH, deterioration of right ventricular volumes and ejection fraction (RVEF), increased RV end-diastolic volume (RVEDV) and stroke index are associated with poor outcome [2, 3]. Furthermore, for patients with transposition of the great arteries (TGA) after an atrial switch operation, in which the RV supplies the systemic circulation (i.e. systemic RV), decline in RV function is one of the most important clinical problems. Therefore RV volumes and function are frequently used in follow-up of these patients, making accurate and reproducible measurements highly important.

As both 2D and also 3D echocardiography of the RV remain less reproducible than cardiac magnetic resonance imaging (CMR), the latter is still considered to be the reference standard for the quantification of RV volumes and EF [47]. Whether trabeculae and papillary muscles should be included or excluded from the blood volume is subject of debate. Throughout literature both methods are used [2, 810]. However, many studies have not clearly described whether trabeculae and papillary muscles were included or excluded from the RV blood volume [1114]. The impact of trabeculae is assumed to be small in healthy individuals, but Winter et al. showed that exclusion of trabeculae from the RV blood volume resulted in a substantial difference of RVEDV, RVESV and RVEF in patients with a systemic right ventricle [15]. Although theoretically more accurate, Winter et al. also showed that manual tracing of trabeculae has low reproducibility and therefore can be considered less favorable for longitudinal follow-up [5, 15].

Freling et al. [16] recently reported that semi-automatic pixel-intensity based segmentation software is able to exclude trabeculae and papillary muscles from the RV blood volume with high reproducibility in Tetralogy of Fallot (TOF) patients with predominantly volume overloaded RVs. Moreover, this resulted in a substantial difference in RV volumes and RVEF compared to the method which includes these structures in the RV blood volume. In patients with increased RV pressure the trabeculae are likely to be coarser. The impact and reproducibility of excluding trabeculae and papillary muscles with this semi-automatic software in patient groups with RV pressure overload has not been investigated up to now.

The purpose of this multicenter study was to determine the impact of excluding trabeculae and papillary muscles, on RV volumes and function as assessed by CMR in patients with pressure or combined pressure and volume overload of the RV and healthy controls. Secondly, we determine the reproducibility of this methodology when semi-automatic pixel-intensity based software is used.

Methods

Study design and population

One hundred CMR studies were included in the analysis (median age 36.2 years, 51 % male). Four groups of 20 adult patients with pressure overloaded RV’s were analyzed: patients with pre-capillary PH, patients with right ventricular outflow tract obstruction (RVOTO) after arterial switch operation (ASO) for TGA, patients with repaired TOF and patients with TGA and atrial switch procedure (Mustard or Senning operation). A reference group of 20 healthy controls was also included.

PH was defined in accordance with the ESC/ESR guidelines as a mean pulmonary artery pressure of ≥25 mmHg and a pulmonary capillary wedge pressure of ≤15 mmHg [17]. Only patients with pre-capillary (i.e. with arterial vascular changes) PH were included, all were diagnosed with either chronic trombo-embolic or idiopathic PH. In all patients RV systolic pressure (RVSP) was measured using Doppler echocardiography on the day of CMR investigation. RVSP was measured using the peak velocity of tricuspid insufficiency plus estimated right atrial pressure. Patients with repaired TOF were included if a RVSP of ≥36 mmHg was measured by Doppler echocardiography [18]. Patients after ASO were included if RVSP measured by Doppler echocardiography was ≥36 mmHg or if, using Doppler echocardiography, a mild or moderate RVOTO was measured, defined as a maximum gradient of ≥25 mmHg. For patients with TGA after atrial switch procedure systolic blood pressure was used to determine RVSP. Basic patient characteristics for each patient group are illustrated in Table 1. Degree of pulmonary (PR) and tricuspid valve regurgitation (TR) were assessed semi-quantitatively with echocardiography, based on color-Doppler and continuous wave Doppler pattern and graded as: none or trace, mild, moderate or severe.

Table 1 Basic patient characteristics of each patient group and of healthy controls

In this retrospective study, MR images from two tertiary referral hospitals were analyzed. One centre contributed 59 patient CMR datasets and 20 control subjects. The second centre provided the remaining 21 patient CMR datasets. The datasets in this study were obtained between May 2008 and July 2012. Prior to analysis, all patient and control data were encoded to preserve anonymity. All CMR datasets were acquired in a routine clinical setting and anonymized for analysis. The medical ethics committees waived the need for informed consent.

CMR imaging protocol

Datasets were obtained using commercially available 1.5 T MR scanners [Ingenia R4.1.2; Philips Healthcare, Best, The Netherlands (n = 79); Magnetom Sonata, Siemens Healthcare; (n = 7) and Magnetom Avanto; Siemens Healthcare, Erlangen, Germany (n = 14)]. For all studies dedicated chest or torso phased array parallel-imaging capable surface coils were used with 12–28 elements. CMR images were acquired during repeated end-expiratory breath holds. Cine images were acquired using a retrospectively gated balanced steady state free precession sequence with 25–30 cardiac phases per cardiac cycle. Slice thickness used were 6 with 4 mm gap (n = 21) or 8 with 0 mm gap (n = 79). Sequences included multi-slice, multi-phase cine short-axis, longitudinal four-chamber, vertical two-chamber and RV outflow views. The multi-slice cine short-axis acquisitions were planned from above the mitral valve up to and including the cardiac apex. The following ranges of other scan parameters was used: TR 2.7–3.4 ms; TE 1.1–1.7 ms; flip angle 80º–90º; matrix 171–192; voxel size: 1.25 × 1.25 × 8.0 mm and 1.7 × 1.7 × 6.0 mm. Parallel imaging factors varied between 0–3.

CMR image analysis

Image analysis was performed using Qmass MR Research edition version 7.4.14.0 (Medis, Leiden, the Netherlands) [16]. Segmentation was performed on end-diastolic and end-systolic phases only.

The end-diastolic and end-systolic phase was selected by visual assessment as the phase with the largest and smallest RV cavity sizes respectively, taking into account the longitudinal 4-chamber, vertical 2-chamber and RV outflow tract as reference views. If visual assessment was difficult, multiple frames were contoured to determine the correct end-diastolic or end-systolic phase. Using a previously described RV analysis protocol the RV epicardial and endocardial contours were manually traced from the most apical to the most basal short-axis slice [19]. Only the portion of the outflow tract below the pulmonary valve was included in the blood volume in the basal slice in which the pulmonary valve was visible. If more than 50 % of the tricuspid annulus or atrium was visible in a basal slice the valve area was excluded from the blood volume. Epicardial and endocardial contours overlapped at valve borders and septum, as the septum was considered part of the left ventricle. For patients with systemic RV, the septum was considered part of the RV and included in the RV myocardial volume.

Based on the methodology described above, two methods were used for determining RV volumes, function and mass. With method 1 trabeculae and papillary muscles were included in the blood volume. With method 2, trabeculae and papillary muscles were excluded from the blood volume (Fig. 1) and added to the myocardial volume. For both methods the volume between the endo- and epi-cardial contour was considered myocardial volume. Selection of trabeculae and papillary muscles was done using semi-automatic pixel-intensity based segmentation software. The segmentation software is based on the signal intensity distribution of MR images and has been described in detail by Freling et al. [16]. In brief, voxels within the epicardial contour are classified as either blood volume or myocardial volume according to their signal intensity, taking into account spatial variations in signal intensity. Based on this algorithm, trabeculae and papilary muscles were excluded from the blood volume and included in the myocardial volume. The algorithm works similar for images generated by the different scanners used in this study. It was a possibility to manually change the threshold for every slice, in order to select the same trabeculae in end-diastole and end-systole. Observers selected only trabeculae with signal intensity similar to the intensity of RV myocardium. Individual voxels could also be selected or deselected in case of artifacts due to nonlaminar flow.

Fig. 1
figure 1

RV contour tracing only (A-1 and B-1) and with semi-automatic selection of trabeculae (A-2 and B-2). Two methods of measuring RV volumes in a healthy control (a) and patient after atrial switch procedure (b). Method 1: inclusion trabeculae in the blood volume (a-1 and b-1); Method 2: exclusion of trabeculae from the blood volume, using identical endocardial contours (a-2 and b-2)

For both methods, RV volumetric parameters were calculated by the sums of the traced contours multiplied by slice thickness in all short-axis slices. For method 1 the volume of trabeculae and papillary muscles was included in the RV blood volume and for method 2 this was excluded from the blood volume. Stroke volume (SV) was defined as the difference between end diastolic volume and end systolic volume. All volumetric data were indexed for body surface area (BSA), which was calculated using the Dubois–Dubois formula (0.20247 × height (m)0.725 × weight (kg)0.425). EF was calculated by SV/EDV × 100 %. For method 1 myocardial volume was defined as epicardial minus the endocardial contour, for method 2 end-diastolic trabecular volume was added to the myocardial volume. RV mass was quantified by multiplying the specific density of myocardium (1.05 g/ml) with the end-diastolic myocardial volume.

Reproducibility

Intraobserver reproducibility of both methods was assessed by re-analyzing five randomly selected CMR datasets from every patient group, as well as the healthy control subjects by the primary observer. In total 25 datasets were reanalyzed. To determine interobserver variability a second observer re-analyzed the same 25 datasets. Observers were unaware of the results of the first analysis and there was an interval of at least two weeks between the first and second analysis. The observers had equal experience in RV volumetric analysis and received the same training for Qmass MR research edition.

Statistical analysis

Continuous data were expressed as median and interquartile range (IQR) or mean value ± standard deviation (SD) as appropriate. Mean differences ± SD between method 1 and 2 was calculated for RVEDV/m2, RVESV/m2, RVSV/m2, RVEF and RV mass/m2, using paired Student’s T test. Differences in RVEDV/m2, RVESV/m2 and RVEF found in the patient groups were compared to the healthy control group using a one-way ANOVA with posthoc Dunnett’s test. For the one-way ANOVA data underwent logarithmic transformation if necessary (i.e. if homogeneity of variances was inequal). Intra- and inter-observer agreement were assessed using Bland–Altman plots and intraclass correlation coefficients (ICC). Paired Student’s T test was used to test for significant differences between observer 1 and 2 and between the first and second measurements of observer 1. Mean differences ± SD were calculated for all measurements. Lastly to compare reproducibility of both methods the inter- and intra-observer agreement coefficient of method 1 and 2 were calculated for each measurement. The ACintra/inter was calculated using the following formula: ACintra/inter = 100 × (1 – 2 × |Obs1 − Obs2|/Obs1 + Obs2); in which Obs1 and 2 are the first and the second observation (or observer). The ACintra/inter calculated for method 1 and 2 were compared using a paired Wilcoxon signed rank test. Using this formula the relative observer differences of both methods are compared. Using a Bonferonni correction for multiple measurements p values of <0.01 were considered statistically significant. All data analysis was performed in IBM SPSS statistics version 20.0 (IBM SPSS, Chicago, IL, USA).

Results

Exclusion of trabecular volume

RVEDV/m2, RVESV/m2, RV EF and RV mass/m2 measured including (method 1) and excluding (method 2) RV trabeculae from the RV blood volume are listed in Table 2. For all patient groups and for healthy controls, exclusion of trabeculae and papillary muscles from the blood volume resulted in significantly decreased RVEDV/m2 and RVESV/m2 and a significantly increased RVEF and RV mass/m2 (Table 2). Of note, the differences in EDV/m2, ESV/m2, RVEF, and RV mass between both methods were most pronounced in the patients after atrial switch procedure and least pronounced in the PH patients, with mean absolute differences in EF of 10.1 ± 2.3 and 4.7 ± 1.6 %, respectively. In healthy controls an absolute increase in RVEF of 3.4 ± 0.8 % was measured. Of note, the differences in EDV/m2, EDV/m2, RVEF and RV mass were significantly larger in all patients groups compared to the healthy controls (p < 0.01).

Table 2 RV volumes and function

Reproducibility

For both methods inter- and intra-observer agreement was high in all measurements, as illustrated by high ICCs with small limits of agreement (Table 3; Fig. 2). For both methods, RVEDV, RVESV and RV mass showed significant differences between repeated measurements. However, mean differences were small and considered not clinically relevant. In Fig. 2, Bland–Altman plots show interobserver variability for RVESV, RVEDV and RVEF for both methods. For RVEDV, RVESV and RV mass the limits of agreement were narrower when trabeculae and papillary muscles were excluded from the RV blood volume (method 2). The ACintra/inter of all measurements was not statistically significantly different for RVEDV, RVESV and RVEF (p > 0.1). Method 2 had a significantly better ACinter than method 1 for RV mass measurement, with a median (IQR) ACinter of respectively 94.1 (92.1–97.1) % and 77.2 (72.1–82.6) %.

Table 3 Inter- and intra-observer agreement
Fig. 2
figure 2

Bland–Altman plots for method 1 and method 2. Bland–Altman plots showing the mean value of both observers on the x-axis and absolute differences between the observers on the y-axis for each paired observation. Limits of agreement are defined as ±2 SD

Discussion

Exclusion of trabeculae and papillary muscles resulted in substantial alterations of RV volumes, RVEF and RV mass in a wide range of patient populations with pressure and volume overloaded RVs. Furthermore, we found that these differences in RV parameters vary widely depending on the exact condition underlying RV overload. Although prior studies already established this fact in general terms, the major impediment to widespread adoption of this method in clinical practice was the lack of a fast and reproducible way to measure the exact amount of RV trabeculae and papillary muscles. We found that exclusion of RV trabeculae using semi-automatic pixel-intensity based software resulted in fast and highly reproducible RV measurements. This is opposed to manual tracing of trabeculae which has previously been shown to be unreliable [15, 20].

Accurate and reproducible measurement of RV volume and function is mandatory because of the prognostic and therapeutic implications in patients with PAH and CHD [1, 3, 21, 22]. The current study underscores that exclusion of trabeculae has a significant impact on RV volumes, RVEF and RV mass in both CHD and PAH patients with overloaded RVs. Moreover, the impact of excluding trabeculae varied widely between patient groups, from a change in RVEDV of −12.3 ± 4.6 ml/m2 in PH patients to −29.2 ± 6.6 ml/m2 in patients with a systemic RV. Healthy controls also exhibited significant differences in all RV measurements, but these were significantly smaller (p < 0.01) compared to the differences observed in patient groups. Consequently, RV volume and function in most patients will be closer to or in the normal range after exclusion of trabeculae from the RV blood volume.

Currently, there is no clear standard for RV volumetric analysis or consensus on how trabecular structures should be handled. Major obstacles to exclude trabeculae and papillary muscles from the RV blood volume have been the time investment of performing manual tracing of these structures and the low reproducibility [15, 20]. Several studies in CHD patients differ on the point of including or excluding RV trabeculae and papillary muscles from the RV blood volume [3, 4, 11, 12, 20, 22, 23] or are not clear about the methodology used [13, 14]. In the current study we found that using semi-automatic pixel-intensity based segmentation software results in highly reproducible RV volumetric measurements. Because inclusion or exclusion of trabeculae has a major impact on RV parameters as measured with CMR, studies using different methodologies are incomparable. Application of the method described in this study may be a step forward to achieve uniformity of RV volumetric measurements, which is important to compare the effect of interventions aimed at preserving or improving RV function. However, there are only few reports using this new methodology and it is of great importance that new studies are undertaken to determine clinically relevant cut-off values using this semi-automated method.

When comparing the current study to prior studies investigating the impact of trabeculae and papillary muscles on RV volume and function, some important differences can be observed. Winter et al. studied 29 patients with systemic right ventricles and found an increase in RVEF of 7.4 ± 3.9 % compared to 10.1 ± 2.3 % in our report. In contrast to our results, which are based on semi-automated pixel-intensity based segmentation, manual exclusion of trabeculae was substantially less reproducible in the study of Winter et al. [15]. Moreover, both our study and the study by Freling et al. [16] even demonstrated a higher reproducibility for respectively RV mass and both RVEDV and RV mass using this semi-automatic method to exclude trabeculae compared to only endocardial contour tracing. We attribute this finding to observer variation in handling of trabeculae adjacent to the endocardial border. This can result in small differences for endocardial contour tracing, which will be rectified if all trabeculae are excluded. Sievers et al. [24] studied the effect of trabeculae on RV volumes in healthy controls and reported a difference in RVEF of only 1.72 % compared to 3.4 ± 0.8 % in our study, however baseline RVEDV values also differed considerably with ours, indicating that these study populations are not comparable. Freling et al. [16] investigated a different group of TOF patients, with volume overloaded RV’s using the same software package as described in the current study and found a similar increase in RVEF of 7 ± 4 versus 7 ± 2 % in our study.

The current study only focused on one of the possible sources of error in RV volumetric assessment with CMR. An important source of error remains basal slice selection and delineation of the tricuspid valve. In this study a short-axis orientation for RV volumetric measurement was used as this is standard practice in our hospitals. Axial orientation, however, might result in higher reproducibility than short-axis orientation in CHD patients with severely dilated RVs, decreasing the difficulty of valve delineation in the basal slices [25, 26]. To minimize errors at the tricuspid and pulmonic valve, images were cross linked to RV 2-chamber, 4-chamber and RV outflow tract views. Furthermore only a small portion of the patients had severely dilated RVs, therefore it is unlikely that the slice orientation would have resulted in important differences for the current study. The impact and reproducibility of the semi-automatic software used in the current study will likely be similar in axial slice orientation, as the software is not restricted by geometric assumptions and uses signal intensity to select trabeculae. Finally, another source of error might be inadequate selection of the RV end-systolic frame. In daily practice both RV ESV and LV ESV are often assessed in the LV end-systolic frame. However in patients with CHD, who often have right bundle branch block, timing of the RV end-systolic frame can be delayed compared to the LV end-systolic frame [27]. Therefore RV end-diastolic and end-systolic phase was based solely on RV cavity size.

Study limitations

This study is unable to determine whether including or excluding trabeculae best represents true RV volumes, as a gold standard in vivo is lacking. Because the SV remains equal with both methodologies other CMR measurements are unable to serve as a reference standard. However, theoretically exclusion of trabeculae is more accurate as these do not contribute to RV blood volumes in end-diastole or end-systole.

Furthermore no invasive measurements were available to determine the true RV pressure in these patients. Therefore estimations of RVESP and RVOT gradient based on Doppler-derived flow velocities were used, which might not always be accurate and have limitations. Nonetheless these are the best available non-invasive alternatives to assess degree of RV pressure overload or RVOT stenosis.

Conclusion

Exclusion of trabeculae and papillary muscles has a significant impact on measured RV volumes, mass and EF. The magnitude of the differences varies between patient groups and is significantly larger in all investigated patient groups with overloaded RVs than in healthy controls. Importantly, exclusion of trabeculae with semi-automatic pixel-intensity based software is highly reproducible and superior compared to manual contour tracing for RV mass.