Abstract
To compare the performance of semi-automatic versus manual segmentation for ECG-triggered cardiovascular computed tomography (CT) examinations prior to transcatheter aortic valve replacement (TAVR), with focus on the speed and precision of experienced versus inexperienced observers. The preoperative ECG-triggered CT data of 30 consecutive patients who were scheduled for TAVR were included. All datasets were separately evaluated by two radiologists with 1 and 5 years of experience (novice and expert, respectively) in cardiovascular CT using an evaluation software program with or without a semi-automatic TAVR workflow. The time expended for data loading and all segmentation steps required for the implantation planning were assessed. Inter-software as well as inter-observer reliability analysis was performed. The CT datasets were successfully evaluated, with mean duration between 520.4 ± 117.6 s and 693.2 ± 159.5 s. The three most time-consuming steps were the 3D volume rendering, the measurement of aorta diameter and the sizing of the aortic annulus. Using semi-automatic segmentation, a novice could evaluate CT data approximately 12.3 % faster than with manual segmentation, and an expert could evaluate CT data approximately 10.3 % faster [mean differences of 85.4 ± 83.8 s (p < 0.001) and 59.8 ± 101 s (p < 0.001), respectively]. The inter-software reliability for a novice was slightly lower than for an expert; however, the reliability for a novice and expert was excellent (ICC 0.92, 95 % CI 0.75–0.97/ICC 0.96, 95 % CI 0.91–0.98). Automatic aortic annulus detection failed in two patients (6.7 %). The study revealed excellent inter-software and inter-observer reliability, with a mean ICC of 0.95. TAVR evaluation can be accomplished significantly faster with semi-automatic rather than with manual segmentation, with comparable exactness, showing a benefit for experienced and inexperienced observers.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
In the last decade, transcatheter aortic valve replacement (TAVR) has become a routine alternative procedure for open surgical valve replacement in high-risk patients that shows comparable short- and long-term results [1, 2]. ECG-triggered computed tomography (CT) has become one of the most important imaging modalities for use prior to implantation and provides exact measurements of the aortic root, access paths and fluoroscopic projections in patients with severe aortic stenosis [3, 4]. CT provides more accurate sizing of the aortic annulus than 2D echocardiography and can reduce the rate of paravalvular leakage [5–9].
The number of TAVR procedures has been rising rapidly in recent years. In addition, improvement in the implantation method and in preoperative imaging has resulted in a variety of new parameters, which are measured prior to the procedure, and has resulted in increased effort and prolonged evaluation time. Particularly, accurate measurement of the aortic root, including the sizing of the aortic annulus and of the distance to the ostia, are crucial for the success of the procedure and avoidance of perioperative complications [9–11]. Several studies have described slightly different approaches for evaluating the aortic root [4, 7, 12–14], and recent studies have shown a wide range of inter-observer reliability for these measurements [11, 15, 16].
It was hypothesized that semi-automatic workflows that guide physicians through the evaluation process as well as automatic aortic annulus detection of the aortic annulus plane could reduce the duration of the preoperative analysis and could improve measurement accuracy, particularly for inexperienced observers.
The aim of this study was to evaluate the potential advantage of semi-automatic image data segmentation versus a manual system for experienced and inexperienced observers.
Materials and methods
Patient population
Thirty consecutive, high-risk patients who had been scheduled for TAVR due to severe symptomatic aortic stenosis and received a successful ECG-triggered cardiac CT as a part of standard preoperative planning between the 4th and 22nd of October 2013 were included in our study. The baseline characteristics of the patients are shown in Table 1. Prior to the CT scan, the patients provided written informed consent. The scan protocol was performed according to the statutes of our local ethics committee.
CT protocol
All CT scans were performed using a 128-row-detector, dual-source CT (SOMATOM Definition Flash® Siemens Healthcare, Erlangen, Germany) with a prospectively ECG-triggered, high-pitch technique according to the standard protocol of our facility. The prospective ECG-triggering was set to capture the aortic root in late diastole to avoid movement artifacts. In total, 70 ml (Iomeprol, 400 mg iodine/ml, Bracco Imaging, Konstanz, Germany) of non-ionic, iodinated contrast medium was applied at a flow rate of 3.5 ml/s prior to the scan. After a delay of 6 s, a bolus tracker in the ascending aorta using a threshold of 100 HU was performed and by reaching the threshold the scan in cranio-caudal direction. Scanner parameters are as previously published [17].
Evaluated software
The entire CT image analysis was completed using commercially available, client–server-based software for 3D reading and advanced visualization (syngo.via VA20, Siemens AG, Healthcare, Erlangen, Germany). For the analysis, two separate workflows (“semi-automatic segmentation” and “manual segmentation”) were used. First, a TAVR workflow (“semi-automatic segmentation”; syngo.CT Cardiac Function—Valve Pilot) that included an automatic annulus plane and coronary ostial detection plus an automatic tool for fluoroscopy angulation estimations was used (“semi-automatic segmentation”; Valve Pilot; cf. Fig. 1). Second, a version of the TAVR workflow (“manual segmentation”) that required manual segmentations using MPR reconstructions to measure the aortic root (cf. Fig. 2). In both workflows, automatic detection of the vessel centerlines of the aorta and iliac runoffs was available, as was an automatic 3D volume rendering technique (VRT) tool.
Image analysis
All measurements were performed separately by two radiologists (B.F., C.J.) with 5 years and 1 year of experience in analyzing cardiac CT images. First, all datasets were analyzed using the manual segmentation followed by the semi-automatic software. All cases were evaluated in multiple sessions without a fix order selecting the cases randomly. Both of the examiners were blinded to the results of the other examiner as well as to the results of the manual analysis.
Based on our institutional standards, the duration of the following evaluation steps was estimated: pre-processing, data loading, 3D-VRT and measurements of aorta diameter; the curved multiplanar reconstructions (cMPR) of the aorta and iliac runoffs with diameter measurement and planimetry of the aortic annulus; estimation of the surface- and perimeter-derived effective diameter (ED); measurement of coronary ostial heights; and determination of the aortic annulus angulations for fluoroscopy. The list of these variables with corresponding description is shown in Table 2. Additionally, inter-software and inter-observer reliability for the most critical parameters, such as the surface- and perimeter-derived ED as well as for the ostial heights, were assessed.
Statistical analysis
Statistical analyses were performed using commercially available statistical software (SPSS, v.17, for Windows, IBM Corp., Chicago, IL, USA). All variables were tested with Shapiro–Wilk test to determine the distribution. The continuous quantitative parametric variables were expressed as the means ± standard deviations, and the level of significance was defined as p < 0.05 and tested using Student’s paired t test The null hypothesis was tested using t-distribution. Non-parametric variables were expressed as the median ± quartiles. The null hypothesis was tested using Mann–Whitney U Test or Wilcoxon signed ranks test.
The intra-class correlation coefficient (ICC), which was defined as the ratio of the between-subject variance to the total variance, and Cronbach’s Alpha were determined to estimate the inter-software and inter-observer reliability. The 95 % confidence intervals (CIs) were calculated, and the ICC values and were interpreted as follows: absence of agreement, 0 or less; poor agreement, less than 0.20; fair agreement, 0.21–0.40; moderate agreement, 0.41–0.60; good agreement, 0.61–0.80; and excellent agreement, greater than 0.80 [18]. Bland–Altman plots with 95 % limits of agreement were used to visualize the differences between the results from the analysis software and from the observers [19].
Results
Patient characteristics
All of the patients fulfilled the inclusion criteria (N = 30). The study population consisted of 17 (56.7 %) female and 13 (43.3 %) male patients with an average age of 78.4 ± 6.7 years. All of the patients had a tricuspid native aortic valve, and the mean heart rate during the image acquisition was 79.7 ± 16.8 bpm (ranging from 58 to 104 bpm). No beta-blockers were administered because of clinical contraindications prior to the examination.
Analysis duration
The mean total evaluation times when using semi-automatic segmentation were 520.4 ± 117.6 (median 485.5) s, for the expert and 580.2 ± 162.8 (median 517.0) s for the novice. When using manual segmentation, the expert required 607.8 ± 104.6 (median 592.0) s for the mean total evaluation, and the novice required 693.2 ± 159.5 (median 670.0) s (Fig. 3). The expert was, on average, 59.8 ± 101.4 s (approx. 10.3 %, p ≤ 0.02) faster than the novice when using semi-automatic segmentation and was 85.4 ± 83.8 s (approx. 12.3 %, p ≤ 0.001) faster when using the manual software.
The mean duration of each step, which depended on the software type as well as on the observer, is shown in Fig. 4 and Table 3. Regardless of the level of experience, the three most time-consuming steps were the measurements of aortic diameters, the 3D-VRT and the sizing of the aortic annulus.
Semi-automatic segmentation (Valve Pilot) successfully recognized the aortic annulus plane in 28 (93.3 %) patients. In 4 (13.3 %) of the 28 patients, additional corrections of the aortic annulus plane were necessary, and minor manual corrections of the automatically recognized annulus contour were performed in all of the patients. The automatic recognition of the coronary ostia worked correctly in 30 (100 %) patients. No patients had an abnormal coronary origin, bicuspid aortic valve. All patients presented at least mild valvular calcifications.
Inter-software reliability
The overall inter-software reliability was excellent, and the details are listed in Table 4. The Bland–Altman analysis of the mean inter-software measurements is shown in Fig. 5A (the mean difference: 0.42 mm, upper/lower limit: 2.61–1.76). Two measurements exceeded the levels of agreement.
Expert
The mean differences between the semi-automatic and manual segmentations performed by the expert were 0.5 ± 1.1 mm for the perimeter-derived ED and 0.5 ± 0.5 mm for the area-derived ED. The inter-software reliability was excellent for the perimeter- and surface-area-derived EDs (ICC 0.96, 95 % CI 0.92–0.98/ICC 0.98, 95 % CI 0.87–0.99).
For the expert, the mean LCA and RCA heights did not significantly differ between the semi-automatic and manual measurements (0.5 ± 1.9 mm, p = 0.10/0.7 ± 3.9 mm, p = 0.36). Furthermore, the inter-software reliability was high for the LCA and RCA height measurements (ICC 0.88, 95 % CI 0.74–0.94/ICC 0.91, 95 % CI 0.81–0.96).
Novice
The measurements performed by the novice resulted in mean differences between the semi-automatic and manual segmentations of 0.9 ± 1.4 mm for the perimeter-derived ED and 0.3 ± 1.1 mm for the area-derived ED. The inter-software reliability for the inexperienced observer was slightly lower than that for the experienced observer; however, both were excellent (ICC 0.92, 95 % CI 0.75–0.97/ICC 0.96, 95 % CI 0.91–0.98).
For the novice, the mean LCA and RCA heights for the semi-automatic and manual measurements were not significantly different (0.1 ± 2.4 mm, p = 0.92/0.24 ± 2.3 mm, p = 0.58). The inter-software reliability for the novice was lower than that for the expert; however, both were high (ICC 0.87, 95 % CI 0.73–0.94/ICC 0.88, 95 % CI 0.76–0.95).
Inter-observer reliability
The overall inter-observer reliability was high to excellent; the details are provided in Table 3. The Bland–Altman analysis of the mean inter-observer measurements is shown in Fig. 5B (mean difference: 0.53 mm, upper/lower limit: 1.63/−0.58). One measurement exceeded the levels of agreement.
Comparing both software versions, there were lower mean differences between both observers in measurements of perimeter based ED as well as LCA and RCA ostia heights when using the semi-automatic segmentation. The corresponding mean differences are listed below.
Semi-automatic segmentation
There was a not significant difference between the means for the two observers of 0.1 ± 0.6 mm for the perimeter-derived ED and of 0.02 ± 0.7 mm for the surface-area-derived ED when semi-automatic segmentation was applied. The inter-observer reliability for the perimeter- and surface-area-derived ED was excellent (ICC 0.99, 95 % CI 0.97–0.99/ICC 0.99, 95 % CI 0.97–0.99).
The semi-automatically assessed mean LCA heights between the observers differed significantly, with a mean difference of 0.9 ± 1.5 mm, p = 0.01, whereas the mean difference in the RCA height was not significant (0.9 ± 2.4 mm, p = 0.06). The inter-observer reliability was high, with an ICC of 0.90 for the LCA (95 % CI 0.75–0.96) and an ICC of 0.94 for the RCA (95 % CI 0.87–0.97).
Manual segmentation
The manual segmentation revealed mean differences of 1.0 ± 1.0 mm and 0.8 ± 0.8 mm for the perimeter- and surface-area-derived diameters, respectively. The inter-observer reliabilities for the perimeter- and surface-area-derived EDs were excellent (ICC 0.95, 95 % CI 0.67–0.99/ICC 0.96, 95 % CI 0.64–0.99).
The automatically assessed mean LCA heights differed significantly between the observers, with a mean difference of 1.6 ± 2.0 mm, p < 0.001, whereas there was no significant difference in the mean RCA heights (0.9 ± 1.7 mm, p = 0.01). The inter-observer reliability was high, with an ICC of 0.81 for the LCA (95 % CI 0.39–0.93) and an ICC of 0.91 for the RCA (95 % CI 0.78–0.96).
Discussion
The results of this study show that software using semi-automatic segmentation of the aortic root and recognition of the aortic valve annulus, contours and angulation significantly reduces the evaluation duration, improves the inter-observer reliability, and maintains the accuracy of manual measurements, particularly for examiners with little experience in evaluating CT images prior to TAVR.
Accurate measurements of the aortic root and implantation path dimensions by CT are essential for correct prosthesis sizing and selection of the implantation path to perform successful implantation and avoid complications in patients undergoing TAVR. Aortic root measurements are particularly challenging and time consuming, and with an ever-increasing number of implantations performed worldwide, measurement accuracy and efficiency are more important. Several automatic or semi-automatic tools for the evaluation of CT data prior to TAVR are available. Automatic tools are only useful if they do not compromise the accuracy of the assessments, if they facilitate significant time saving, and if they support the complete range of measurements that are required for the implantation. Additionally, the possibility of manual editing of the measurements is essential, particularly for patients with abnormal anatomical characteristics or strong calcifications.
The software for routine 3D and advanced visualization that were applied in this study allowed all of the measurements that are required for implantation in our facility (Table 2), which correspond to the parameters recommended by Holmes et al. [3]. Converting to the manual mode or editing the automatically assessed findings was always possible. Only the editing of the centerline of the aortic root within the automatic segmentation tool was not possible.
Our study population consisted of patients presenting extended valvular calcifications in most cases, expected to be a challenge for the automatic annulus segmentation. However, automatic recognition of the aortic root failed only in two (6.7 %) patients and in four (13.3 %) patients the aortic annulus plane had to be adjusted manually. Due to heavy calcifications in most cases, minor adjustments of the contours of the aortic annulus were required for all patients. These results correspond to recently published study, which examined the same semi-automatic segmentation software [20]. Using the automatic segmentation, both, the correction of the aortic annulus plane as well as the adjustment of the annulus contours could be done very fast and easily.
According to the duration of each evaluation step, there were some outliers. Most of these outliers were caused by the failure of the automatic vessel segmentation, which was the same in the manual and the semi-automatic software version. Affected were measurements of the aorta and iliac runoffs.
Regarding the inter-software reliability, there were differences between the mean perimeter-based ED measurements. In the Bland–Altman analysis of both software types, there were only two measurements beside the level of agreement showing that both software versions can deliver significantly different values. Furthermore there were lower mean differences between both observers in measurements of perimeter based ED as well as LCA and RCA ostia heights when using the semi-automatic segmentation. This allows the conclusion that the semiautomatic-segmentation is more accurate for the measurements of variables, which are crucial prior to TAVR.
Furthermore in our study, the inter-observer agreement as shown in the ICC analysis was excellent and comparable to those of recently published studies [16, 20, 21]. Although the manual estimation of the diameter of the aortic annulus revealed excellent inter-observer reliability, the reliability when using semi-automatic segmentation was improved, which is beneficial, particularly for the novice examiner. Thus semi-automatic segmentation might improve the reproducibility and help to standardize the evaluation process.
There was a significant saving of evaluation time of approximately 12 % for the novice and of 10 % for the expert when using semi-automatic evaluation. Therefore, semi-automatic evaluation allows more time efficient and accurate estimation of the required measurements in high-risk patients prior to TAVR. In clinical routine due to steady rising numbers of TAVR cases, time saving of couple of minutes per case does get clinically relevant. In our study the novice had already approximately 1 year of experience with the manual evaluation of TAVR cases, thus we conclude, that centers without any experience with TAVR evaluation, might benefit even more from the semi-automatic segmentation. Especially the preset and guided order of the evaluation steps might be advantageous.
One of the major limitations of the semi-automatic software is the semi-automatic segmentation itself. On one hand the semi-automatic segmentation provides a convenient guided way of evaluation of all currently required variables prior to TAVR. On the other hand the automatic segmentation has only limited flexibility, which might become an issue if additional variables become important or existing evaluation steps need to be replaced or even only adjusted. Additionally semi-automatic segmentation delivers results, which always need to be confirmed for their accuracy. Thus semi-automatic segmentation should support manual segmentation tools rather than replacing them.
There are some limitations of this study. This study is a single-center study describing the efficiency of two particular software solutions delivering data that are only comparable I the setting using the same software tools. To confirm our results, further studies with additional software solutions and larger study populations should be performed. Our study population consisted of patients who had datasets with good image quality and who had besides the severe aortic stenosis neither valvular nor coronary anatomical abnormalities, so the accuracy of the semiautomatic approach in patients with anatomical abnormalities could not be evaluated.
Conclusions
TAVR evaluations can be accomplished significantly faster and without compromising accuracy when using semi-automatic compared with manual segmentation, showing a benefit that is independent of the experience level of the examiner.
Abbreviations
- 3D VRT:
-
3D volume-rendering technique
- AS:
-
Aortic valve stenosis
- cMPR:
-
Curved multiplanar reconstruction
- CT:
-
Computed tomography
- ECG:
-
Electrocardiogram
- ED:
-
Effective diameter
- ICC:
-
Intraclass correlation coefficient
- TAVR:
-
Transcatheter aortic valve replacement
References
Leon MB, Smith CR, Mack M et al (2010) Transcatheter aortic-valve implantation for aortic stenosis in patients who cannot undergo surgery. N Engl J Med 363:1597–1607
Kodali SK, Williams MR, Smith CR et al (2012) Two-year outcomes after transcatheter or surgical aortic valve replacement. N Engl J Med 366:1686–1695
Holmes DR Jr, Mack MJ, Kaul S et al (2012) 2012 ACCF/AATS/SCAI/STS expert consensus document on transcatheter aortic valve replacement. J Am Coll Cardiol 59:1200–1254
Lehmkuhl L, Foldyna B, Haensig M et al (2013) Role of preprocedural computed tomography in transcatheter aortic valve implantation. RoFo 184(10):941–949
Wilson AB, Webb JG, Freeman M et al (2012) Computed tomography-based sizing recommendations for transcatheter aortic valve replacement with balloon-expandable valves: comparison with transesophageal echocardiography and rationale for implementation in a prospective trial. J Cardiovasc Comput Tomogr 6:406–414
Jilaihawi H, Kashif M, Fontana G et al (2012) Cross-sectional computed tomographic assessment improves accuracy of aortic annular sizing for transcatheter aortic valve replacement and reduces the incidence of paravalvular aortic regurgitation. J Am Coll Cardiol 59:1275–1286
Lehmkuhl L, Foldyna B, Von Aspern K et al (2013) Inter-individual variance and cardiac cycle dependency of aortic root dimensions and shape as assessed by ECG-gated multi-slice computed tomography in patients with severe aortic stenosis prior to transcatheter aortic valve implantation: is it crucial for correct sizing? Int J Cardiovasc Imaging 29(3):693–703
Kempfert J, Van Linden A, Lehmkuhl L et al (2012) Aortic annulus sizing: echocardiographic vs. computed tomography derived measurements in comparison with direct surgical sizing. Eur J Cardiothorac Surg. doi:10.1093/ejcts/ezs064
Tzikas A, Schultz CJ, Piazza N et al (2011) Assessment of the aortic annulus by multislice computed tomography, contrast aortography, and trans-thoracic echocardiography in patients referred for transcatheter aortic valve implantation. Catheter Cardiovasc Interv 77:868–875
Binder RK, Webb JG, Willson AB, Urena M, Hansson NC, Norgaard BL et al (2013) The impact of integration of a multidetector computed tomography annulus area sizing algorithm on outcomes of transcatheter aortic valve replacement: a prospective, multicenter, controlled trial. J Am Coll Cardiol 62:431–438
Stabile E, Sorropago G, Cioppa A et al (2010) Acute left main obstructions following TAVI. EuroIntervention 6:100–105
Gurvitch R, Webb JG, Yuan R et al (2011) Aortic annulus diameter determination by multidetector computed tomography: reproducibility, applicability, and implications for transcatheter aortic valve implantation. J Am Coll Cardiol Cardiovasc Interv 4:1235–1245
Achenbach S, Delgado V, Hausleiter J, Schoenhagen P, Min JK, Leipsic JA (2012) SCCT expert consensus document on computed tomography imaging before transcatheter aortic valve implantation (TAVI)/transcatheter aortic valve replacement (TAVR). J Cardiovasc Comput Tomogr 6:366–380
Achenbach S, Schuhbäck A, Min JK, Leipsic J (2013) Determination of the aortic annulus plane in CT imaging-a step-by-step approach. J Am Coll Cardiol Imaging 6:275–278
Hutter A, Opitz A, Bleiziffer S et al (2010) Aortic annulus evaluation in transcatheter aortic valve implantation. Catheter Cardiovasc Interv 76:1009–1019
Schuhbaeck A, Achenbach S, Pflederer T et al (2013) Reproducibility of aortic annulus measurements by computed tomography. Eur Radiol. doi:10.1007/s00330-014-3199-5
Lücke C, Foldyna B, Andres C et al (2014) Post-processing in cardiovascular computed tomography: performance of a client server solution versus a stand-alone solution. Fortschr Röntgenstr 186:1111–1121
Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174
Bland JM, Altman DG (1999) Measuring agreement in method comparison studies. Stat Methods Med Res 8:135–160
Lou J, Obuchowski NA, Krishnaswamy A et al (2014) Manual, semiautomated, and fully automated measurement of the aortic annulus for planning of transcatheter aortic valve replacement (TAVR/TAVI): analysis of interchangeability. J Cardiovasc Comput Tomogr 9(1):42–49
Van Linden A, Kemfert J, Blumenstein J et al (2014) Manual versus automatic detection of aortic annulus plane in a computed tomographyscan for transcatheter aortic valve implantation screening. Eur J Cardiothorac Surg 46(2):207–212
Acknowledgments
The authors of this manuscript declare a cooperation contract between the Department of Interventional and Diagnostic Radiology of the Heart Center in Leipzig and the Siemens Company (Siemens AG, Healthcare, Erlangen, Germany), which supported the study and provided the required equipment and software.
Conflict of interest
None.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Foldyna, B., Jungert, C., Luecke, C. et al. CT evaluation prior to transapical aortic valve replacement: semi-automatic versus manual image segmentation. Int J Cardiovasc Imaging 31, 1233–1242 (2015). https://doi.org/10.1007/s10554-015-0662-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10554-015-0662-6