Introduction

Thyroid nodules are very common with a prevalence of greater than 50 % after 60 years, however very few of them represent cancer (5–10 %) [1]. US examination is the preferred first line modality of imaging in evaluating thyroid nodules, providing detection of signs associated with malignant nature, such as microcalcifications, hypoechogenicity, irregular margins or absent halo, intranodular vascularization greater than perinodular, and deeper than wide shape [2]. Even though sensitivity and specificity can be improved by combining these parameters, no single suspicious US feature is sufficiently sensitive or specific for diagnosing malignancy and the overall performance of US is not satisfactory [36]. Currently, fine needle aspiration cytology (FNAC) is the standard procedure to determine malignancy and is the method with the highest specificity (60–98 %), but with a varying sensitivity of 54–90 % [7]. However, FNAC is invasive and about 10–20 % of biopsies yield inadequate results and need repetition.

A classical criterion of malignancy is hardness upon palpation and with the introduction of elastography, an assessment of tissue stiffness has become available. Qualitative and semi-quantitative elastosonography methods have been evaluated in a variety of studies on the thyroid with promising results [812]. A meta-analysis of eight studies, yielded a mean sensitivity of 92 % and mean specificity of 90 % [13]. However, there is a significant heterogeneity of performance of elastosonography in these studies due to the qualitative and semi-quantitative nature of methods. Increased reliance on nodule stiffness quantification is therefore needed to bring about a standardization of diagnostic criteria. Acoustic radiation force impulse technology (ARFI), already used in different fields, has also been applied to evaluate diffuse thyroid diseases, [14, 15], and recently to differentiate thyroid nodules such as in the study performed by Gu et al. [16].

The aim of our study was to assess the effectiveness of ARFI technology in the diagnostic evaluation of thyroid nodules and to assess the intraobserver and interobserver agreement of two operators with different experience as well.

Materials and methods

Patients

This study includes 74 consecutive patients (52 females, 22 males; age range 27–77 years, mean: 41 years) with 82 thyroid nodules examined by two radiologists (with different radiological experience, respectively 10 and 5 years). Informed consent was obtained from all patients and the study was performed in accordance with the ethical guidelines of the Helsinki Declaration and approved by the local ethics committee.

The study period was from September to October 2012. All patients presenting at our Radiology department, for work-up of thyroid nodules were evaluated for inclusion in the study. Inclusion criteria were the presence of a thyroid nodule ≥10 mm, FNAC and/or surgery planned at the time of ultrasound examination and finally performed. Exclusion criteria were nodules less than 1 cm, cystic lesions with little solid component, nodules with conspicuous calcifications, or nodules without final histological or cytological diagnosis. All patients received an ultrasound of the thyroid gland, including color-Doppler ultrasound, followed by ARFI evaluation. Cytology or post-surgical histopathological results were used as standard of reference method for the diagnosis of a benign or malignant thyroid nodule.

FNAC and histopathology

All included patients received either cytology using FNAC and/or histology from thyroid surgery. FNAC was performed with a 25-gauge needle attached to a 20 ml syringe. Adequacy of aspirates was defined according to the guidelines of the Papanicolaou society [17].

Acoustic radiation force impulse (ARFI) imaging

Acoustic radiation force impulse imaging (Virtual Touch™ Tissue Quantification, Siemens ACUSON S2000) involves targeting of an anatomic region to be interrogated for elastic properties with a region-of-interest (ROI) cursor while performing real-time B-mode imaging. The two kinds of imaging methods for ARFI include virtual touch tissue imaging (VTI) and virtual touch tissue quantification (VTQ) (Siemens Medical Solutions, Mountain View, CA, USA). They are used to assess qualitative and quantitative lesion stiffness, displayed on a dual screen with one side showing the B-mode image and the other side showing the elastic image. Patients were in a supine position with a fully extended neck and the measurements were performed with the patient in apnea to avoid movements that result in erroneous measured values. Tissue within a ROI (10 × 5 mm) is mechanically excited using short-duration (262 μs) acoustic pulses. The acoustic pulses generate localized tissue displacements within the ROI. Results are expressed in m/s with a measurement range of 0.5–4.9 m/s. All examinations were performed using a linear probe at 9–12 MHz, optimal for thyroid tissue evaluation. Two radiologists performed five measurements for each nodule obtaining a mean value of the shear wave velocity (denominated as virtual touch tissue quantification value or VTQ value), first with the ROI placed in the normal thyroid gland, (in the same lobe of the nodule, or in the other lobe in case of a too large nodule); thereafter, five measurements with the ROI placed within the thyroid nodule. If the measurement was not reliable, the signature “X.XX” was displayed on the screen and this was usually due to motion or breathing. Measurements in the normal gland were performed at the same depth as that of the nodule. For not altering the measurements, the operators did not include in the scan nodule degeneration phenomena such as macrocalcifications and tensive, diffuse cystic areas. Whereas in the thyroid gland, vascular structures were avoided. Each operator who performed ARFI measurements was blinded to all patients’ clinical data and to the results obtained by the other operator.

Statistical analysis

Data were collected prospectively and recorded by each radiologist in a computerized spreadsheet (Excel, Microsoft). Statistical analysis was carried out using SPSS statistical package, version 13.0 (SPSS Inc., Chicago, IL, USA). All measured data were presented as the mean ± SD. In order to assess the diagnostic performance of ARFI, the median value of the five measurements of the shear wave velocity of normal tissue and thyroid nodules was compared using the receiver operating characteristic (ROC) curve calculation.

Optimal cut-off values were chosen so that the sum of sensitivity and specificity would be the highest. Sensitivity, specificity, positive predictive and negative predictive value of ARFI were calculated. The P < 0.05 was considered statistically significant. Intra- and inter-reader variability for ARFI between the two operators was defined using Cohen’s k test (strength of agreement: <0.2 poor; 0.21–0.40 fair; 0.41–0.6 moderate; 0.61–0.80 good; 0.81–1.00 very good).

Results

The results obtained by histological examination after surgery revealed, on a total of 82 nodules analyzed, the presence of 22 malignant tumors (17 papillary carcinomas; 4 medullary carcinomas; 1 signet cell metastasis obtained after autopsy). The remaining 60 nodules included 48 nodules of benign hyperplasia; 4 inflammatory nodules (2 Graves–Basedow disease and 2 Hashimoto’s thyroiditis); 4 Plummer’s adenomas and 4 hemorrhagic cysts. The size of nodules ranged between 10 and 50 mm (mean 18.9 mm, SD: 10.08 mm) 39 of the 82 nodules were located in the right lobe of the gland; 36 nodules in the left lobe; 7 nodules were sited at the isthmus of the thyroid gland.

Virtual touch tissue quantification analysis

Measurements in the normal gland were performed with a ROI at the same depth as that of the nodule, and the depth of the measurements was between 1.0 and 3.1 cm. In 76 cases the ARFI measurements in the normal gland were performed at the same lobe as that of the nodule and in six cases, due to large dimensions of the nodule that occupied the whole lobe or almost all of it, the measurement of the normal gland was performed on the contralateral lobe.

Nodules that resulted benign at cytology and/or histology, in ARFI, had lower VTQ (shear wave velocity) values, with a mean of 2.141 ± 0.392 m/s, and showed no significant difference compared to the surrounding normal thyroid parenchyma (2.035 ± 0.518 m/s; P = 0.427). Among the benign nodules, the different groups that were formed according to the US features (hypoechoic versus isoechoic versus hyperechoic; vascularity pattern: absent versus perinodular versus peri- and intranodular; regular margin and thin halo versus irregular margins and thick halo), did not have any significant differences in ARFI elastic parameters (P > 0.045).

Whereas, the nodules that resulted malignant at histology had VTQ values with a mean of 3.751 ± 1.485 m/s, which was significantly higher than the value of benign nodules (2.141 ± 0.392 m/s; P < 0.001) and also higher than the value of the surrounding normal thyroid parenchyma (2.106 ± 0.491 m/s; P < 0.001). According to ROC curves (AUROC = 0.86 for Reader 1; 0.81 for Reader 2) sensitivity, and specificity for each reader were respectively 90 and 75 % for a cut-off value of shear wave of 2.455 m/s for Reader 1; 90 and 72 % for a cut-off value of shear wave of 2.365 m/s for Reader 2 (as shown in Figs. 1, 2; Table 1). The PPV and NPV for each reader were respectively 90.91 and 96.55 % for Reader 1; 90 and 95.90 % for Reader 2. Interobserver agreement was in the range considered good (k = 0.75). Intraobserver agreement was in the range considered good for benign nodules (k = 0.65 for more experienced reader and 0.63 for less experienced reader); and in the range considered good for malignant nodules (k = 0.65 for both readers). Figure 3 is an example of a benign nodule, and Fig. 4 of a malignant one.

Fig. 1
figure 1

ROC curves of ARFI for Reader 1 (AUROC = 0.86) (a) and for Reader 2 (AUROC = 0.81) (b) are showed

Fig. 2
figure 2

Values of sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) of ARFI for Reader 1 (a) and for Reader 2 (b) are reported in the histogram

Table 1 The table shows the cut-off of both Readers for discriminating the nature of the nodules and their associated statistic values
Fig. 3
figure 3

Baseline US shows a well marginated, prevalently hyperechoic nodule with hypoechoic capsule (a) which at color-Doppler presented Pattern III and at ARFI 1.89 and 1.56 m/s of shear wave velocity, respectively, for operator 1 (c) and operator 2 (d). Histology after surgery demonstrated that the nodule was benign nodular hyperplasia

Fig. 4
figure 4

Baseline US shows a non-well marginated, prevalently hypoechoic nodule (a) which at color-Doppler presented perilesional with some tiny intralesional vascular signs and at ARFI 2.59 and 3.19 m/s of shear wave velocity, respectively, for operator 1 (c) and operator 2 (d). At FNAC the lesion was classified as Thy 3; histological examination after thyroidectomy demonstrated that the nodule was a papillary carcinoma

Discussion

The thyroid gland is a viable target for elastography. Real-time elastography (RTE) a qualitative method has been evaluated in a number of studies [813, 1820]. A recent meta-analysis performed by Bojunga et al. [13] summarizes the results of several studies reporting a global sensitivity and specificity of 92 and 90 %, respectively, for RTE in the diagnosis of malignant thyroid nodules. Another recent study by Cantisani et al. [8] proved high accurate diagnostic value in the differentiation of thyroid nodules of semiquantitative elastosonography. In spite of good performance reported by these studies, thyroid US elastography has not yet gained widespread clinical use, with one of the major issues being low interobserver and intraobserver agreement. A study evaluating interobserver agreement both in data acquisition and scoring for thyroid US elastography was reported by Park et al. [21]. Three radiologists independently performed elastography data acquisition and scored on 52 malignant thyroid nodules. With external compression elastography, they found no interobserver agreement among the three radiologists. They attributed this lack of interobserver agreement in elastography mainly to the fact that the extent of compression influences both the elastography image and, consequently, elasticity score. Their results showed that the variability in elastography data acquisition is larger than that in scoring. On the other hand however, more recently Lim et al. [22] obtained good interobserver and intraobserver agreement with in vivo compression elastography.

Analyzing the state of the art of thyroid elastography, EFSUMB guidelines [23] (European Federation of Societies of Ultrasound in Medicine and Biology) conclude with the practical recommendations: “Elastography is an additional tool for thyroid lesion differentiation.” but the recommendation follows with: “Based on expert opinion, elastography may be used to guide follow-up of lesions negative for malignancy at FNAC”. Thus the limited use recommended shows the need for improvement of the modality and further thyroid validation.

Surpassing the free-hand compression method of RTE using a standardized acoustic pulse for compression, together with reliance on stiffness quantification could bring about improved performance and reproducibility. These are offered by the recent technologies enabling a quantitative determination of tissue elasticity such as the ARFI technology and shear wave elastography.

The principle of ARFI is that ultrasounds as mechanical waves, require a medium in which to spread. The wave propagation speed depends on the elastic properties of the medium and with the study of sound wave propagation speed in a medium, it is possible to assess the mechanical strain properties of the material [24, 25]. A vibration is transmitted to the tissue and it propagates as longitudinal compressional waves, and also as transverse vibrations known as shear waves. The velocity at which shear waves travel through tissue is proportional to the square root of the shear modulus (another measure of stiffness) of the tissue [26]. The velocity of the shear waves can be tracked by the small tissue displacements they cause using ultrasound methods similar to those used for normal elastography [27]. ARFI offers elasticity images as well as quantitative estimations of stiffness [21]. Integrated into a conventional ultrasonographic system, ARFI imaging can be performed during a standard examination. The target region is selected on a conventional B-mode image, and information is obtained in real time [2830]. The quantification entails estimation of the shear wave velocity, which is measured in meters per second. The stiffer a tissue is, the greater is the shear wave velocity.

ARFI was applied to diffuse thyroid pathology evaluation with good results [14, 15]. Differently from the preliminary experience published by Friederich-Rust et al. [14] who used a curved array probe, more recently a linear probe at 9–12 MHz, optimal for thyroid tissue evaluation, is now available.

Gu et al. reported the first experience using ARFI to differentiate benign and malignant thyroid nodules. They evaluated 98 nodules and achieved sensitivity, specificity, positive predictive value, negative predictive value, and diagnostic accordance rate of 86.36, 93.42, 79.17, 95.95, and 91.84 %, respectively, based on the standard VTQ value (2.555 m/s). In total, 77.6 % (59 of 76) of the benign nodules showed softer and equal images in the VTI mode, and 77.3 % (17 of 22) of the malignant nodules showed stiffer images (P < 0.001). They concluded that ARFI imaging has high sensitivity and specificity in evaluating benign and malignant thyroid nodules and therefore a good diagnostic value in clinical applications [16]. More recently, Friedrich-Rust et al. [14] compared ARFI and RTE to differentiate benign from malignant thyroid nodules. They showed that the combination of RTE with ARFI imaging improved specificity for the diagnosis of malignant nodules from 72 (RTE alone) to 92 % but sensitivity was reduced from 76 to 58 %. Both the techniques achieved an excellent negative predictive value while the two methods together did not improve NPV.

In accordance with Gu et al. we achieved similar results, obtaining sensitivity, specificity, PPV and NPV of ARFI for reader 1 and 2 of 90, 75, 90.9 and 96.5 %, and (cut-off value of shear wave: 2.455 m/s); 90, 72, 90 and 95.90 % (cut-off value shear wave: 2.365 m/s), respectively. In addition, we tested the interobserver variability as well. We obtained good results, with a Cohen’s k value of 0.755, which is consistent with the result of the study of Lim et al. [22] (k = 0.77) and Merino et al. [31] (k = 0.82) using different elastographic techniques.

There were several limitations of this study. This was a pilot study with a limited sample size, especially with regard to malignant lesions and no follicular cancers; thus, the accuracy results are not definitive. Because of the limited patient population, detailed subgroup analysis could not be performed to determine whether specific combinations of CDUS criteria and ARFI can increase the diagnostic performance compared to CDUS alone. In the quantification model, the measure of the ROI is fixed: we used a 10 × 5 mm ROI and it cannot be modified. Nodules with cysts and calcifications had to be excluded due to the impossibility of placing the non-modifiable ROI inside the nodule in a cyst-free and calcifications-free portion of the nodule.

Conclusions

This work is a feasibility study evaluating ARFI imaging. Its results suggest that ARFI imaging is a reproducible method which can be utilized with good diagnostic performance in the thyroid for discriminating benign and malignant nodules using the cut-off value of 2.455 m/s. ARFI imaging of thyroid nodules seems to have the potentiality for differentiating benign nodules from suspicious ones, thus aiding in the decision to proceed to FNAC biopsies with greater confidence and efficiency. Benign nodules at ARFI can be managed via follow-up observations. Larger studies and multicentric ones are needed to validate this method and better define its utility.