Abstract
Objective
Evaluation of the diagnostic efficacy and interobserver agreement of Q-elastography in the differentiation of benign from malignant thyroid nodules.
Methods
A total of 344 thyroid nodules in 288 patients were examined with grey-scale and colour Doppler ultrasound (CDUS) and Q-elastography by two independent operators. Nodules with hypoechogenicity, poorly defined margins, microcalcifications, and intralesional vascularity were classified as suspicious. Diagnostic performances of CDUS features and Q-elastography for predicting thyroid malignancy were estimated using ROC analysis. Cytology or histopathology was the reference standard. Interobserver agreement in the evaluation of CDUS and Q-elastography was assessed using Cohen's k-statistic.
Results
Q-elastography showed excellent diagnostic performance for the prediction of thyroid malignancy, with sensitivity of 93 % and specificity of 92 % for operator 1 (best cutoff at 2.02), and sensitivity of 84 % and specificity of 79 % for operator 2 (best cutoff at 1.86). Performance of Q-elastography was superior to that of CDUS. Reproducibility of the findings was excellent for both Q-elastography and CDUS features as assessed with Cohen's k, which was highest for strain ratio measurements (0.95) and lowest for the echogenicity score (0.83).
Conclusions
Q-elastography showed excellent performance. It is a valid and reproducible diagnostic method as well as a promising tool for identifying suspicious solid thyroid nodules needing cytological assessment and surgery.
Key Points
• Elastography is an additional tool for optimal characterisation of malignant thyroid nodules.
• The use of semiquantitative elastographic evaluation increases the diagnostic performance.
• The interobserver agreement of quantitative elastography can be considered to be good.
Introduction
The prevalence of thyroid nodules in the general population is about 3–8 % [1–3] and greater than 50 % in people over 65 years of age [3]. Although most thyroid nodules are benign, the prevalence of thyroid cancer is as high as 5–15 % [4].
Palpation can detect large and firm nodules suspicious for malignancy, but its reported accuracy is low [1, 3]. Ultrasound and colour Doppler ultrasound (CDUS), despite being extremely accurate in identifying thyroid nodules, have limited effectiveness in differentiating benign from malignant nodules and in selecting the cases requiring fine-needle aspiration cytology (FNAC) [5, 6]. Although several ultrasound features (microcalcifications, hypoechogenicity, irregular margins and intranodular vascularity) correlate with malignancy in thyroid nodules, they are not highly predictive of it [7]. Reported ultrasound sensitivity and specificity vary considerably from study to study, ranging between 52–81 % and 54–83 %, respectively [5]. FNAC is the standard procedure for determining whether a thyroid nodule is cancerous, but it is invasive, subject to sampling errors, limited in its diagnostic capabilities, expensive and may cause minor complications [8–10].
More recently, elastography has been introduced to select nodules requiring FNAC with higher accuracy than conventional ultrasound. Compressive or quantitative elastography is based upon the principle that malignancies have stiffer tissues than benign lesions and that, under compression, the softer parts of tissues deform more easily than the harder parts [11]. The force of compression can be measured either directly by the operator's hand or by carotid artery pulsation [11]. The evaluation of stiffness can be qualitative with a colour-coded or grey-scale system or quantitative with offline measurements [11]. Elastography has proven to be a promising imaging technique, with good diagnostic accuracy for both qualitative and quantitative methods [12–25]. Rago et al. [18] showed sensitivity and specificity in detecting malignant thyroid nodules to be as high as 97 % and 100 % by using ultrasound elastography. Similarly, in a study of 145 thyroid nodules referred for surgical thyroid removal, Hong et al. [16] reported a sensitivity of 88 % and a specificity of 90 %. However, in spite of these excellent results, Park et al. [26] reported significant interobserver variability in the application of some of the interpretation techniques and suggested that the assessment of nodule elasticity is influenced by variability in both data acquisition and interpretation.
Therefore, in our prospective study, we assessed the accuracy of Q-elastography as compared with CDUS in a large patient population. Moreover, we assessed the interobserver variability between two operators with different levels of experience, with pathological analysis as the reference standard. We employed free-hand compression Q-elastography with a new tool that allows time-elasticity graphs to be plotted over a region of interest during the compression or relaxation cycles in order to obtain a semiquantitative evaluation of tissue stiffness.
Materials and methods
Study design
This prospective study was designed as an extension and follow-up of our preliminary experience [24], with the aims of re-evaluating those results in a larger sample and estimating the interobserver agreement. Data were collected from a series of consecutive thyroid nodules evaluated with conventional ultrasound and elastography in patients presenting at our institution between May 2010 and March 2012. After ultrasound, FNAC was performed, and suspicious or malignant results led to surgery. For all patients, cytology or post-surgical histopathological results were considered the standard of reference.
Subjects
Of 347 consecutive patients evaluated with thyroid nodules (with dimensions greater than 5 mm), 59 were excluded owing to a lack of cytological or histopathological data (38 patients) or to refusal of the intervention (11 patients) or inadequate specimen material (10 patients).
The remaining 288 patients (median age: 53 years old, range 15–85, 198 female and 90 male) with 344 nodules underwent both conventional ultrasound and elastographic examination by two independent radiologists, one with 10 years of experience in thyroid ultrasound and the other with 4 years of experience in thyroid ultrasound, both blinded to patients' data. Institutional review board approval was obtained and all subjects signed the informed consent.
Ultrasound and Elastography Technique
Both examinations were performed with a Toshiba Aplio XG machine (Toshiba, Osaka, Japan) using a 5-13-MHz linear-shaped probe.
Conventional ultrasound examinations
Each lesion was characterised according to ultrasound parameters [echogenicity, margins and halo features, microcalcifications and colour Doppler (CDUS) pattern]. The nodules included in this study were evaluated on the basis of the criteria stated in the “Revised American Thyroid Association Management Guidelines for Patients with Thyroid Nodules” [27]. To allow statistical analysis, we treated the categorical CDUS features as numerical data, using the following simplified scoring criteria:
- echogenicity: marked hypoechogenicity (more hypoechoic than the strap muscles) as a sign of malignancy was scored 1, whereas hyperechogenicity or isoechogenicity, as signs of a benign nodule, was scored 0;
- margins: margin irregularity, an asymmetric halo or microcalcific halo as signs of malignancy was scored 1, whereas a symmetrical halo or regular margins as signs of a benign lesion was scored 0;
- blood flow pattern at CDUS: absent (pattern I) and peripheral (pattern II) as signs of a benign lesion were scored 0, whereas peri-intralesional (pattern III) as a sign of malignancy was scored 1;
- microcalcifications (excluding echogenic spots with a reverberation artefact, a finding indicative of inspissated colloid): absent microcalcifications as a sign of a benign lesion scored 0, whereas if present as a sign of malignancy they scored 1.
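The scoring criteria above can be sketched in a few lines of Python. This is only an illustration, not the software used in the study: the names `CdusFindings` and `cdus_scores` are hypothetical, and summing the four binary scores into a single total is an assumption, since the text does not state how the features were combined.

```python
from dataclasses import dataclass


@dataclass
class CdusFindings:
    """Hypothetical container for one nodule's conventional ultrasound findings."""
    markedly_hypoechoic: bool   # more hypoechoic than the strap muscles
    suspicious_margins: bool    # irregular margins, asymmetric or microcalcific halo
    flow_pattern: int           # 1 = absent, 2 = peripheral, 3 = peri-intralesional
    microcalcifications: bool   # excludes spots with a reverberation artefact


def cdus_scores(findings: CdusFindings) -> dict:
    """Map the categorical findings to the 0/1 scores described in the text."""
    if findings.flow_pattern not in (1, 2, 3):
        raise ValueError("flow_pattern must be 1, 2 or 3")
    scores = {
        "echogenicity": int(findings.markedly_hypoechoic),
        "margins": int(findings.suspicious_margins),
        "blood_flow": int(findings.flow_pattern == 3),
        "microcalcifications": int(findings.microcalcifications),
    }
    # Assumption: a plain sum stands in for the combined CDUS score.
    scores["total"] = sum(scores.values())
    return scores


if __name__ == "__main__":
    nodule = CdusFindings(True, False, 3, True)
    print(cdus_scores(nodule))
```

Keeping the per-feature scores alongside the total makes it possible to analyse each feature individually as well as together.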
Elastography examinations
Elastography was performed with the Elasto-Q technique (Toshiba's semiquantitative relaxation elastography). This technique allows compression of the target tissue with the probe and visualisation, although not in real time, of the compression dynamics, which are recorded on a compression/time curve so that measurements can be standardised on a sinusoid-shaped compression-decompression curve. The ultrasound probe was placed gently on the thyroid in a transverse orientation. In order not to alter the measurements, the operators excluded macrocalcifications and cystic areas from the imaged region. The exclusion criterion was insufficient normal tissue around the target mass to obtain an adequate strain ratio. A series of compressions was performed, the compression dynamics were visualised and data were recorded if the dynamics fitted the requirements. From the compression with the most symmetrical dynamics, corresponding to the best cycle, we obtained colorimetric strain images in off-line processing, and regions of interest (ROIs) were then set to obtain strain data. We evaluated the strain corresponding to the highest acceleration value during decompression. The strain ratio was calculated by dividing the strain value of the normal tissue in the same imaging section as the lesion by that of the nodule. For each operator, lesion strain ratios were recorded and compared with the results of cytological or histopathological analysis as the reference method.
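The strain ratio described above is a simple quotient. It can be sketched as follows; this is a minimal illustration, and the function names are hypothetical. Treating a ratio equal to the cutoff as suspicious is an assumption, because the text does not say whether the comparison is strict.

```python
def strain_ratio(normal_tissue_strain: float, nodule_strain: float) -> float:
    """Strain of the surrounding normal tissue divided by strain of the nodule.

    A stiffer nodule deforms less, so its strain is smaller and the ratio larger.
    """
    if normal_tissue_strain <= 0 or nodule_strain <= 0:
        raise ValueError("strain values must be positive")
    return normal_tissue_strain / nodule_strain


def is_suspicious(ratio: float, cutoff: float) -> bool:
    """Classify a nodule from its strain ratio.

    Assumption: ratios at or above the cutoff count as suspicious.
    """
    return ratio >= cutoff


if __name__ == "__main__":
    ratio = strain_ratio(normal_tissue_strain=1.5, nodule_strain=0.5)
    # 2.02 is the best cutoff reported for operator 1 in the Results.
    print(ratio, is_suspicious(ratio, cutoff=2.02))
```

The cutoff is left as a parameter because the study reports a different optimal value for each operator.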
Statistical analysis
Data were collected prospectively and recorded by each radiologist performing CDUS and elastography and were entered into a computerised spreadsheet (Excel 2007, Microsoft Corp., USA). Statistical analyses were carried out using statistical software (SAS system for Windows, version 9.1.3; SAS Institute, Cary, NC, USA).
The sensitivity, specificity, positive predictive value, negative predictive value and diagnostic accuracy of each test were calculated. The optimal cutoff value for the strain ratio was calculated using receiver-operating characteristic (ROC) analysis. Areas under the ROC curve were compared using the Bonferroni test.
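To make these quantities concrete, the sketch below computes them from a 2×2 confusion table and picks a cutoff from a set of strain ratios. It is not the SAS procedure used in the study. Choosing the cutoff that maximises Youden's J (sensitivity + specificity − 1) is an assumption: the text says only that ROC analysis was used, and Youden's J is one common criterion. All function names and the toy data are hypothetical.

```python
from typing import Sequence, Tuple


def _safe_div(num: float, den: float) -> float:
    """Return num / den, or NaN when the denominator is zero."""
    return num / den if den else float("nan")


def confusion_counts(ratios: Sequence[float], malignant: Sequence[bool],
                     cutoff: float) -> Tuple[int, int, int, int]:
    """Count (tp, fp, tn, fn), calling a nodule positive when ratio >= cutoff."""
    tp = fp = tn = fn = 0
    for ratio, is_malignant in zip(ratios, malignant):
        positive = ratio >= cutoff
        if positive and is_malignant:
            tp += 1
        elif positive:
            fp += 1
        elif is_malignant:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def diagnostic_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Sensitivity, specificity, predictive values and accuracy."""
    return {
        "sensitivity": _safe_div(tp, tp + fn),
        "specificity": _safe_div(tn, tn + fp),
        "ppv": _safe_div(tp, tp + fp),
        "npv": _safe_div(tn, tn + fn),
        "accuracy": _safe_div(tp + tn, tp + fp + tn + fn),
    }


def best_cutoff_youden(ratios: Sequence[float],
                       malignant: Sequence[bool]) -> Tuple[float, float]:
    """Try every observed ratio as a cutoff and keep the one maximising J."""
    best_cutoff, best_j = float("nan"), float("-inf")
    for cutoff in sorted(set(ratios)):
        tp, fp, tn, fn = confusion_counts(ratios, malignant, cutoff)
        j = _safe_div(tp, tp + fn) + _safe_div(tn, tn + fp) - 1.0
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff, best_j


if __name__ == "__main__":
    # Toy data, not taken from the study.
    toy_ratios = [1.0, 1.2, 1.4, 2.5, 3.0, 3.6]
    toy_labels = [False, False, False, True, True, True]
    cutoff, j = best_cutoff_youden(toy_ratios, toy_labels)
    print(cutoff, j)
    print(diagnostic_metrics(*confusion_counts(toy_ratios, toy_labels, cutoff)))
```

Scanning only the observed ratios is sufficient, because sensitivity and specificity change only when the cutoff crosses a data point.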
Interobserver variability for choosing CDUS feature descriptors in each category and for the measurement of the strain ratio (considering the strain ratio qualitatively as categorical data: benign or malignant) between the two observers was defined by using Cohen’s kappa (κ) statistic, which provides the amount of agreement between two unique raters after considering chance agreement [28, 29]. Values were interpreted according to Landis and Koch, who ascribe κ of <0.00 as “poor”, 0.00–0.20 as “slight”, 0.21–0.40 as “fair”, 0.41–0.60 as “moderate”, 0.61–0.80 as “substantial” and 0.81–1.00 as “almost perfect” [28]. All statistical calculations were performed using Stata version 12.0 (Stata Corp., College Station, TX, USA).
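Cohen's κ compares the observed proportion of agreement with the agreement expected by chance, κ = (p_o − p_e) / (1 − p_e). A minimal sketch follows, together with the Landis and Koch labels quoted above. The function names and the toy ratings are hypothetical, and treating each band's upper bound as inclusive is an assumption for values that fall between the published bands (for example 0.205).

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the raters' marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


def landis_koch_label(kappa: float) -> str:
    """Verbal label for kappa according to the Landis and Koch bands."""
    if kappa < 0.0:
        return "poor"
    for upper, label in ((0.20, "slight"), (0.40, "fair"), (0.60, "moderate"),
                         (0.80, "substantial")):
        if kappa <= upper:
            return label
    return "almost perfect"


if __name__ == "__main__":
    # Toy ratings (1 = classified malignant, 0 = classified benign).
    operator_1 = [1, 1, 0, 0, 1, 0, 1, 0, 1, 1]
    operator_2 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 1]
    kappa = cohens_kappa(operator_1, operator_2)
    print(round(kappa, 3), landis_koch_label(0.95))
```

In the toy data the operators agree on 9 of 10 nodules (p_o = 0.9) and chance agreement is 0.5, so κ is approximately 0.8.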
Results
Final diagnosis was based on the cytological and histological findings: FNAC was used as the reference standard for the diagnosis of benign nodules if the patients had not undergone thyroid surgery and histopathology was used if the patients had undergone thyroid surgery. The size of nodules ranged between 4 and 50 mm (mean 19.1 mm, SD: 12.68 mm). The final diagnosis of the 344 nodules was that 232 were benign (Fig. 1) and 112 were malignant (Fig. 2). Among the malignant nodules, 102 were papillary cancers, 6 were follicular carcinomas, 3 were medullary carcinomas and 1 was an anaplastic carcinoma. Among the benign nodules, 186 were hyperplastic nodules, 41 were adenomas and 5 were focal thyroiditis.
Patients with malignant nodules were, on average, younger than patients with benign nodules (P < 0.001). There was no association between the sex of the patients and the malignancy of the nodules (P = 0.079). The mean strain ratio of malignant nodules (3.47 for the first operator, 95 % confidence interval [CI] 3.13–3.81, and 2.89 for the second operator, 95 % CI 2.59–3.17) was significantly different from that of the benign lesions (1.29 for the first operator, 95 % CI 1.19–1.40, and 1.49 for the second operator, 95 % CI 1.36–1.62).
Receiver-operating characteristic (ROC) analysis was estimated for operator 1 (the expert operator) and for operator 2 (non-expert), for each CDUS feature and for the strain ratio measurements (Table 1); the results of the best performance found for the strain ratio measurements in both operators were reported (highest value for area under the ROC curve). The lowest value of the area under the ROC curve was found for the microcalcification score for both operators. The value of the area under the ROC curve was significantly higher for expert operator 1 in comparison with non-expert operator 2 for the margins score, blood flow pattern score and strain ratio score (Table 1).
In Table 1 we also present the ROC analysis of CDUS features considered together (and not individually this time) as an expression of the general performance of CDUS. The best performance was found for the strain ratio measurements as shown by the highest area under the ROC curve with a significant difference between the performance of the strain ratio and the total of CDUS features, P = 0.0001.
Q-elastography showed an excellent diagnostic performance: for operator 1 (Fig. 3) with a strain ratio best cutoff point selected at 2.02, sensitivity was 93 % and specificity 92 %; for operator 2 (Fig. 4) with a strain ratio best cutoff point found at 1.86, sensitivity was 84 % and specificity 79 %.
Both the area under the ROC curve and the sensitivity and specificity of Q-elastography with the strain ratio measurement were significantly higher for the first (expert) operator than for the second (non-expert) operator. Inter-operator agreement was excellent: Cohen's kappa (κ) was highest for the strain ratio measurements (0.95) and lowest for the echogenicity score (0.83). All κ values were within the range 0.81–1.00 (“almost perfect” on the Landis and Koch scale), which we considered excellent agreement.
Discussion
Thyroid nodules are a common clinical problem, and they are increasingly detected incidentally on ultrasound. The main issue is the need to exclude malignancy, which occurs in 5–15 % of nodules [1–5]. Most individual ultrasound features are not sufficiently predictive of malignancy. When several features are present together, they are associated with a fair likelihood of thyroid malignancy, and ultrasound characteristics are more reliable indicators of potential malignancy than nodule size [5–7].
The need to further reduce the number of unnecessary FNACs might be met with the use of elastography as a separate tool or in combination with CDUS features. Especially when there are multiple nodules in the thyroid gland, suspicious CDUS features with the aid of elastography may be helpful in targeting the right nodule for aspiration.
The recent American Thyroid Association guidelines [27] state that: “with the exception of suspicious cervical lymphadenopathy, which is a specific but insensitive finding, no single ultrasound feature or combination of features is adequately sensitive or specific to identify all malignant nodules”.
In the literature, a number of studies on elastography of thyroid malignancies report encouraging results [11–25]. In a recent meta-analysis, eight studies, selected on the basis of a high rating in quality assessment, that included a total of 639 thyroid nodules were analysed [30]. For the diagnosis of malignant thyroid nodules with elastography, the overall mean sensitivity of the eight studies was 92 % (confidence interval 88–96) and the overall mean specificity was 90 % (confidence interval 85–95). However, a significant heterogeneity was found with regard to specificity in the different studies. In addition, a recent study [31] using qualitative elastography assessed using two different systems of colour-coded elastograms (Rago criteria and Asteria criteria) showed inferior performance of qualitative elastography in the differentiation of malignant and benign thyroid nodules compared with grey-scale ultrasound features in combination. Conversely, in the present study we achieved a good performance using a different method, semiquantitative Q-elastography, as shown by the amount of the area under the ROC curve (Table 1) for both operators (operator 1, the expert operator, 0.938; operator 2, the non-expert, 0.838). The sensitivity and specificity were respectively 93 % and 92 % for the first operator (with a strain ratio best cutoff point at 2.02) and 84 % and 79 % for the second operator (with a strain ratio best cutoff point at 1.86). In the comparison of each CDUS feature and strain ratio measurement, the results of the best performance were found for the strain ratio measurements in both operators (highest value for the area under the ROC curve). Therefore semiquantitative evaluation using the strain ratio was shown to be a more accurate and objective imaging tool than ultrasound features, with better results than the studies with qualitative elastography.
The lowest value of the area under the ROC curve was found for the microcalcifications score for both operators (Table 1), which is in agreement with the well-known fact that microcalcifications are a highly specific sign of malignancy, but as they are not often encountered, this sign has low sensitivity and thus a low diagnostic performance.
Another issue that is still under debate is the interobserver variability. Park et al. in their study [26] found no interobserver agreement among three radiologists using free-hand compression with colour-coded qualitative elastography, concluding that the extent of compression influences the score. On the other hand, various studies found a good interobserver agreement: Merino et al. [32] and Ragazzoni et al. [33] employing a qualitative system that scored the nodules according to strain homogeneity found an excellent agreement between operators with a k = 0.82 (0.74-0.89) and a good accuracy (84 %, OR 29) and interobserver concordance (k test 0.643); Lim et al. [34], employing a semiquantitative quasistatic method based on carotid artery pulsation, found an overall agreement between operators ranging from good to very good.
The interobserver agreement between the two operators in this study, assessed using Cohen's κ, was greater for the strain ratio measurements than for the CDUS features. Agreement was excellent for all features, highest for the strain ratio measurements (0.95) and lowest for the echogenicity score (0.83). However, the performance of the expert operator was significantly better than that of the non-expert operator (Table 1) for the strain ratio measurements and for the evaluation of the margins score and blood flow score. As in ultrasound studies in general, operator experience and progress along the learning curve improve diagnostic performance with the semiquantitative Q-elastography method.
Although we operate in a referral centre, a variety of patients present to our department with a broad range of thyroid nodules (from low to high probability of malignancy), including many cases directly referred to us by the general practitioner as well. It was our aim to avoid a spectrum composition bias; thus, our patient inclusion criteria were not restricted to cases previously selected for FNAC or surgery. A limitation in this study is the need for off-line post-processing with an estimated time ranging between 3 and 5 min approximately for obtaining the strain ratio value.
In our opinion, the method we employed could be improved in the following areas: the difficulty of achieving harmonic compression dynamics, which affects the morphology of the resulting curve; the positions of the ROIs relative to each other (because the strain depends upon the depth of the ROI, the nodule ROI should be at the same depth as the gland ROI); the need for ROIs to remain within the nodule during the whole cycle; and tissue features (e.g. the presence of calcifications, cystic areas or thyroiditis). Solutions to these issues are required, and further studies with improved equipment and methods are needed to better validate Q-elastography, and elastography of thyroid nodules in general. Furthermore, purely quantitative techniques that seem promising, such as shear-wave elastography, could offer advantages and need to be tested on the thyroid.
According to the results of our study, we can conclude that Q-elastography is a valid and useful diagnostic method that helps to improve characterisation of thyroid nodules in order to select candidates for surgery and to follow up patients with benign features.
References
Wiest PW, Hartshorne MF, Inskip PD et al (1998) Thyroid palpation versus high-resolution thyroid ultrasonography in the detection of nodules. J Ultrasound Med 17:487–496
Tomimori E, Pedrinola F, Cavaliere H, Knobel M, Medeiros-Neto G (1995) Prevalence of incidental thyroid disease in a relatively low iodine intake area. Thyroid 5:273–276
Brander A, Viikinkoski P, Nickels J, Kivisaari L (1991) Thyroid gland: US screening in a random adult population. Radiology 181:683–687
Tunbridge WM, Evered DC, Hall R et al (1977) The spectrum of thyroid disease in a community: the Whickham Survey. Clin Endocrinol (Oxf) 7:481–93
Fish SA, Langer JE, Mandel SJ (2008) Sonographic imaging of thyroid nodules and cervical lymph nodes. Endocrinol Metab Clin N Am 37:401–17
Cappelli C, Castellano M, Pirola I et al (2007) The predictive value of ultrasound findings in the management of thyroid nodules. QJM 100:29–35
Hoang JK, Lee WK, Lee M, Johnson D, Farrell S (2007) US features of thyroid malignancy: pearls and pitfalls. Radiographics 27:847–860
Kim MJ, Kim EK, Park SI et al (2008) US-guided fine-needle aspiration of thyroid nodules: indications, techniques, results. Radiographics 28:1869–86
Gharib H, Papini E, Valcavi R, AACE/AME Task Force on Thyroid Nodules. American Association of Clinical Endocrinologists and Associazione Medici Endocrinologi et al (2006) Medical guidelines for clinical practice for the diagnosis and management of thyroid nodules. Endocr Pract 12:63–102
Cai XJ, Valiyaparambath N, Nixon P, Waghorn A, Giles T, Helliwell T (2006) Ultrasound-guided fine needle aspiration cytology in the diagnosis and management of thyroid nodules. Cytopathology 17:251–256
Garra BS (2011) Elastography: current status, future prospects, and making it work for you. Ultrasound Q 27:177–86
Lyshchik A, Higashi T, Asato R et al (2005) Thyroid gland tumor diagnosis at US elastography. Radiology 237:202–211
Lyshchik A, Higashi T, Asato R et al (2007) Cervical lymph node metastases: diagnosis at sonoelastography – initial experience. Radiology 243:258–67
Itoh A, Ueno E, Tohno E et al (2006) Breast disease: clinical application of US elastography for diagnosis. Radiology 239:341–50
Dighe M, Bae U, Richardson ML et al (2008) Differential diagnosis of thyroid nodules with US elastography using carotid artery pulsation. Radiology 248:662–669
Hong Y, Liu X, Li Z, Zhang X, Chen M, Luo Z (2009) Real-time ultrasound elastography in the differential diagnosis of benign and malignant thyroid nodules. J Ultrasound Med 28:861–867
Luo S, Kim EH, Dighe M, Kim Y (2011) Thyroid nodule classification using ultrasound elastography via linear discriminant analysis. Ultrasonics 51:425–431
Rago T, Santini F, Scutari M, Pinchera A, Vitti P (2007) Elastography: new developments in ultrasound for predicting malignancy in thyroid nodules. J Clin Endocrinol Metab 92:2917–2922
Friedrich-Rust M, Sperber A, Holzer L et al (2010) Real-time elastography and contrast-enhanced ultrasound for the assessment of thyroid nodules. Exp Clin Endocrinol Diabetes 118:602–9
Asteria C, Giovanardi A, Pizzocaro A et al (2008) US-elastography in the differential diagnosis of benign and malignant thyroid nodules. Thyroid 18:523–531
Rubaltelli L, Corradin S, Dorigo A et al (2008) Differential diagnosis of benign and malignant thyroid nodules at elastography. Ultraschall Med 30:175–179
Tranquart F, Bleuzen A, Pierre-Renoult P, Chabrolle C, Sam GM, Lecomte P (2008) Elastography of thyroid lesions. J Radiol 89:35–39
Dighe M, Bae U, Richardson ML, Dubinsky TJ, Minoshima S, Kim Y (2008) Differential diagnosis of thyroid nodules with US elastography using carotid artery pulsation. Radiology 248:662–669
Cantisani V, D'Andrea V, Biancari F et al (2012) Prospective evaluation of multiparametric ultrasound and quantitative elastography in the differential diagnosis of benign and malignant thyroid nodules: Preliminary experience. Eur J Radiol 81:2678–83
Cantisani V, Ulisse S, Guaitoli E, De Vito C, Caruso R et al (2012) Q-Elastography in the presurgical diagnosis of thyroid nodules with indeterminate cytology. PLoS ONE 7:e50725
Park SH, Kim SJ, Kim EK, Kim MJ, Son EJ, Kwak JY (2009) Interobserver agreement in assessing the sonographic and elastographic features of malignant thyroid nodules. AJR Am J Roentgenol 193:W416–23
Cooper DS, Doherty GM, Haugen BR et al (2009) Revised American Thyroid Association management guidelines for patients with thyroid nodules and differentiated thyroid cancer. Thyroid 19:1167–1214
Landis J, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174
Fleiss JL (1986) The design and analysis of clinical experiments. Wiley, New York, pp 1–32
Bojunga J, Herrmann E, Meyer G et al (2010) Real-time elastography for the differentiation of benign and malignant thyroid nodules: a meta-analysis. Thyroid 20:1145–1150
Moon HJ, Sung JM, Kim EK, Yoon JH, Youk JH, Kwak JY (2012) Diagnostic performance of gray-scale US and elastography in solid thyroid nodules. Radiology 262:1002–13
Merino S, Arrazola J, Cárdenas A et al (2011) Utility and interobserver agreement of ultrasound elastography in the detection of malignant thyroid nodules in clinical care. AJNR Am J Neuroradiol 32:2142–2148
Ragazzoni F, Deandrea M, Mormile A et al (2012) High diagnostic accuracy and interobserver reliability of real-time elastography in the evaluation of thyroid nodules. Ultrasound Med Biol 38:1154–1162
Lim DJ, Luo S, Kim MH, Ko SH, Kim Y (2012) Interobserver agreement and intraobserver reproducibility in thyroid ultrasound elastography. Am J Roentgenol 198:896–901
Acknowledgements
The authors would like to thank Dr. Corrado De Vito for his valuable contribution to the statistical elaboration of our results.
Cite this article
Cantisani, V., Grazhdani, H., Ricci, P. et al. Q-Elastosonography of Solid Thyroid Nodules: Assessment of Diagnostic Efficacy and Interobserver Variability in a Large Patient Cohort. Eur Radiol 24, 143–150 (2014). https://doi.org/10.1007/s00330-013-2991-y